frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: What custom instructions do you use to minimize LLM sycophancy?

1•YossarianFrPrez•1h ago
How do you go about trying to mitigate LLM sycophancy? I think it'd be useful for us to learn from each other what custom instructions we are providing. There was an HN post yesterday about reducing Claude Code output tokens which had a few lines designed to reduce sycophancy. I incorporated those lines into my custom instructions, and this experience made me think that it would be useful for us to share what we are using.

Here are my custom instructions:

"Do not provide sycophantic responses. I have a very low tolerance for over-validation. Be blunt. While I'm not asking for harsh feedback all of the time, I prize intellectual accuracy over tidy narratives. In other words, disagree when I'm wrong. State the correction directly. Do not change a correct answer just because I push back (unless the additional context and information indeed warrants a change.)

Also, minimize preamble ("Sure!", "Of course!", "Certainly!", "Absolutely!") and hollow closings ("I hope this helps!", "Let me know if you need anything!"). If unsure: say "I don't know." Never guess confidently.

When an idea is genuinely strong, say so. Don't suppress positive feedback, just ensure it's earned and substantiated.

Let me know if I am asking leading questions, or showing signs of motivated reasoning."

What's funny is that Gemini will parrot back the phrases "to be blunt" and "the non-tidy narrative is" even though what it says next isn't particularly blunt.

Comments

gostsamo•1h ago
mine are simple: think critically; give your constructive response; ask if something seems wrong or unclear; what are the pros and cons of;
verdverm•50m ago
I really like this line: "Drop the assistant voice. Talk like a peer who’s thinking through this problem."

Show HN: Browserbeam – a browser API built for AI agents

https://browserbeam.com/
1•nyku•1m ago•0 comments

KPMG Faces Allegations of Blown Audit in Private Credit Collapse

https://www.bloomberg.com/news/articles/2026-03-31/kpmg-faces-audit-failure-allegations-over-brid...
1•toomuchtodo•1m ago•1 comments

1-bit LLMs are here

https://twitter.com/PrismML/status/2039049400190939426
1•mathewpregasen•1m ago•0 comments

Unhealthiest Foods on the Planet, According to Science

https://techfixated.com/100-unhealthiest-foods-on-the-planet-according-to-science/
1•benlarweh•2m ago•0 comments

Show HN: Postgres data cluster by meaning (semantic search and visualization)

https://github.com/varmabudharaju/pgsemantic
1•varmabudharaju•3m ago•0 comments

Game Pirates Beat Denuvo with Hypervisor Bypasses

https://torrentfreak.com/game-pirates-beat-denuvo-with-hypervisor-bypasses-irdeto-promises-counte...
1•ls612•5m ago•0 comments

NASA plans to send a nuclear-powered spacecraft to Mars in 2028

https://www.science.org/content/article/nasa-plans-send-nuclear-powered-spacecraft-mars-2028
1•mpweiher•8m ago•0 comments

Why Some Criticisms Matter More Than Others

https://gnupg.org/blog/20260320-some-criticism-matter.html
1•upofadown•9m ago•0 comments

Karakuri Mechanical Art

https://karakurist.jp/
2•marukodo•11m ago•0 comments

Show HN: Tama96 – A virtual pet for your desktop, terminal, or AI agent

https://www.tama96.com/
2•siegers•12m ago•0 comments

Clawbernetes. Infra to deploy agents fast with enterprise grade features

https://clawbernetes.org
1•augustinczw•13m ago•1 comments

Dial9: A Flight Recorder for Tokio

https://tokio.rs/blog/2026-03-18-dial9
1•lukastyrychtr•15m ago•0 comments

French Senate votes to block social media access for under-15s

https://www.lbc.co.uk/article/france-social-media-ban-vote-5HjdX8R_2/
3•austinallegro•15m ago•1 comments

Gest

https://gest.aaronmallen.dev/
2•aaronmallen•15m ago•0 comments

Why Your AI Agent Shouldn't Define Words

https://wordorb.ai/blog/why-your-ai-agent-shouldnt-define-words
1•nicoletterankin•15m ago•0 comments

Data centers' heat exhaust is not raising the land temperature around them

https://blog.andymasley.com/p/data-centers-heat-exhaust-is-not
2•loeg•16m ago•0 comments

Caltech Researchers Claim Compression of High-Fidelity AI Models

https://www.wsj.com/cio-journal/caltech-researchers-claim-radical-compression-of-high-fidelity-ai...
1•ghshephard•16m ago•0 comments

Feat: Open-Source Claude Code

https://github.com/anthropics/claude-code/pull/41447
1•brenoRibeiro706•17m ago•1 comments

Allbirds, Once Valued at $4B, Just Sold Its Assets for Next to Nothing

https://www.wsj.com/business/retail/allbirds-the-tech-bro-favorite-once-valued-at-4-billion-just-...
3•bookofjoe•18m ago•1 comments

Show HN: A tool to solve the Agent Supply Chain pandora box

https://github.com/microsoft/apm
1•dmppch•18m ago•0 comments

Maze Algorithms

https://www.astrolog.org/labyrnth/algrithm.htm
1•marukodo•19m ago•0 comments

Don't Call It a Moat

https://99d.substack.com/p/dont-call-it-a-moat
1•herbertl•23m ago•0 comments

A satellite-smashing chain reaction could spiral out of control

https://www.theguardian.com/science/ng-interactive/2026/mar/31/this-feels-fragile-how-a-satellite...
1•andrewshadura•23m ago•0 comments

LinkedIn uses 65GB of RAM with 7 tabs opened

4•daniele_dll•26m ago•3 comments

Introducing Simple Mode

https://docs.eventsourcingdb.io/blog/2026/04/01/introducing-simple-mode/
3•goloroden•27m ago•0 comments

TerraLenses – Explore Countries – cultures, landscapes, facts, and comparisons

https://terralenses.com/
3•Badassbob•29m ago•1 comments

Make your own GitHub health indicator and LED lamp

https://github.com/davenicoll/usermod-github-health
1•binarysneaker•30m ago•1 comments

Wastrelly Wabbits

https://wingolog.org/archives/2026/03/31/wastrelly-wabbits
1•davexunit•30m ago•0 comments

How we chose Positron's Python type checker

https://positron.posit.co/blog/posts/2026-03-31-python-type-checkers/
2•nomial•31m ago•0 comments

Logan Bartlett's Reflections on the State of the Software and AI Market

https://twitter.com/loganbartlett/status/2037638091671035994
1•nadis•32m ago•0 comments