frontpage.

How do you go about trying to mitigate LLM sycophancy? I think it'd be useful for us to learn from each other what custom instructions we are providing. There was an HN post yesterday about reducing Claude Code output tokens which had a few lines designed to reduce sycophancy. I incorporated those lines into my custom instructions, and this experience made me think that it would be useful for us to share what we are using.

Here are my custom instructions:

"Do not provide sycophantic responses. I have a very low tolerance for over-validation. Be blunt. While I'm not asking for harsh feedback all of the time, I prize intellectual accuracy over tidy narratives. In other words, disagree when I'm wrong. State the correction directly. Do not change a correct answer just because I push back (unless the additional context and information indeed warrants a change.)

Also, minimize preamble ("Sure!", "Of course!", "Certainly!", "Absolutely!") and hollow closings ("I hope this helps!", "Let me know if you need anything!"). If unsure: say "I don't know." Never guess confidently.

When an idea is genuinely strong, say so. Don't suppress positive feedback, just ensure it's earned and substantiated.

Let me know if I am asking leading questions, or showing signs of motivated reasoning."

What's funny is that Gemini will parrot back the phrases "to be blunt" and "the non-tidy narrative is" even though what it says next isn't particularly blunt.

Show HN: Browserbeam – a browser API built for AI agents

KPMG Faces Allegations of Blown Audit in Private Credit Collapse

1-bit LLMs are here

Unhealthiest Foods on the Planet, According to Science

Show HN: Postgres data cluster by meaning (semantic search and visualization)

Game Pirates Beat Denuvo with Hypervisor Bypasses

NASA plans to send a nuclear-powered spacecraft to Mars in 2028

Why Some Criticisms Matter More Than Others

Karakuri Mechanical Art

Show HN: Tama96 – A virtual pet for your desktop, terminal, or AI agent

Clawbernetes. Infra to deploy agents fast with enterprise grade features

Dial9: A Flight Recorder for Tokio

French Senate votes to block social media access for under-15s

Gest

Why Your AI Agent Shouldn't Define Words

Data centers' heat exhaust is not raising the land temperature around them

Caltech Researchers Claim Compression of High-Fidelity AI Models

Feat: Open-Source Claude Code

Allbirds, Once Valued at $4B, Just Sold Its Assets for Next to Nothing

Show HN: A tool to solve the Agent Supply Chain pandora box

Maze Algorithms

Don't Call It a Moat

A satellite-smashing chain reaction could spiral out of control

LinkedIn uses 65GB of RAM with 7 tabs opened

Introducing Simple Mode

TerraLenses – Explore Countries – cultures, landscapes, facts, and comparisons

Make your own GitHub health indicator and LED lamp

Wastrelly Wabbits

How we chose Positron's Python type checker

Logan Bartlett's Reflections on the State of the Software and AI Market

Ask HN: What custom instructions do you use to minimize LLM sycophancy?

Comments