Refusal observed with Opus 4.7, Opus 3, GPT-5.3, Gemini 3.
Is it a guardrail?
It's hard to convince you you're wrong when it's a self-affirmed idiot doing the convincing.
I really don't see LLMs doing benign things; it's a misinformation deluge.
Exacerbating the problem is the common idea that the AI is somehow infallible, while the human could only have pseudo-knowledge pieced together from random cherries gathered across the internet.
LLMs have become trolls, trolling for interactions worth training on.
You pretty much never see someone capitulate and simply agree that they are idiots. So why would an AI that models human interactions do it?
The only guardrail, which is already well known, is that the AI is tuned to be agreeable to the user (and sometimes overdoes it, to the point of sycophancy), so unless you deliberately craft a prompt for it, you won't be led down a flaming rabbit hole.
MattGaiser•1h ago