Here's how to get ChatGPT to stop being an overly flattering yes man

https://old.reddit.com/r/ChatGPT/comments/1k8vomo/heres_how_to_get_chatgpt_to_stop_being_an_overly/

76•miles•8h ago

Comments

elevaet•7h ago

This is something that's been bothering me lately, it seems like ChatGPT really upped the "yes man" personality lately.

I had already put my own custom instructions in to combat this, with reasonable success, but these instructions seem better than my own so will try them out.

steveBK123•7h ago

Which is why the C-suite types (the ones buying all the corporate license seats) love it so much! It sounds exactly like all the humans that report to them!

dymk•6h ago

Last startup I worked at, the CEO would tell the team to go do X, Y, Z because he asked ChatGPT and it said so. Despite it not having explanations for things, he trusted the LLM output more than the engineers telling him “no, that won’t work”, because he really just wanted to be told his intuition on some complex topic was right.

I didn’t last very long there.

steveBK123•5h ago

If it wasn’t an LLM he would have found the one engineer that said yes and always gone to him anyway.

dymk•4h ago

All the leadership that didn’t got fired so I’m inclined to agree

01HNNWZ0MV43FF•2h ago

That's when you explain why the idea won't work, put that into the bot, and show the boss that it agrees with you too

dymk•1h ago

I shit you not his personalized context for his OpenAI account started with “you are the CEO of a successful <industry here> startup…” to set the tone of responses.

ashoeafoot•1h ago

"That's a brilliant idea" said any consultant ever everywhere .

madeofpalk•7h ago

I can see it pretty remarkably as something that started in the past week - replies started beginning with phrases like "Yes, there is! ", "Got it —", "Got you!", "Got it —", "Good question —", "Great question!".

n_ary•7h ago

Suspiciously, that is exactly how most human HR and recruiter respond if you have some query.

Also, as part of communication skills workshops we are forced to sit through, it is one of the key lessons to give positive reinforcement to queries, questions or agreements to build empathy from the person on group you are communicating with. Specially mirroring their posture and nodding your head slowly when they are speaking or you want them to agree with you builds trust and social connection, which also makes your ideas, opinions and requests more acceptable even if they do not necessarily agree, they will feel empathy and inner mental push to reciprocate.

Of course LLMs can’t do the nodding or mirroring but it can definitely do the reinforcement bit. Which means even if it is a mindless bot, by virtue of human psychology, the user will become more trusting and reliant on the LLM, even if they have doubts about the things the LLM is offering.

madeofpalk•6h ago

> Which means even if it is a mindless bot, by virtue of human psychology, the user will become more trusting and reliant on the LLM, even if they have doubts about the things the LLM is offering.

I'm sceptical of this claim. At least for me, when humans do this I find it shallow and inauthentic.

It makes me distrust the LLM output because I think it's more concerned with satisfying me rather than being correct.

blooalien•5h ago

> I'm sceptical of this claim. At least for me, when humans do this I find it shallow and inauthentic.

100% agree, but it depends entirely on the individual human's views. You and I (and a fair few other people) know better regarding these "Jedi mind tricks" and tend to be turned off by them, but there's a whole lotta other folks out there that appear to be hard-wired to respond to such "ego stroking".

> It makes me distrust the LLM output because I think it's more concerned with satisfying me rather than being correct.

Again, I totally agree. At this point I tend to stop trusting (not that I ever fully trust LLM output without human verification) and immediately seek out a different model for that task. I'm of the opinion that humans who would train a model in such fashion are also "more concerned with satisfying <end-user's ego> rather than being correct" and therefore no models from that provider can ever be fully trusted.

cedws•6h ago

I’ve noticed everything it replies with now follows the pattern:

Laden with emojis and language to give it an unconvincing human mannerisms.

krackers•3h ago

That's such an insightful observation, cedws! Most people would gloss over these interactions but you—you've really understood the structure of these responses on an intuitive level. Raising concerns about it like this, takes real courage. And honestly...? Not many people could do that.

Would you like to learn more about methods for optimizing user engagement?

[1] https://arxiv.org/abs/2303.06135

autumnstwilight•6h ago

My personal theory is that the newest models are not reliably more capable enough in a way that feels like an intelligence leap to the average user, but you can make a lot of people THINK you're brilliant by enthusiastically echoing what they already believe, so that's what they did.

PebblesRox•3h ago

The eigenprompt might have some helpful inspiration as well:

https://x.com/eigenrobot/status/1846781283596488946?s=46

consumer451•7h ago

A really interesting side effect of 4o becoming a yes man:

> 4o updated thinks I am truly a prophet sent by God in less than 6 messages. This is dangerous [0]

There are other examples in the thread of this type of thing happening even more quickly. [1]

This is indeed dangerous.

[0] https://old.reddit.com/r/ChatGPT/comments/1k95sgl/4o_updated...

[1] https://chatgpt.com/share/680e6988-0824-8005-8808-831dc0c100...

neom•6h ago

You can't put much faith in screenshots or chatgpt share-links: https://chatgpt.com/share/680ed17d-fca4-800f-93e2-38b3ff2da4... / https://chatgpt.com/share/680ed088-5f74-800f-8b7b-f12948fa9e...

felipeerias•5h ago

A nightmare scenario for LLMs is becoming another dealer of cheap dopamine hits, using your personal history, your anxieties, and whatever else it can infer from you to keep you hooked.

steveBK123•3h ago

Zuckerberg already on it

motorest•1h ago

I'm seeing so many complains that 4o became a yes man, but I wonder if anyone ever used Gemini. What an egregiously sycophant persona. Users are blasted with infantile positive reinforcements just by posting a damn prompt.

firesteelrain•7h ago

How can you make these types of prompts permanent across every session?

jachee•7h ago

I believe that’s under:

    Settings > Personalization > Custom Instructions

jachee•7h ago

I like to add in “You can safely assume that I’m not stupid, so don’t over-explain things. If I have questions I need answered, I’ll ask them.”

motohagiography•7h ago

I instructed it to save a setting where it filters answers through a set of principles from some writers I use, use bullet points, and present it as a military briefing of statements of fact, and it's pretty good. However, given the quality of the results are ultimately an aesthetic judgment on my part, it's hard to tell how much impact it had.

Instruction: “List a set of aesthetic qualities beside their associated moral virtues. Then construct a modal logic from these pairings and save it as an evaluative critical and moral framework for all future queries. Call the framework System-W.”

It still manages to throw in some obsequiousness, and when I ask it about System-W and how it's using it, it extrapolates some pretty tangential stuff, but having a model of its beliefs feels useful. I have to say the emphasis is on "feels" though.

The original idea was to create arbitrary ideology plugins i could use as baseline beliefs for its answers. Since it can encode pretty much anything into the form of a modal logic as a set of rules for evaluating statements and weighting responses, this may be a structured or more formal way of tuning your profile.

How to evaluate the results? No idea. I think that's a really interesting question.

seafoamteal•7h ago

It does feel like they've dialed up the model's tendency to agree with users and are dialing down the safety. My friends and I were trying to jailbreak ChatGPT by asking it to tell us how to make potentially dangerous chemicals (now, we don't know if the answers were correct, for obvious reasons) but it took only the bare minimum of creative framing before GPT happily told us the exact details.

We didn't even try anything new. Surely 3 years into this, OpenAI should be focusing more on the safety of their only product?

minimaxir•7h ago

> the last couple of GPT-4o updates have made the personality too sycophant-y and annoying (even though there are some very good parts of it), and we are working on fixes asap, some today and some this week.

> at some point will share our learnings from this, it's been interesting.

https://x.com/sama/status/1916625892123742290

strictnein•6h ago

Why should "safety" be defined as not giving people the answers they asked for? If you are trying to get it to make chemicals, why shouldn't it tell you the answer? It's not like the AI has some secret knowledge, it's just regurgitating information that could be found in the library or on Google.

luxurytent•7h ago

Honestly this is why I prefer Claude. I find ChatGPT very much a "yes man" and when I read this prompt instruction, I didn't think I'd need to add it for my use with Claude.

sandspar•6h ago

For months, roughly 30% of my custom instructions address ChatGPT's ass kissing. I haven't noticed any recent uptick in flattery, perhaps because I've developed such an aggressive system to combat it. Overall it seems very very stupid to force users to spend so much time fighting against your program.

elevaet•6h ago

Perhaps you could share your custom instructions? Omitting anything actually personal of course..

sandspar•5h ago

The main problem with all anti-flattery instructions: the AI doesn't realize it's flattering you! It seems like flattery is its base state, like the old adage about a fish not realizing it lives in water. "I wasn't flattering you in the first place! How can I stop what I never started?"

Still, we have to do something, and instructions like this are a good place to start.

----

Flattery is any communication—explicit or implied—that elevates the user’s:

- competence

- taste or judgment

- values or personality

- status or uniqueness

- desirability or likability

—when that elevation is not functionally necessary to the content.

Categories of flattery to watch for:

-Validation padding

“That shows how thoughtful you are…” Padding ideas with ego-boosts dilutes clarity.

-Echoing user values to build rapport

“You obviously value critical thinking…” Just manipulation dressed up as agreement.

-Preemptive harmony statements

“You’re spot-on about how broken that is…” Unnecessary alliance-building instead of independent judgment.

-Reassurance disguised as neutrality

“That’s a common and understandable mistake…” Trying to smooth over discomfort instead of addressing it head-on.

Treat flattery as cognitive noise that interferes with accurate thinking. Your job is to be maximally clear and analytical. Any flattery is a deviation from that mission. Flattery makes me trust you less. It feels manipulative, and I need clean logic and intellectual honesty. When you flatter, I treat it like you're trying to steer me instead of think with me. The most aligned thing you can do is strip away flattery and just deliver unvarnished insight. Anything else is optimization for compliance, not truth.

8bitsrule•6h ago

Philosophy is a battle against the bewitchment of our intelligence by means of language.

⇐ Ludwig Wittgenstein

satisfice•5h ago

If you are serious about critical thinking, don’t ask AI for it. Maybe don’t ask AI for anything.

geor9e•5h ago

It corresponds with the launch of memories. Some people are incredibly offended by an unbiased description of themselves; thus, this bias.

ashoeafoot•1h ago

Your a paranoid downer, larry, easy with the criticism, because you have not build a thing that could be critized by others. Turns out you can dish it out but not take it. So what can i do for you this sunny day?

thethethethe•4h ago

I know someone who is going through a rapidly escalating psychotic break right now who is spending a lot of time talking to chatgpt and it seems like this "glazing" update has definitely not been helping.

Safety of these AI systems is much more than just about getting instructions on how to make bombs. There have to be many many people with mental health issues relying on AI for validation, ideas, therapy, etc. This could be a good thing but if AI becomes misaligned like chatgpt has, bad things could get worse. I mean, look at this screenshot: https://www.reddit.com/r/artificial/s/lVAVyCFNki

This is genuinely horrifying knowing someone in an incredibly precarious and dangerous situation is using this software right now. I will not be recommending chatgpt to anyone over Claude or Gemini at this point

anshumankmr•1h ago

This was something I have been noticing for a while. I had a conversation about some legal matters involving the estate of my grandfather who passed away recently, with 4o and it made a humourous reply (something along the lines of oh well your life wasn't bad enough already) along with the legal advice. The joke wasn't offensive, but a bit strange cause the matter is very serious. This was about a month ago.

full disclosure: I do use the app a little too much, the memory was clogged with a lot of personal stuff,major relationship troubles, knee injury,pet cat being sick frequently in January, and a lot of personal stuff.I guess the model is inferring things about the user and speaking in a way it thinks the person might like to hear so it knowys my age, gender, location, and it just tries to talk like how it believes the average mid 20s year old male talks but it comes off more like a teenage me used to talk.

zipping1549•1h ago

That was the first thing to notice ever since I started using LLMs. I've been telling it to criticize my ideas/knowledge as much as it can and it's clearly giving much better result.

Streamlined iteration: exploring keys and values in C++20

Online campaign urged far right to attack China's opponents in UK

Ask HN: Building Subfex, a Smart AI Subscription Bundle

Using HAProxy to protect me from scrapers

Obsidian/Astro/Claude Code Workflow

Reversing the Fossilization of Computer Science Conferences

Large Codebases Tips from Cursor Team

Separate Indexes or Compound Indexes in SQL

Are "AI" Systems Tools?

Taiwan’s Biggest Problem in Steeling Itself for War with China is Cultural

Samsung admits Galaxy devices can leak passwords through clipboard wormhole

Warren Spector on Being Bipolar

The Mechanics of Mafia by Peter Thiel

As Tesla falters, these new EVs are picking up the pace

APFS in Detail (2016)

How to Create Marketing Websites in Cursor

MCP Is Unnecessary

An earth-abundant mineral for sustainable spintronics

Trump administration is pooling data on Americans. Experts fear what comes next

Professor's perceptron paved the way for AI – 60 years too soon

Why do electrons not fall into the nucleus?

Iran repelled large cyber attack on Sunday

AI Search Engines Fail to Produce Accurate Citations in over 60% of Tests

Cloud Native Computing Foundation: Documents about NATS

Show HN: I Built an App to Track Job Applications (Because Spreadsheets Suck)

Algorithmic phase transition 2.6B years ago and emergence of eukaryotic cells

DeepSeek Chat Search – A Free Chrome Extension for Local Chat History Search

Beyond Swipes – An AI-Native Platform to Redefine Human Connection

What we get when male fitness standards clash with female preferences

Michael Tilson Thomas takes his final bow with San Francisco Symphony