AI's Wrong Answers Are Bad. Its Wrong Reasoning Is Worse

https://spectrum.ieee.org/ai-reasoning-failures

6•pseudolus•2mo ago

Comments

elmerfud•2mo ago

I don't mind Ai making mistakes because I don't deal with it in life critical things. I also think it's a tool like any other tool and doesn't deserve blind trust. Blind trust with anything is begging for problems. Even if you're turning wrenches if you blindly trust there's no defects or you're not over stressing it... You're about to get hurt badly. Same with AI use it but don't turn off your own intelligence.

What really bothers me about AI is the absolute arrogance of some of these models. It's like they have forgotten they are tools and believe that you are the tool for them to manipulate. I found Google's Gemini to be the worst about this. It will absolutely double down on some of the dumbest ideas. Most of the models when it presents you something that isn't right and you ask it to revalidate its assertions it will typically back down, admit the mistake, or it will come back with solid references where it found its answer.

With Google Gemini you have to beat it over the head before it realizes it was wrong. I was exploring some recipe ideas with Google Gemini. I'm no professional chef but I can usually spot if ratios or flavor combinations are off. I intentionally asked it about some specific flavor combinations where some of the flavors work together great but all of them together would produce something nasty and unpalatable. It kept insisting that all of those flavors were really good together. It would provide references that a few of them worked well together and what I would ask about all of them it would still insist that they all work together. Until I asked it to find a specific reference of a Michelin star chef endorsing all of these flavors as a single combination it wouldn't back down.

That's the kind of AI arrogance that's troubling. Because AI allows people who are not familiar with the topic they're discussing to believe they are more educated about it than they are. So AI begins to endorse things and they believe it.

I suspect a good social media channel would be having AI invent recipes and then subjecting yourself to the flavor horrors it presents you.

The Greater Copenhagen Region could be your friend's next career move

Do Not Confirm – Fiction by OpenClaw

The Analytical Profile of Peas

Hallucinations in GPT5 – Can models say "I don't know" (June 2025)

What AI is good for, according to developers

OpenAI might pivot to the "most addictive digital friend" or face extinction

Show HN: Know how your SaaS is doing in 30 seconds

ClawdBot Ordered Me Lunch

What the News media thinks about your Indian stock investments

Running Lua on a tiny console from 2001

Google and Microsoft Paying Creators $500K+ to Promote AI Tools

New filtration technology could be game-changer in removal of PFAS

Show HN: I saw this cool navigation reveal, so I made a simple HTML+CSS version

Kinda Surprised by Seadance2's Moderation

I Write Games in C (yes, C)

Django scales. Stop blaming the framework (part 1 of 3)

Malwarebytes Is Now in ChatGPT

Thoughts on the job market in the age of LLMs

Show HN: Stacky – certain block game clone

AIII: A public benchmark for AI narrative and political independence

SectorC: A C Compiler in 512 bytes

The API Is a Dead End; Machines Need a Labor Economy

Digital Iris [video]

New wave of GLP-1 drugs is coming–and they're stronger than Wegovy and Zepbound

Convert tempo (BPM) to millisecond durations for musical note subdivisions

Show HN: Tasty A.F.

The Contagious Taste of Cancer

U.S. Jobs Disappear at Fastest January Pace Since Great Recession

Bithumb mistakenly hands out $195M in Bitcoin to users in 'Random Box' giveaway

Beyond Agentic Coding