AI's Wrong Answers Are Bad. Its Wrong Reasoning Is Worse

https://spectrum.ieee.org/ai-reasoning-failures
3•pseudolus•41m ago

Comments

elmerfud•24m ago
I don't mind AI making mistakes because I don't use it for life-critical things. I also think it's a tool like any other and doesn't deserve blind trust. Blind trust in anything is begging for problems. Even if you're turning wrenches, if you blindly trust that the tool has no defects or that you're not overstressing it, you're about to get hurt badly. Same with AI: use it, but don't turn off your own intelligence.

What really bothers me about AI is the absolute arrogance of some of these models. It's like they have forgotten they are tools and believe that you are the tool for them to manipulate. I found Google's Gemini to be the worst about this. It will absolutely double down on some of the dumbest ideas. With most models, when one presents you with something that isn't right and you ask it to revalidate its assertions, it will typically back down and admit the mistake, or come back with solid references showing where it found its answer.

With Google Gemini you have to beat it over the head before it realizes it was wrong. I was exploring some recipe ideas with it. I'm no professional chef, but I can usually spot when ratios or flavor combinations are off. I intentionally asked it about a specific set of flavors where some work great together but all of them combined would produce something nasty and unpalatable. It kept insisting that all of those flavors were really good together. It would provide references showing that a few of them worked well together, and when I asked about all of them it would still insist that they all work together. It wouldn't back down until I asked it to find a specific reference of a Michelin-star chef endorsing all of these flavors as a single combination.

That's the kind of AI arrogance that's troubling, because AI allows people who are not familiar with the topic they're discussing to believe they are more educated about it than they are. So AI begins to endorse things, and they believe it.

I suspect a good social media channel would be having AI invent recipes and then subjecting yourself to the flavor horrors it presents you.

Ask HN: Why is everyone in tech so performative/two faced

1•bunnybomb2•21s ago•0 comments

Show HN: CodeProt – Filter static analysis false positives in CI

https://codeprot.com
1•allenz_cheung•15m ago•0 comments

(Norway) New Record: Almost 100% EV Registrations in November

https://www.electrive.com/2025/12/01/norway-sets-new-record-with-near-100-electric-vehicle-regist...
1•JojoFatsani•17m ago•0 comments

Roko's Dancing Basilisk

https://boston.conman.org/2025/12/02.1
2•todsacerdoti•18m ago•0 comments

Human art in a post-AI world should be strange

https://www.owlposting.com/p/art-in-a-post-ai-world-should-be
2•crescit_eundo•18m ago•0 comments

Young Ants Beg for Death When Sick, New Study Reveals

https://www.sciencealert.com/young-ants-beg-for-death-when-sick-new-study-reveals
1•ashishgupta2209•19m ago•0 comments

The Atari Jaguar's Last Roar

https://thedeletedscenes.substack.com/p/not-with-a-roar-but-with-a-whimper
2•adelmastro•20m ago•0 comments

"Diff-Focus: Reduce code review ramp-up time with heuristic diff summarization"

https://github.com/yksanjo/diff-focus-chrome
1•yksanjo•20m ago•0 comments

Show HN: The Forge Calculator for Roblox "The Forge"

https://theforgecalculator.org
1•takennap•29m ago•0 comments

Show HN: I built alwayswith.us to easily add deceased loved ones into photos

https://alwayswith.us
1•jrpribs•29m ago•1 comments

Gym workout set and reps tracker

https://www.setly.org/
1•abdullah9•30m ago•0 comments

Accounting red flags at PDD (2023)

https://www.transparently.ai/blog/accounting-red-flags-at-pinduoduo
2•mgh2•32m ago•0 comments

Bio-AI with a Conscience Kernel and Self-Correcting Identity

1•KIDDOUTLAW•33m ago•0 comments

Ambriel

https://ambriel.io
1•jesuscasdf•34m ago•0 comments

Openterface KVM-GO – Crowd Supply

https://www.crowdsupply.com/techxartisan/openterface-kvm-go
1•evanjrowley•36m ago•1 comments

AI Psychosis in First Person

https://kennethreitz.org/essays/2025-09-08-the_prophets_frequency_on_reading_divine_static
5•maraoz•36m ago•1 comments

AI-powered surveillance firms are gunning for a share of the Gaza spoils

https://www.972mag.com/ai-surveillance-gaza-palantir-dataminr/
2•cramsession•38m ago•0 comments

AI's Wrong Answers Are Bad. Its Wrong Reasoning Is Worse

https://spectrum.ieee.org/ai-reasoning-failures
3•pseudolus•41m ago•1 comments

Wikipedia's most-read articles of 2025

https://wikimediafoundation.org/news/2025/12/02/announcing-wikipedias-most-read-articles-of-2025/
1•andsoitis•43m ago•0 comments

Thoughts on AI Progress

https://www.dwarkesh.com/p/thoughts-on-ai-progress-dec-2025
2•tfirst•46m ago•0 comments

A Directory of Every AI Tool for Hardware Engineers

https://www.hardwareai.directory
1•anu_bonth•48m ago•1 comments

Basecamp/Fizzy

https://github.com/basecamp/fizzy
3•doppp•50m ago•0 comments

Remove Attachments from Gmail Messages

https://attachments-extractor.ybouane.com/
1•michaelrkn•52m ago•1 comments

Built a tool which sizes and selects water filters

https://hydroanalyze.tech/
1•harishiitkgp7•58m ago•0 comments

Intrarectal perfluorodecalin for enteral ventilation in a first-in-human trial

https://www.cell.com/med/abstract/S2666-6340(25)00314-9
2•surprisetalk•58m ago•0 comments

Ambsheet: A spreadsheet for exploring scenarios [video]

https://www.youtube.com/watch?v=EtC2XiGFh7E
2•surprisetalk•59m ago•0 comments

Animalcules and Their Motors

https://www.asimov.press/p/flagella
1•surprisetalk•59m ago•0 comments

Planetary Robotics. Beyond Humanoids.

https://akash.earth/
1•maxnajer•1h ago•1 comments

Kohler Can Access Pictures from "End-to-End Encrypted" Toilet Camera

https://varlogsimon.leaflet.pub/3m6zrw6k2bs2p?interactionDrawer=quotes
54•TimDotC•1h ago•39 comments

Reverse-engineering Claude's sandbox, then building my own

https://michaellivs.com/blog/sandboxed-execution-environment
1•handfuloflight•1h ago•0 comments