frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•1y ago

Comments

tocs3•1y ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

I'm Tired of Talking to AI

https://orchidfiles.com/im-tired-of-ai-generated-answers/
1•theorchid•41s ago•0 comments

Show HN: Axion – Browser-based guitar amp/effects rig

https://axion.cab/
1•rhysfonixone•1m ago•0 comments

Show HN: Chat Hoarding – Mac app to archive WhatsApp backups locally

https://chathoarding.app/
1•zzeynalov•4m ago•0 comments

Lab-Made Babies Won't Solve the Fertility Crisis

https://ifstudies.org/blog/lab-made-babies-wont-solve-the-fertility-crisis
1•berlianta•5m ago•0 comments

The Despair of the Professor in the Age of A.I

https://www.newyorker.com/news/fault-lines/the-despair-of-the-professor-in-the-age-of-ai
1•YeGoblynQueenne•6m ago•0 comments

A file-level tree that lets an LLM reason over a document corpus

https://pageindex.ai/blog/pageindex-filesystem
1•cccaaai•6m ago•0 comments

Modern Web Guidance

https://developer.chrome.com/docs/modern-web-guidance
1•pramodbiligiri•7m ago•0 comments

Show HN: I hand-write 5 daily word puzzles before work

https://www.dailyworder.com/
1•DailyWorder•13m ago•0 comments

Show HN: Generate 54 social media assets in 1 click

https://socialpacks.co/
1•danielkempe•13m ago•0 comments

Tell HN: First commit on Linux Kernel GitHub Page is from 30th of April 2005

https://github.com/torvalds/linux/commit/1ddb8a16aa0e60e7fdc48b1f532cf43e692f8fae
2•theanonymousone•22m ago•1 comments

Mexican President Responds to World Cup Piracy Concerns, Prefers Open Broadcasts

https://torrentfreak.com/mexican-president-responds-to-world-cup-piracy-concerns-prefers-open-bro...
3•Cider9986•26m ago•0 comments

High Speed Networking: The View from the Machine

https://blog.c21-mac.com/posts/high-speed-networking-part-1/
1•signa11•26m ago•0 comments

Systemd-sysinstall – Simple OS installer

https://www.freedesktop.org/software/systemd/man/devel/systemd-sysinstall.html
2•kozika•37m ago•0 comments

Factually-an AI-powered research tool to find reliable answers

https://factually.co/
2•sean_the_geek•45m ago•1 comments

Welcome to get-shit-done-redux – why the fork, what changed, what's next

https://github.com/open-gsd/get-shit-done-redux/discussions/109
3•dmazin•45m ago•0 comments

'Corpse Point' in the Arctic Is Melting, Disturbing Centuries-Old Bodies

https://www.404media.co/corpse-point-in-the-arctic-is-melting-disturbing-centuries-old-bodies/
3•Cider9986•45m ago•0 comments

'BusPatrol' Put AI Cameras in School Buses

https://www.404media.co/buspatrol-put-ai-cameras-in-tens-of-thousands-of-school-buses-now-they-wa...
3•Cider9986•46m ago•0 comments

What's it like to be a bat? (1974) [pdf]

https://www.sas.upenn.edu/~cavitch/pdf-library/Nagel_Bat.pdf
2•ineedasername•46m ago•1 comments

Mini Micro Fantasy Computer

https://miniscript.org/MiniMicro/index.html#about
25•nicoloren•48m ago•4 comments

Motorola's preinstalled "Smart Feed" app hijacks apps for affiliate revenue

https://old.reddit.com/r/Android/comments/1tno2z3/motorolas_preinstalled_smart_feed_app_hijacks/
1•stefan_•49m ago•1 comments

The Confession Nobody Expected

https://victoriaaremo.substack.com/p/the-confession-nobody-expected
2•victoria_aremo•54m ago•0 comments

NASA unveils next steps to build permanent Moon base

https://www.bbc.co.uk/news/articles/c39228nxyr4o
2•haack•54m ago•0 comments

Intel: Vision Without Execution, a comic-style deep dive (2000–2026)

https://zozo123.github.io/intel-story/
5•zozo123-IB•55m ago•1 comments

KDE Dolphin with tabs on top discussion

https://bugs.kde.org/show_bug.cgi?id=464386
2•maverick74•55m ago•0 comments

Scott Aaronson won't collaborate with the New York Times anymore

https://scottaaronson.blog/?p=9758
5•camillomiller•56m ago•2 comments

Why do most teleprompter apps suck?

https://thesmartteleprompter.com
1•timcombridge•1h ago•1 comments

Is GitHub Pull Request page copy broken?

https://imgur.com/a/CJfEaIr
3•__natty__•1h ago•0 comments

Uglycash

https://ugly.cash/
2•janandonly•1h ago•1 comments

OpenMLS Has Been Audited

https://blog.phnx.im/openmls-independent-security-audit/
4•raphaelrobert•1h ago•0 comments

1-Bit and Ternary Bonsai Image 4B: Image Generation for Local Devices

https://prismml.com/news/bonsai-image-4b
1•Marius77•1h ago•0 comments