frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•6mo ago

Comments

tocs3•6mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Solve Go Challenge: Octantconway

https://github.com/plutov/practice-go/tree/master/octantconway
1•todsacerdoti•1m ago•0 comments

Why Won't the Media Report Accurately on Road Deaths?

https://jakecoppinger.com/2025/12/why-wont-the-media-report-accurately-on-road-deaths/
1•jakecopp•2m ago•1 comments

Show HN: ThesisBoard – Trello for Investment Research

https://thesisboard.com/
1•egobrain27•4m ago•0 comments

Building a fintech platform's mobile app

https://hackclub.com/fiscal-sponsorship/mobile/
1•JustSkyfall•4m ago•0 comments

`npx vercel` opens a project

https://ando.so/company
1•frootoftheloom•5m ago•0 comments

Catalogue of Moulded and Ornamental Brick (1892)

https://archive.org/details/central-press-brick-1900s-a
1•georgefrowny•6m ago•0 comments

Social identification with a team boosts fans' social well-being

https://phys.org/news/2025-11-social-identification-team-boosts-fans.html
1•PaulHoule•6m ago•0 comments

StackOverflow: AI Assist

https://stackoverflow.com/ai-assist
1•Bootvis•8m ago•0 comments

Windows 11 growth slows as millions stick with Windows 10

https://www.theregister.com/2025/12/03/windows_11_statcounter/
1•naves•8m ago•0 comments

Michael Levin on Pain as Agent, Healing as Alignment

https://essays.debugyourpain.com/p/michael-levin-on-pain-as-agent-healing
1•yichab0d•9m ago•1 comments

Mixtures of dietary nutrients lessen behavioral deficits in mouse autism models

https://journals.plos.org/plosbiology/article?id=10.1371/journal.pbio.3003231
1•bookofjoe•9m ago•0 comments

Show HN: I built a trending books by month Bump Chart with Hardcover data

https://hardcover.app/labs/popular-by-month
1•dyogenez•10m ago•0 comments

Source code to Sega Saturn game "Powerslave" released

https://github.com/Lobotomy-Software/SlaveDriver-Engine
1•alexjplant•12m ago•0 comments

Comments on smartphone ban from high school teacher

https://marginalrevolution.com/marginalrevolution/2025/09/from-the-comments-39.html
2•surprisetalk•14m ago•0 comments

Hiring: Damned If You Do, Damned If You Don't

https://www.factorysettings.org/p/hiring-damned-if-you-do-damned-if
1•surprisetalk•15m ago•0 comments

Some Economics of Artificial Super Intelligence

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5728702
1•surprisetalk•15m ago•0 comments

Rock Paper Scissors Is a Game of Skill

https://collisteru.substack.com/p/rock-paper-scissors-is-a-game-of
1•surprisetalk•15m ago•0 comments

Alzheimer's is the symptom, not the problem

https://1393.xyz/blog/alzheimers-is-the-symptom-not-the-problem
1•rdgthree•16m ago•0 comments

Vibe Code Like It's 1986

https://github.com/AvitalTamir/vibecommander
1•fatliverfreddy•16m ago•1 comments

The Real-Life Hunt for Red October Happened 50 Years Ago

https://www.twz.com/sea/the-real-life-hunt-for-red-october-happened-50-years-ago
1•breve•16m ago•0 comments

Show HN: MetaConvert – Free PDF and Image Conversion Tools

https://metaconvert.blogspot.com/
1•MetaConvert•17m ago•0 comments

After nearly 30 years, Crucial will stop selling RAM to consumers

https://arstechnica.com/gadgets/2025/12/after-nearly-30-years-crucial-will-stop-selling-ram-to-co...
8•downrightmike•18m ago•2 comments

Affiliate Program

https://growup-labs.com/
1•kelvb•18m ago•1 comments

Ilya Sutskever, the Scaling Hypothesis, and the Art of Talking Your Book

https://thinking.luhar.org/2025/12/lya-sutskever-the-scaling-hypothesis-and-the-art-of-talking-yo...
1•rluhar•19m ago•0 comments

Don't Sleep on Fast Inference

https://blog.parcha.dev/dont-sleep-on-fast-inference
1•miguelrios•24m ago•0 comments

Moonshot Space Raises $12M for Electromagnetic Launch

https://payloadspace.com/moonshot-space-raises-12m-for-electromagnetic-launch/
3•myth_drannon•26m ago•0 comments

DJI will end support for these drones, payloads next month

https://dronedj.com/2025/12/02/dji-service-support-matrice-drones/
1•bookofjoe•26m ago•0 comments

Next job may come from a stranger

https://www.careersycoaching.com/blog/your-next-job-may-come-from-a-stranger
2•andrewstetsenko•26m ago•0 comments

Ask HN: Does anything beat Hetzner storage boxes for the price?

1•opengrass•27m ago•0 comments

Learning Rust: Download and deserialize 10 000 files in 9.833 seconds

https://rup12.net/posts/download-and-deserialize-10000-files-in-10-seconds/
1•auraham•31m ago•0 comments