frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•11mo ago

Comments

tocs3•11mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

These Volcanoes Are Undead

https://nautil.us/when-extinct-volcanoes-reawaken-1280213
1•kristenfrench•5m ago•1 comments

Sam Altman Wants to Know Whether You're Human

https://www.theatlantic.com/newsletters/2026/04/sam-altman-bots-world-id/686950/
1•JumpCrisscross•5m ago•0 comments

Honda CEO says 'we have no chance' against Chinese automakers

https://finance.yahoo.com/sectors/energy/articles/honda-ceo-says-no-chance-020000235.html
1•thelastgallon•5m ago•0 comments

Why Trump wants to spend $1B on Great Salt Lake

https://www.npr.org/2026/04/24/nx-s1-5746844/why-trump-wants-to-spend-1-billion-on-great-salt-lake
1•kianN•6m ago•0 comments

Is Italy the new tax haven for the global rich?

https://www.bbc.com/worklife/article/20260421-is-italy-the-new-tax-haven-for-the-global-rich
1•andsoitis•11m ago•0 comments

Jeff Bezos is raising his game in space

https://www.economist.com/business/2026/04/23/jeff-bezos-is-raising-his-game-in-space
1•andsoitis•12m ago•0 comments

Bdelloid Rotifer

https://en.wikipedia.org/wiki/Bdelloidea
1•embedding-shape•13m ago•0 comments

Tim Cook wrote a winning recipe for Apple

https://www.economist.com/leaders/2026/04/23/tim-cook-wrote-a-winning-recipe-for-apple
1•andsoitis•14m ago•0 comments

Peter Sarnak – The Riemann Hypothesis [video]

https://www.youtube.com/watch?v=DtaFyE9BcXw
1•delhanty•17m ago•1 comments

Google is building a Claude Code challenger, Sergey Brin is involved

https://www.indiatoday.in/technology/news/story/google-is-secretly-building-a-claude-code-challen...
2•nsoonhui•22m ago•0 comments

Michael review: 'A bland and barely competent daytime TV movie'

https://www.bbc.com/culture/article/20260421-michael-review
1•dnnddidiej•32m ago•0 comments

Education must go beyond the mere production of words

https://www.ncregister.com/commentaries/schnell-repairing-the-ruins
2•signor_bosco•34m ago•0 comments

Decoupled DiLoCo for Resilient Distributed Pre-Training

https://arxiv.org/abs/2604.21428
1•matt_d•39m ago•0 comments

Serendipity Machines

https://www.shishyko.com/essays/serendipity-machines.html
1•philip1209•44m ago•0 comments

Mac-use: open-source Codex computer-use clone for your OpenClaw on Mac OS

https://github.com/TheGuyWithoutH/mac-computer-use
1•guywithnoh•48m ago•2 comments

ChatGPT ads targeting farmers (YouTube Link) [video]

https://www.youtube.com/watch?v=4rzeW4dbvlQ
1•ki4jgt•49m ago•0 comments

Prop 13 Didn't Shrink Government. It Handed It to Sacramento

https://maxmautner.com/2026/04/23/prop-13-changed-things.html
1•mslate•53m ago•0 comments

Why does the Rainbow have 7 colors?

https://glorify.com/learn/why-does-the-rainbow-have-seven-colors
2•airstrike•54m ago•0 comments

You're about to feel the AI money squeeze

https://www.theverge.com/ai-artificial-intelligence/917380/ai-monetization-anthropic-openai-token...
2•cdrnsf•56m ago•1 comments

Anthropic now requires Pro Plans to enable/purchase extra usage for Opus

https://support.claude.com/en/articles/11940350-claude-code-model-configuration
8•qdot76367•59m ago•3 comments

Context Pricing and Accounting [video]

https://www.youtube.com/watch?v=xcYhV4S7faI
1•journal•1h ago•0 comments

Chinese National Pleads Guilty to Photographing Air Force Base and Equipment

https://www.justice.gov/usao-wdmo/pr/chinese-national-pleads-guilty-unlawfully-photographing-air-...
2•737min•1h ago•3 comments

Databases Were Not Designed for This

https://arpitbhayani.me/blogs/defensive-databases/
1•mooreds•1h ago•0 comments

James Bosworth on the 'Orange Wave' Happening Across Latin America

https://www.bloomberg.com/news/articles/2026-04-24/james-bosworth-on-the-orange-wave-happening-ac...
1•mooreds•1h ago•1 comments

Alex Bores' AI Policy Framework for Congress [pdf]

https://www.alexbores.nyc/files/Bores_AI_Framework.pdf
1•mooreds•1h ago•0 comments

Andrej Karpathy's microgpt as a Triptych

https://karpathy.art/
1•stared•1h ago•0 comments

Chinese National Arrested for Illegally Photographing Military Aircraft at AFB

https://www.justice.gov/opa/pr/chinese-national-arrested-jfk-international-airport-federal-charge...
2•737min•1h ago•1 comments

Exodus, from former Mass Effect devs, couldn't look more like Mass Effect

https://www.pcgamer.com/games/rpg/exodus-the-sci-fi-rpg-from-former-mass-effect-devs-couldnt-look...
2•evo_9•1h ago•0 comments

Ancient amber reveals a true bug equipped with claws, a highly unusual feature

https://phys.org/news/2026-04-ancient-amber-reveals-true-bug.html
2•bookofjoe•1h ago•0 comments

The bull case for graph DBs in law

https://alanyahya.com/writing/bull-case-graph-dbs-law
2•alansaber•1h ago•0 comments