The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•12mo ago

Comments

tocs3•12mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
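
For anyone who wants a concrete picture of what a "step-by-step reasoning path based on A* search" could look like, here is a minimal sketch. It is not the paper's code. The grid-maze setting, the function name astar_with_trace, and the ("create" / "close", node, g, h) trace format are all assumptions made for illustration. The idea is only that a solver's bookkeeping can be logged as a sequence of steps, and that such a log is something you can check against the final path.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """A* on a 4-connected grid, logging every step (hypothetical trace format).

    grid  : list of equal-length strings; '#' marks a wall.
    start : (row, col) tuple.
    goal  : (row, col) tuple.
    Returns (path, trace). path is a list of cells or None if unreachable.
    trace is a list of (op, cell, g, h) tuples, op in {"create", "close"}.
    """

    def h(cell):
        # Manhattan distance: admissible and consistent for unit-cost moves.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    open_heap = [(h(start), 0, start)]  # entries are (f, g, cell)
    best_g = {start: 0}
    parent = {start: None}
    closed = set()
    trace = [("create", start, 0, h(start))]

    while open_heap:
        _, g, cell = heapq.heappop(open_heap)
        if cell in closed:
            continue  # stale heap entry for an already expanded cell
        closed.add(cell)
        trace.append(("close", cell, g, h(cell)))

        if cell == goal:
            # Walk parent pointers back to the start.
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1], trace

        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue
            if grid[nr][nc] == "#":
                continue
            new_g = g + 1
            if new_g < best_g.get(nxt, float("inf")):
                best_g[nxt] = new_g
                parent[nxt] = cell
                heapq.heappush(open_heap, (new_g + h(nxt), new_g, nxt))
                trace.append(("create", nxt, new_g, h(nxt)))

    return None, trace


if __name__ == "__main__":
    maze = ["..#",
            "...",
            "#.."]
    path, trace = astar_with_trace(maze, (0, 0), (2, 2))
    print("path:", path)
    for step in trace:
        print(step)
```

In a setup like the one the summary describes, the trace would play the role of the intermediate tokens and the path would be the final answer. The "random and incorrect reasoning steps" condition would then correspond to pairing a problem with a trace that does not belong to it, though how the paper constructs those is not something this sketch tries to reproduce.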
