The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•7mo ago

Comments

tocs3•7mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
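
To make "reasoning paths based on A* search" concrete, here is a minimal sketch of how a search trace and a final answer could be produced for a small grid problem. The grid encoding, the function name `astar_with_trace`, and the `("create" | "close", cell, cost)` trace format are assumptions made for illustration only; they are not taken from the paper and may differ from its actual data format.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """Run A* on a 4-connected grid and return (path, trace).

    grid  : list of equal-length strings, '#' marks a wall (hypothetical encoding)
    trace : list of ("create" | "close", cell, cost) steps (hypothetical format)
    """
    def h(cell):
        # Manhattan distance: admissible and consistent for unit-cost 4-connected moves.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    frontier = [(h(start), 0, start)]  # entries are (f = g + h, g, cell)
    parent = {start: None}
    cost = {start: 0}
    closed = set()
    trace = []

    while frontier:
        _, g, cell = heapq.heappop(frontier)
        if cell in closed:
            continue  # stale queue entry for a cell that was already expanded
        closed.add(cell)
        trace.append(("close", cell, g))

        if cell == goal:
            # Walk parent links back to the start to recover the final answer.
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1], trace

        r, c = cell
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue  # off the grid
            if grid[nr][nc] == "#":
                continue  # wall
            ng = g + 1
            if ng < cost.get((nr, nc), float("inf")):
                cost[(nr, nc)] = ng
                parent[(nr, nc)] = cell
                heapq.heappush(frontier, (ng + h((nr, nc)), ng, (nr, nc)))
                trace.append(("create", (nr, nc), ng))

    return None, trace  # goal unreachable


if __name__ == "__main__":
    grid = ["..#",
            "...",
            "#.."]
    path, trace = astar_with_trace(grid, (0, 0), (2, 2))
    print("answer:", path)
    print("trace steps:", len(trace))
```

In this sketch the trace plays the role of the intermediate "thinking" tokens and the path plays the role of the final answer. Because both come from a known algorithm, each can be checked separately, which is the kind of comparison the summary above describes.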
