The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•9mo ago

Comments

tocs3•9mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps, and surprisingly, they still perform about the same as, and sometimes even better than, those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
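To make "step-by-step reasoning paths and final answers ... based on a known solving method (A* search)" a bit more concrete, here is a minimal sketch of what such a training pair could look like. Everything in it is an assumption for illustration: the grid-maze setting, the function name `astar_with_trace`, and the `("create" | "close", cell, g, h)` trace format are hypothetical and not taken from the paper.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """Run A* on a 4-connected grid and return (trace, path).

    grid  : list of equal-length strings; '#' marks a wall (assumed format).
    trace : list of (op, cell, g, h) tuples, a hypothetical "reasoning trace".
    path  : list of cells from start to goal, or None if unreachable.
    """
    def h(cell):
        # Manhattan distance: admissible and consistent for unit-cost grid moves.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    open_heap = [(h(start), 0, start)]  # entries are (f, g, cell)
    best_g = {start: 0}
    parent = {start: None}
    closed = set()
    trace = [("create", start, 0, h(start))]

    while open_heap:
        _, g, cell = heapq.heappop(open_heap)
        if cell in closed:
            continue  # stale heap entry for an already expanded cell
        closed.add(cell)
        trace.append(("close", cell, g, h(cell)))

        if cell == goal:
            # Walk parent pointers back to the start to recover the plan.
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return trace, path[::-1]

        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue  # outside the grid
            if grid[nr][nc] == "#":
                continue  # wall
            ng = g + 1
            if ng < best_g.get(nxt, float("inf")):
                best_g[nxt] = ng
                parent[nxt] = cell
                heapq.heappush(open_heap, (ng + h(nxt), ng, nxt))
                trace.append(("create", nxt, ng, h(nxt)))

    return trace, None  # goal not reachable


if __name__ == "__main__":
    maze = ["..#",
            "..#",
            "..."]
    trace, path = astar_with_trace(maze, (0, 0), (2, 2))
    print("trace steps:", len(trace))
    print("final plan:", path)
```

In this sketch the trace plays the role of the intermediate tokens and the path plays the role of the final answer. Because both come from the same solver run, each can be checked separately, which is the kind of comparison the summary describes.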
