The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•1y ago

Comments

tocs3•1y ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised by the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
