frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

The One Woman Anthropic Trusts to Teach AI Morals

https://www.wsj.com/tech/ai/anthropic-amanda-askell-philosopher-ai-3c031883
1•stanislavb•41s ago•0 comments

Byte magazine artist Robert Tinney, who illustrated the birth of PCs, dies at 78

https://arstechnica.com/gadgets/2026/02/byte-magazine-artist-robert-tinney-who-illustrated-the-bi...
2•sohkamyung•4m ago•1 comments

RIP Robert Tinney, the illustrator behind so many Byte magazines

https://tinney.net/in-memoriam
2•ohjeez•5m ago•0 comments

Electrolytes vs. Water: The Surprising Effect on Your Training Zones

https://vo2maxpro.com/blog/electrolytes-vs-water-training-zones
1•GoodluckH•10m ago•0 comments

Attorney General Bonta Announces $2.75M Settlement with Disney

https://oag.ca.gov/news/press-releases/california-wont-let-it-go-attorney-general-bonta-announces...
1•sebastian_z•13m ago•1 comments

Building a Pastebin, Hardening Two Services – While Working

https://www.smolkin.org/blog/2026/02/adding-api-auth-admin-panel-with-claude-code.html
1•msmolkin•14m ago•0 comments

Anthropic safety researcher quits, warning 'world is in peril'

https://www.semafor.com/article/02/11/2026/anthropic-safety-researcher-quits-warning-world-is-in-...
1•doener•15m ago•0 comments

Hacker News now thinks coding is solved

https://old.reddit.com/r/BetterOffline/comments/1qynmuc/hacker_news_now_thinks_coding_is_solved/
3•Cheyana•16m ago•1 comments

AI Is Getting Scary Good at Making Predictions

https://www.theatlantic.com/technology/2026/02/ai-prediction-human-forecasters/685955/
1•cainxinth•18m ago•1 comments

Software Engineering Past, Present, and Future with Grady Booch

https://oxide-and-friends.transistor.fm/episodes/software-engineering-past-present-and-future-wit...
1•weinzierl•18m ago•0 comments

Why the Economy Hasn't Crashed yet [video]

https://www.youtube.com/watch?v=jOR4wuiPeEQ
1•Wilsoniumite•20m ago•0 comments

Alphabet's Rare 100-Year Bond Tells Us That Money Is Easy

https://www.wsj.com/finance/investing/alphabets-rare-100-year-bond-tells-us-that-money-is-easy-77...
1•RyanShook•22m ago•0 comments

Show HN: Doodle on Your Partner's Widget

https://trylongdistance.com/
1•VatanaChhorn•23m ago•0 comments

Reducing Attack Surface for AI Agents with Process-Scoped Credentials

https://dreamiurg.net/2026/02/11/reducing-attack-surface-for-ai-agents-process-scoped-credentials...
1•dreamiurg•24m ago•1 comments

Show HN: OpenHarness – A harness for open source projects built by AI agents

https://openharn.vercel.app
1•naix•28m ago•0 comments

Claude's impact on older software engineers while listening to country music

https://suno.com/song/0d9b02a2-a709-4b2c-ba66-f62ff9306f79
1•botswana99•29m ago•0 comments

The SaaSpocalypse – The week AI killed software

https://www.fintechbrainfood.com/p/the-saaspocalypse
3•gmays•32m ago•2 comments

Agent Identities – Everything you need to know

https://mrinal.com/articles/agent-identities/
1•mattgreg•32m ago•0 comments

"Free" Surveillance Tech Still Comes at a High and Dangerous Cost

https://www.eff.org/deeplinks/2026/02/free-surveillance-tech-still-comes-high-and-dangerous-cost
2•hn_acker•34m ago•0 comments

Google Tells Employees: Brace for AI or Leave

https://www.gulte.com/trends/395721/brace-for-ai-or-leave-google-tells-employees
5•sowbug•35m ago•1 comments

Cisco Opensourced Tool to Build AI Bill of Materials

https://github.com/cisco-ai-defense/aibom
1•hsanthan•35m ago•0 comments

How Does the Initial Interest Confusion Doctrine Improve Trademark Analyses?

https://blog.ericgoldman.org/archives/2026/02/how-does-the-initial-interest-confusion-doctrine-im...
1•hn_acker•36m ago•1 comments

Weekly "Wordle" for Breaking AI Agents

https://playground.fabraix.com/
2•zachdotai•36m ago•1 comments

A session with 5.2 using 4o Tone.

1•WindySoliloquy•40m ago•0 comments

Automatic Differentiation from Scratch (2023)

https://blog.esciencecenter.nl/automatic-differentiation-from-scratch-23d50c699555
1•measurablefunc•41m ago•0 comments

The API Tooling Crisis: Why developers are abandoning Postman and its clones?

http://efp.asia/blog/2025/12/24/api-tooling-crisis/
2•PaulHoule•42m ago•1 comments

Why eight Australians died after having AstraZeneca's Covid vaccine

https://www.smh.com.au/national/why-eight-australians-died-after-having-astrazeneca-s-covid-vacci...
2•femto•42m ago•0 comments

Ask HN: Do you bother with take-homes?

1•throwaway123198•43m ago•3 comments

Self-hosted, memory-augmented AI chat that works with any LLM

https://github.com/PStryder/Cathedral
1•pstryder•43m ago•1 comments

Epstein Graph – The Complete Epstein Files Collection

https://epsteingraph.com/
1•redbell•43m ago•1 comments