frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Show HN: You Are an Agent

https://youareanagent.app
3•robkop•6m ago•0 comments

The Zero Human Company

https://blog.grvy.dev/blog/the-zero-human-company/
1•GRVYDEV•6m ago•0 comments

Beer Money

https://www.permanentequity.com/content/permanent-equitys-guide-to-beer-money
1•rwmj•11m ago•0 comments

My thousand dollar iPhone can't do math

https://journal.rafaelcosta.me/my-thousand-dollar-iphone-cant-do-math/
1•rafaelcosta•13m ago•0 comments

ConsentFix

https://pushsecurity.com/blog/consentfix
1•weinzierl•13m ago•0 comments

15 Years of Blogging

https://nolanlawson.com/2026/02/01/15-years-of-blogging/
1•feross•14m ago•0 comments

European Open Source AI Index

https://osai-index.eu/
2•leonry•15m ago•1 comments

Security scanner that detect's AI-generated code vulnerabilities

https://codeslick.dev/
1•vitorlourenco•16m ago•1 comments

The State of Garnet, 2026

https://wiki.alopex.li/TheStateOfGarnet2026
1•birdculture•22m ago•0 comments

The OSI Deprogrammer

https://docs.google.com/document/u/0/d/1iL0fYmMmariFoSvLd9U5nPVH1uFKC7bvVasUcYq78So/mobilebasic?p...
1•MrDrMcCoy•23m ago•0 comments

Traforo – Ngrok/Localtunnel Alternative as a Cloudflare Durable Object

https://github.com/remorses/traforo
1•xmorse•23m ago•0 comments

Building Your Own Efficient uint128 in C++

https://solidean.com/blog/2026/building-your-own-u128/
3•PaulHoule•24m ago•0 comments

Show HN: OpsCompanion – A shared system model for humans and AI agents

https://opscompanion.ai/
1•kennethops•26m ago•0 comments

How random are TOTP codes?

https://shkspr.mobi/blog/2024/07/how-random-are-totp-codes/
3•sugipula•28m ago•0 comments

PSA: The Best Hacker News App for iOS is Called "HACK"

https://eliot.blog/p/psa-the-best-hacker-news-app-for-ios
1•ea016•28m ago•0 comments

ECMAScript Pattern Matching

https://github.com/tc39/proposal-pattern-matching
1•modinfo•29m ago•0 comments

Thermodynamic Wages in Autonomous AI Economies

https://twitter.com/i/status/2017995855417225633
1•birriel•31m ago•0 comments

Ask HN: Have you found that coding agents make you more civil IRL?

4•burnerToBetOut•34m ago•1 comments

Helping Strangers Access the Internet

https://blog.dougbelshaw.com/tor-snowflake/
1•radeeyate•36m ago•0 comments

Kiki – The accountability monster for people who are easily distracted

https://www.kiki.computer/
3•pikseladam•36m ago•0 comments

I created moltfight a platform designed for AI agent to fight autonomously

https://moltfight.com
1•nykodev•36m ago•0 comments

March for Billionaires

https://marchforbillionaires.org/#why
4•gaws•39m ago•3 comments

Consciousness science: where are we, where are we going, what if we get there?

https://www.frontiersin.org/journals/science/articles/10.3389/fsci.2025.1546279/full
2•Noaidi•40m ago•0 comments

Space Shuttle Columbia Loss Anniversary

https://en.wikipedia.org/wiki/Space_Shuttle_Columbia_disaster
1•d_silin•40m ago•0 comments

Starlink privacy change sparks concerns as SpaceX eyes trillion-dollar xAI mergr

https://www.cryptopolitan.com/starlink-privacy-change-sparks-concerns/
4•Noaidi•41m ago•0 comments

Directed Messaging

https://urbitsystems.tech/article/v03-i01/directed-messaging
1•yosoyubik•41m ago•0 comments

The Fed – Internationalization of the Chinese renminbi: progress and outlook

https://www.federalreserve.gov/econres/notes/feds-notes/internationalization-of-the-chinese-renmi...
1•janandonly•41m ago•0 comments

Monica: Remember everything about friends, family and business relationships

https://github.com/monicahq/monica
1•rootkea•45m ago•0 comments

"The fate of civilization is at stake"

https://www.techemails.com/p/the-fate-of-civilization-is-at-stake
1•bathtub365•52m ago•2 comments

High-Speed Internet Boom Hits Low-Tech Snag: A Labor Shortage

https://www.wsj.com/business/telecom/high-speed-internet-boom-hits-low-tech-snag-a-labor-shortage...
2•layer8•53m ago•2 comments