The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•11mo ago

Comments

tocs3•11mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps, and surprisingly they still perform about as well as, and sometimes even better than, those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
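
For anyone unfamiliar with the setup described in the summary, here is a minimal sketch (not the paper's code) of how A* search can emit a step-by-step trace alongside its final answer, and how the answer can be checked independently of the trace. The grid encoding, the `create`/`close` trace format, and the function names `astar_with_trace` and `path_is_valid` are all assumptions made for illustration.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """A* on a 4-connected grid ('#' marks a wall).

    Returns (path, trace). The trace is a list of hypothetical
    ("create" | "close", cell, g, h) tuples recording each search step.
    """
    def h(cell):
        # Manhattan distance: admissible and consistent on a unit-cost grid.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    rows, cols = len(grid), len(grid[0])
    open_heap = [(h(start), 0, start)]
    best_g = {start: 0}
    parent = {start: None}
    closed = set()
    trace = [("create", start, 0, h(start))]

    while open_heap:
        _, g, cell = heapq.heappop(open_heap)
        if cell in closed:
            continue  # stale heap entry for an already expanded cell
        closed.add(cell)
        trace.append(("close", cell, g, h(cell)))
        if cell == goal:
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1], trace
        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if 0 <= nr < rows and 0 <= nc < cols and grid[nr][nc] != "#":
                ng = g + 1
                if ng < best_g.get(nxt, float("inf")):
                    best_g[nxt] = ng
                    parent[nxt] = cell
                    heapq.heappush(open_heap, (ng + h(nxt), ng, nxt))
                    trace.append(("create", nxt, ng, h(nxt)))
    return None, trace  # goal unreachable


def path_is_valid(grid, path, start, goal):
    """Check the final answer only; the trace is deliberately ignored."""
    if not path or path[0] != start or path[-1] != goal:
        return False
    for r, c in path:
        in_bounds = 0 <= r < len(grid) and 0 <= c < len(grid[0])
        if not in_bounds or grid[r][c] == "#":
            return False
    # Every consecutive pair of cells must be exactly one step apart.
    return all(abs(a[0] - b[0]) + abs(a[1] - b[1]) == 1
               for a, b in zip(path, path[1:]))
```

Because `path_is_valid` never looks at the trace, the final answer and the intermediate steps can be scored separately, which is the kind of comparison the summary describes.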
