frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•1y ago

Comments

tocs3•1y ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

OpenSMTPD Is the Mail Server for the Future

https://bsdly.blogspot.com/2026/05/opensmtpd-is-mail-server-for-future.html
1•birdculture•1m ago•0 comments

Who Are You?

https://syntheticauth.ai/posts/who-are-you
1•zerolayers•1m ago•0 comments

Kahneman on Contingencies (2002)

https://pmc.ncbi.nlm.nih.gov/articles/PMC3292229/
1•downbad_•4m ago•0 comments

Chinese memory maker CXMT enters mainstream consumer memory with Corsair kit

https://www.tomshardware.com/pc-components/ddr5/chinese-memory-maker-cxmt-enters-the-mainstream-c...
1•Markoff•5m ago•0 comments

Efficient Way to Cool a Drink

https://blog.sintef.com/energy/efficient-way-to-cool-a-drink/
2•wolfi1•6m ago•1 comments

DeepSeek Sparse Attention

https://github.com/rasbt/LLMs-from-scratch/tree/main/ch04%2F09_dsa
1•eigenBasis•6m ago•0 comments

Companies pay billions to show ads to bots. We can pay humans instead

https://www.nexertise.com/founding
2•izzygottlieb•8m ago•1 comments

USB Drives Are Cool Now

https://micahblachman.beehiiv.com/p/usb-drives-are-cool-now
1•subdomain•9m ago•0 comments

Ask HN: Best worldwide / classic phone games?

3•bix6•9m ago•1 comments

What we lost when we stopped programming

https://hermanschaaf.com/what-we-lost-when-we-stopped-programming/
1•hermanschaaf•9m ago•0 comments

Show HN: Time to Get Up

https://bigballi.com/move-reminder/
1•BigBalli•12m ago•0 comments

The End of a Craft?

https://neuribs.substack.com/p/the-end-of-a-craft
1•ribhu97•14m ago•0 comments

Show HN: The Front Page – Newspaper-style front page for Hacker News

https://thefrontpage.dev/
1•stagas•17m ago•0 comments

Is a Claw driven Hacker News user a problem?

3•delichon•18m ago•1 comments

Blocking an ASN (or similar) from my sites

https://dracos.co.uk/wrote/blocking-an-asn/
1•cdrnsf•19m ago•0 comments

The Shadowserver Foundation is a nonprofit security organization

https://www.shadowserver.org/
1•mooreds•20m ago•0 comments

I can focus for 5 hours on a tumor but not 5 minutes on someone I love

https://itsbrainsurgery.beehiiv.com/p/i-can-focus-for-5-hours-on-a-tumor-but-not-5-minutes-on-som...
1•dotcoma•20m ago•0 comments

You can't whisper at an AI agent

https://stripe.dev/blog/ai-steering-experiments
1•mooreds•21m ago•0 comments

The Case Against the AI Job Apocalypse

https://www.theringer.com/podcasts/plain-english-with-derek-thompson/2026/05/12/the-case-against-...
1•mooreds•22m ago•1 comments

Bad Agent

https://scobt.com/posts/bad-agent/
1•scotchfield•23m ago•0 comments

Jira Is Turing Complete

https://seriot.ch/computation/jira.html
5•fanf2•25m ago•0 comments

How my minimal, memory-safe Go rsync steers clear of vulnerabilities

https://michael.stapelberg.ch/posts/2026-05-24-minimal-memory-safe-go-rsync-vulns/
1•secure•30m ago•0 comments

How to Tame AI's Voracious Appetite for Energy

https://nautil.us/how-to-tame-ais-voracious-appetite-for-energy-1281212
1•Brajeshwar•30m ago•0 comments

Your Dotfiles Are Not a Distro

https://abyss.fish/your_dotfiles_are_not_a_distro
2•j3s•30m ago•0 comments

Sparrow

https://sparrowhub.io/
1•tosh•37m ago•0 comments

Miranda's Rescue was paid to save dogs, but is accused of killing them instead

https://kymkemp.com/2026/05/22/paid-to-save-them-accused-of-killing-them-the-investigation-of-mir...
2•ilamont•40m ago•0 comments

The seed oil panic is hurting my cardiac patients

https://www.statnews.com/2026/05/22/seed-oils-healthy-fats-tallow-fact-check-cardiac-health/
57•randycupertino•40m ago•20 comments

AI and the Rise of Just-in-Time Knowledge Work

https://operatingnotes.bearblog.dev/ai-and-the-rise-of-just-in-time-knowledge-work/
1•ninja-z•40m ago•0 comments

Salesforce Touts AI Promise over Reality in SaaSpocalypse Fight

https://www.bloomberg.com/news/articles/2026-05-22/salesforce-touts-ai-promise-over-reality-in-sa...
2•kjhughes•42m ago•1 comments

Rust is a great fit for the agentic era

https://kerkour.com/rust-agentic-coding
2•randomint64•42m ago•1 comments