frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•7mo ago

Comments

tocs3•7mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Show HN: Smart Tab Suspender – Reduce Chrome memory usage by autosuspending tabs

https://chromewebstore.google.com/detail/smart-tab-suspender/mmhfonkfehekiofpkjiofjeoofloidog
1•yavuzyildirim•3m ago•0 comments

Infinite Collaborative Word Search Game

https://words.zip/
2•bookofjoe•3m ago•0 comments

Crafting Interpreters

https://craftinginterpreters.com/
2•tosh•3m ago•0 comments

OpenAI Forges Multibillion-Dollar Computing Partnership with Cerebras

https://www.wsj.com/tech/ai/openai-forges-multibillion-dollar-computing-partnership-with-cerebras...
3•rbanffy•4m ago•1 comments

Ugandans, Iranians turn to Dorsey's messaging app Bitchat in web crackdowns

https://www.reuters.com/business/media-telecom/ugandans-iranians-turn-dorseys-messaging-app-bitch...
1•_djo_•6m ago•0 comments

Analyzing my own genome with DRAGEN and Claude

https://www.dddiaz.com/post/t1d-genome-analysis-report/
1•dddiaz1•8m ago•0 comments

Training My Smartwatch to Track Intelligence

https://dmvaldman.github.io/rooklift/
1•dmvaldman•10m ago•0 comments

Scaling long-running autonomous coding

https://cursor.com/blog/scaling-agents
5•samwillis•11m ago•0 comments

The novelists who predicted our present

https://www.theguardian.com/books/2026/jan/10/mass-surveillance-the-metaverse-making-america-grea...
4•mooreds•12m ago•1 comments

The Hypocrisy over Iran

https://www.telegraph.co.uk/news/2026/01/14/silence-luvvies-iran-exposes-left-war-on-west-middle-...
3•midlander•12m ago•2 comments

Germany, Other NATO Allies Sending Troops to Greenland Amid Trump Threats

https://www.newsweek.com/greenland-germany-sending-troops-nato-donald-trump-threats-11361535
3•mooreds•13m ago•0 comments

Former NYC Mayor Eric Adams Accused of Crypto Pump and Dump with NYC Token

https://gizmodo.com/former-nyc-mayor-eric-adams-accused-of-crypto-pump-and-dump-with-nyc-token-20...
4•pseudolus•17m ago•1 comments

DoorDash and Uber Eats Cost Delivery Workers Millions of Dollars in Tips, NYC

https://gizmodo.com/doordash-and-uber-eats-cost-delivery-workers-millions-of-dollars-in-tips-nyc-...
2•pseudolus•19m ago•0 comments

Six prosecutors quit over push to investigate ICE shooting victim's widow

https://www.nytimes.com/2026/01/13/us/prosecutors-doj-resignation-ice-shooting.html
14•heavyset_go•20m ago•1 comments

Students aren't asking for help anymore. That could be a good thing

https://practicespace.substack.com/p/students-arent-asking-for-help-anymore
2•rappatic•21m ago•0 comments

Why I Use the GPL and Not Cuck Licenses

https://lukesmith.xyz/articles/why-i-use-the-gpl-and-not-cuck-licenses/
2•soygem•21m ago•0 comments

Poking holes into bytecode with peephole optimisations

https://xnacly.me/posts/2026/purple-garden-first-optimisations/
1•xnacly•21m ago•0 comments

Quantum Automated Theorem Proving

https://arxiv.org/abs/2601.07953
1•7777777phil•23m ago•0 comments

Verizon Outage

https://apnews.com/article/verizon-cellular-outage-85d658a4fb6a6175cae8981d91a809c9
2•zephyreon•23m ago•1 comments

The State of OpenSSL for pyca/cryptography

https://cryptography.io/en/latest/statements/state-of-openssl/
7•SGran•25m ago•0 comments

Show HN: Distribute AI agent test runs across your spare machines via `rr`

https://github.com/rileyhilliard/rr
1•RileyHilliard•29m ago•0 comments

Ui.dev and Fireship Join Forces

https://fireship.dev/uidotdev-and-fireship-join-forces
3•JustSkyfall•30m ago•0 comments

Germany joins European partners with troop deployment to Greenland

https://www.reuters.com/world/europe/germany-send-reconnaissance-troops-greenland-government-says...
13•consumer451•31m ago•0 comments

Our First Public Parks: The Forgotten History of Cemeteries (2011)

https://www.theatlantic.com/national/archive/2011/03/our-first-public-parks-the-forgotten-history...
1•toomuchtodo•33m ago•1 comments

Simple to Ornate and Back Again

https://josem.co/simple-to-ornate-and-back-again/
1•nikodunk•34m ago•0 comments

Show HN: quick-sync. TikTok-esque video switch using WebRTC

https://github.com/pion/webrtc/tree/master/examples/quick-switch
1•Sean-Der•35m ago•1 comments

Distributed SQL engine for ultra-wide tables

2•synsqlbythesea•35m ago•0 comments

Data centers are amazing. Everyone hates them

https://www.technologyreview.com/2026/01/14/1131253/data-centers-are-amazing-everyone-hates-them/
3•rbanffy•36m ago•2 comments

A Times Reporter Goes Inside a Cyberscam Center in a War Zone

https://www.nytimes.com/video/world/asia/100000010582900/myanmar-scam-complex-fraud.html
3•smurda•37m ago•0 comments

Dokploy uses a shared Swarm network with a hardcoded database password

https://github.com/Dokploy/dokploy/issues/3449
3•computergert•37m ago•0 comments