frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•1y ago

Comments

tocs3•1y ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Epiplexity

https://andys.blog/epiplexity/
1•andytratt•1m ago•0 comments

The Cheapest Mercy

https://protortyp.github.io/posts/the-cheapest-mercy/
2•protortyp•2m ago•0 comments

Building self-improving tax agents with Codex

https://openai.com/index/building-self-improving-tax-agents-with-codex/
1•gmays•6m ago•0 comments

Google pushes water standards amid data center backlash

https://www.axios.com/2026/06/03/google-pushes-water-standards-data-center-backlash
1•1vuio0pswjnm7•7m ago•1 comments

Show HN: AI Agent that resolves all your support issues

1•Daniel-Pan•7m ago•0 comments

American capitalism has taken an apocalyptic turn

https://economist.com/business/2026/06/03/american-capitalism-has-taken-an-apocalyptic-turn
1•andsoitis•8m ago•0 comments

American Fork PD posts and removes unredacted bodycam footage

https://old.reddit.com/r/RecklessBen/comments/1tvzfv9/american_fork_pd_unredacted_bodycamdashcam_...
1•cosmicgadget•11m ago•0 comments

Strategic Petroleum Reserve

https://www.energy.gov/hgeo/opr/strategic-petroleum-reserve
1•mooreds•11m ago•0 comments

The Post That Beat The News By 38 Minutes. [video][50 mins]

https://www.youtube.com/watch?v=b2VAiFBXz-E
1•Bender•14m ago•0 comments

DOJ investigating former congressman George Santos for insider trading on Kalshi

https://www.npr.org/2026/06/02/nx-s1-5843371/george-santos-kalshi-insider-trading-investigation
2•gnabgib•17m ago•0 comments

SpaceX wins tax exemption for $55B AI chip plant despite local backlash

https://www.ft.com/content/86b2440a-60ce-4a5b-94ba-a6a4456ae574
1•1vuio0pswjnm7•17m ago•0 comments

Sil Val was built on public money-now it's fighting California's billionaire tax

https://www.morningstar.com/news/marketwatch/2026060256/silicon-valley-was-built-on-public-money-...
1•initramfs•18m ago•0 comments

Postgres IDE in Cursor

https://techcommunity.microsoft.com/blog/adforpostgresql/your-postgresql-workflow-just-found-its-...
1•gen_tp•18m ago•0 comments

Scientists uncover Feynman's formula for finding best holiday restaurant

https://www.theguardian.com/science/2026/jun/01/scientists-uncover-feynmans-formula-for-finding-b...
1•paulpauper•19m ago•0 comments

How cowboy culture remade Brazil

https://thebaffler.com/outbursts/rodeo-clowns-cowie
1•paulpauper•19m ago•0 comments

Are India's GDP figures OK after all?

https://www.ft.com/content/28783a0c-5a7c-4d6b-9485-61bdcb06e83d
1•paulpauper•20m ago•0 comments

Large AI Models in Dental Healthcare

https://arxiv.org/abs/2606.02914
1•berlianta•20m ago•0 comments

Meta Is Reportedly Working on an AI Pendant and More Smart Glasses

https://www.engadget.com/2184224/meta-developing-ai-pendant-more-smart-glass-models/
1•gmays•26m ago•0 comments

Ask HN: What would justify writing a kernel in 2026?

1•alonsovm44•26m ago•2 comments

Fridge with a Tiny Funnel Site

https://tailscale.com/blog/funnel-fridge
1•ChicknNuggt•28m ago•0 comments

The King and the Swarm

https://firstthings.com/the-king-and-the-swarm/
1•cratermoon•28m ago•0 comments

OpenAI Codex tool linked to malicious NPM supply chain attack

https://www.techradar.com/pro/security/openai-codex-tool-with-over-29-000-downloads-linked-to-mal...
1•ChicknNuggt•29m ago•0 comments

HttpBin Service

https://github.com/conductor-oss/httpbin
1•opiniateddev•31m ago•0 comments

Show HN: AI Gauge, a desktop monitor for Claude/Codex/Copilot usage limits

https://github.com/jpajak/ai-gauge
1•jpajak•34m ago•0 comments

Darknet Market Maximalism

https://antimoonboy.com/darknetmarketmaximalism/
2•Cider9986•39m ago•0 comments

Algebra of Contexts

https://github.com/neurons-me/.me
2•suiGn•45m ago•0 comments

Father of VR: The best AI future nobody is talking about – Jaron Lanier [video]

https://www.youtube.com/watch?v=v8f73ueeSTw
2•tartoran•47m ago•0 comments

Grok Becomes the Voice of Vapi

https://x.ai/news/grok-vapi
3•azeitona•50m ago•0 comments

The ancient diseases that plagued the dinosaurs

https://www.bbc.com/future/article/20230214-could-dinosaurs-get-cancer
2•thunderbong•51m ago•0 comments

When IPOs go wrong: SpaceX, AI firms face a delicate process

https://www.reuters.com/legal/transactional/when-ipos-go-wrong-spacex-ai-firms-face-delicate-proc...
2•1vuio0pswjnm7•51m ago•0 comments