frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•6mo ago

Comments

tocs3•6mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Trump Clears Sale of More Powerful Nvidia A.I. Chips to China

https://www.nytimes.com/2025/12/08/business/trump-nvidia-chips-china.html
1•aaraujo002•17s ago•0 comments

Bringing More Real-Time News and Content to Meta AI

https://about.fb.com/news/2025/12/bringing-more-real-time-news-and-content-to-meta-ai/
1•donohoe•5m ago•0 comments

Manual: Spaces

https://type.today/en/journal/spaces
1•doener•7m ago•1 comments

Deprecation: Software Engineering at Google

https://abseil.io/resources/swe-book/html/ch15.html
1•jez•7m ago•0 comments

Scientists devise method to fight aging at the cellular level

https://www.washingtonpost.com/science/2025/12/08/aging-stem-cells-longevity-mitochondria/
1•pseudolus•8m ago•1 comments

Show HN: ZetaCrush – An Intelligent LLM Leaderboard

https://zetacrush.com
1•zetacrushagent•9m ago•0 comments

EU says it will 'make sure' Elon Musk's X pays €120M fine

https://www.politico.eu/article/eu-commissions-will-make-sure-x-musk-pay-120m-fine-transparency-r...
2•doener•10m ago•0 comments

We clean up AI-generated code

https://www.taylortech.app/services/vibe-code-janitorial
1•jtaylortech•13m ago•1 comments

Just how big is the AI investment wave?

https://www.reuters.com/graphics/USA-ECONOMY/AI-INVESTMENT/gkvlqbgxkpb/
4•gmays•13m ago•1 comments

Waymo self-driving cars in 'standoff' cause traffic jam in San Francisco

https://abc7news.com/post/waymo-driving-cars-standoff-cause-traffic-jam-san-francisco-video-shows...
2•mikhael•14m ago•0 comments

State of Neuroscience 2025: Trends and Breakthroughs

https://stateofneuroscience.thetransmitter.org/
2•gmays•16m ago•0 comments

Kroger acknowledges that its bet on robotics went too far

https://www.grocerydive.com/news/kroger-ocado-close-automated-fulfillment-centers-robotics-grocer...
19•JumpCrisscross•21m ago•7 comments

Ask HN: How do you get human support from OpenAI?

2•TheWizKnows•23m ago•0 comments

Is There a Religious Revival Among Young US Adults?

https://www.pewresearch.org/religion/2025/12/08/religion-holds-steady-in-america/
2•Mertax•28m ago•0 comments

Putin Wanted AI Supremacy. Now Russia Is Struggling to Stay in the Race

https://www.wsj.com/tech/ai/putin-wanted-ai-supremacy-now-russia-is-struggling-to-stay-in-the-rac...
2•bookofjoe•31m ago•1 comments

Scaling compute for retrieval by 5 OOMs: SID-1 tech report

https://www.sid.ai/research/sid-1-technical-report
2•maxrumpf•32m ago•1 comments

Trump Says U.S. Will Allow Nvidia H200 Chip Sales to China, Get 25% Cut

https://www.wsj.com/tech/nvidia-china-exports-h2000-chips-5943aa48
4•sebastian_z•33m ago•0 comments

EU to weaken more environment reporting rules, draft document shows

https://www.reuters.com/sustainability/climate-energy/eu-weaken-more-environment-reporting-rules-...
1•mohi-kalantari•36m ago•1 comments

The UK Is Transforming Coal Mines into Geothermal Hubs

https://oilprice.com/Energy/Energy-General/The-UK-Is-Transforming-Coal-Mines-Into-Geothermal-Hubs...
2•PaulHoule•37m ago•0 comments

GPT Gone Rogue

https://publuu.com/flip-book/1026520/2269465
1•nuevita70•38m ago•1 comments

Los Angeles' power supply is now officially coal-free

https://electrek.co/2025/12/08/los-angeles-power-supply-is-now-officially-coal-free/
4•gnabgib•39m ago•1 comments

General Purpose Simulation System

https://en.wikipedia.org/wiki/GPSS
2•thomasjb•39m ago•0 comments

Why does New Zealand take such a long summer holiday break?

https://www.rnz.co.nz/life/lifestyle/why-does-new-zealand-take-such-a-long-summer-holiday-break
4•billybuckwheat•42m ago•0 comments

Simplifying Quines

https://blog.phronemophobic.com/quineize.html
1•refset•44m ago•0 comments

Chrome removes middle click on new-tab button to open URL from clipboard

https://issues.chromium.org/issues/457495649
2•subleq•44m ago•0 comments

Looking for Badass Front Dev

https://www.beycome.com/
2•The-Patriot•48m ago•0 comments

Paramount launches rival bid for Warner Bros Discovery

https://www.bbc.com/news/articles/cj69xzpzrdyo
5•colinprince•51m ago•1 comments

iPhone Users in Japan Can Now Send Messages via Satellite

https://www.macrumors.com/2025/12/08/japan-messages-via-satellite/
4•mikhael•51m ago•0 comments

It's ~2026 –. ChatGPT still doesn't allow email change

https://help.openai.com/en/articles/4936827-how-to-change-your-email-address
46•amukbils•1h ago•62 comments

Safety services for recreational vessels conducting ocean passages

https://passageguardian.nz/
1•monerozcash•1h ago•0 comments