frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•11mo ago

Comments

tocs3•11mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

18th-century mechanical volcano roars to life 250 years later

https://www.sciencedaily.com/releases/2026/05/260502015359.htm
1•samizdis•1m ago•0 comments

WeSearch

https://wesearch.press/
1•EGCstudy•1m ago•1 comments

Making 10 apps in 30 Days

https://bendansby.com/posts/10-apps-30-days.html
1•webwielder2•2m ago•0 comments

Iceland's Pools and Hot Tubs Now UNESCO-Recognized. Some Locals Aren't Thrilled.

https://www.nytimes.com/2026/04/30/world/europe/iceland-hot-tub-pools-tourism.html
1•bookofjoe•2m ago•1 comments

Show HN: Predicting the 2026 Kentucky Derby with 1T Monte Carlo Sims on Burla

https://burla-cloud.github.io/examples/kentucky-derby-demo/
1•Jack_at_Burla•4m ago•0 comments

AI talks draw backlash from Mass. state lawmakers

https://www.politico.com/news/2026/05/01/ai-backlash-massachusetts-lawmakers-00903440
1•1vuio0pswjnm7•6m ago•0 comments

Life update: Zig, AI, unemployment, and more [video]

https://www.youtube.com/watch?v=DhhPUrizZcw
1•rubenflamshep•6m ago•0 comments

How Oregon's Data Center Boom Is Supercharging a Water Crisis

https://waterwatch.org/how-oregons-data-center-boom-is-supercharging-a-water-crisis/
1•therobots927•7m ago•0 comments

Palantir Comes to Campus

https://nymag.com/intelligencer/article/palantir-yale-conference-ai.html
1•jbegley•8m ago•0 comments

Shitpostmodernism: Understanding the Slopgeneration

https://www.spikeartmagazine.com/articles/essay-shitpostmodernism
1•thinkingemote•8m ago•0 comments

AI Agents Are the Mass-Produced Cars of Software

https://telegraphic.substack.com/p/ai-agents-are-the-mass-produced-cars
1•telegrahi•9m ago•0 comments

Opioid maker Purdue Pharma shuts down as part of $7.4B deal

https://www.usatoday.com/story/news/nation/2026/05/01/purdue-pharma-shuts-down-opioid-crisis-oxyc...
1•geox•10m ago•0 comments

Disneyland Now Uses Face Recognition on Visitors

https://www.wired.com/story/security-news-this-week-disneyland-now-uses-face-recognition-on-visit...
2•Brajeshwar•15m ago•0 comments

Digital Ecosystems: Interactive Multi-Agent Neural Cellular Automata

https://pub.sakana.ai/digital-ecosystem/
1•jarmitage•17m ago•0 comments

How are Life-Size Figures Created at hololive production?

https://coveredge.cover-corp.com/en/list/4759
1•ai_slop_hater•17m ago•0 comments

Vibecoded my dream game, GeoGuesser for guns, now its helping with student bills

https://gunguesser.com
4•salad_vr•17m ago•5 comments

The Railway and the Balloon

https://netwars.pelicancrossing.net/2026/05/01/the-railway-and-the-balloon/
1•ColinWright•21m ago•0 comments

Floating Armoury

https://en.wikipedia.org/wiki/Floating_armoury
1•jjmarr•23m ago•0 comments

Customizing Claude Code spinner verbs

https://www.augmentedswe.com/p/customizing-claude-code-spinner-verbs
1•wordsaboutcode•24m ago•0 comments

Back end-for-Front end: The most secure architecture for browser-based apps

https://fusionauth.io/blog/backend-for-frontend-security-architecture
2•mooreds•28m ago•0 comments

Voyager and the Art of Graceful Degradation

https://www.flyingbarron.com/2026/04/voyager-and-art-of-graceful-degradation.html
1•mooreds•29m ago•0 comments

Did I photograph the Aurora or was it something else? (2016)

https://wp.lancs.ac.uk/aurorawatchuk/2016/03/16/did-i-photgraph-the-aurora-or-was-it-something-else/
1•susam•30m ago•0 comments

Upcoming Blender Development Fund and AI Policies

https://www.blender.org/news/upcoming-blender-development-fund-and-ai-policies/
2•sensanaty•32m ago•0 comments

The Annoying Usefulness of Emacs [video]

https://www.youtube.com/watch?v=DMbrNhx2zWQ
2•susam•32m ago•0 comments

The Sky Tonight

https://theskylive.com/guide
2•susam•33m ago•0 comments

New US phone network for Christians to block porn and gender-related content

https://www.technologyreview.com/2026/05/01/1136739/a-new-t-mobile-network-for-christians-aims-to...
7•thinkingemote•36m ago•2 comments

Making Your Writing Work Harder for You

https://training.kalzumeus.com/newsletters/archive/content-marketing-strategy
2•eigenBasis•38m ago•0 comments

Show HN: TradingAgents without the API bill – run multi agents in Claude Code

https://github.com/lucemia/trading-agents-plugin
1•lucemia51•43m ago•0 comments

Stop Supplying. Start Owning

https://allensthoughts.com/2026/05/01/stop-supplying-start-owning/
2•herbertl•44m ago•0 comments

Uber wants to turn its drivers into a sensor grid for AV companies

https://techcrunch.com/2026/05/01/uber-wants-to-turn-its-millions-of-drivers-into-a-sensor-grid-f...
6•nickvec•45m ago•1 comments