frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•7mo ago

Comments

tocs3•7mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Universe Time Machine Using AI God and the Universe Internet

https://patents.google.com/patent/US20250238653A1/en
1•zdw•1m ago•0 comments

Show HN: Airboard – $1 voice dictation for Mac local

https://dhruvian473.gumroad.com/l/pgcjbc
1•mehrad_1•1m ago•0 comments

Ripple: The Elegant TypeScript UI Framework

https://jsdev.space/meet-ripple/
1•javatuts•2m ago•0 comments

Cecil Kelley Criticality Accident

https://en.wikipedia.org/wiki/Cecil_Kelley_criticality_accident
1•toomuchtodo•8m ago•0 comments

Millennium Prize Problem Bench

https://mppbench.com/
2•a_tartaruga•11m ago•0 comments

eBook: Wall Street Sold Out Main Street

https://www.founderstowne.com/
1•i_have_to_speak•15m ago•0 comments

My Home Fibre Network Disintegrated

https://alienchow.dev/post/fibre_disintegration/
1•alienchow•16m ago•0 comments

I Use Jujutsu

https://abhinavsarkar.net/posts/jj-usage/
2•vinhnx•17m ago•0 comments

AI Predicts Disease from One Night of Sleep

https://www.sciencedaily.com/releases/2026/01/260109023114.htm
1•phyzix5761•22m ago•1 comments

Best Practices for Reducing Dependabot Noise

https://nesbitt.io/2026/01/10/16-best-practices-for-reducing-dependabot-noise.html
1•todsacerdoti•23m ago•0 comments

SoundSlab: How It Started

https://craigjb.com/2026/01/10/soundslab-beginning/
1•ahlCVA•23m ago•0 comments

AI Consciousness: A Biological Perspective

https://substack.com/@cocakoala/note/p-184178209
1•imranmk•29m ago•1 comments

The Stranger You Can Trust

https://www.thefp.com/p/the-stranger-you-can-trust
1•gmays•32m ago•0 comments

Judging Books by Their Covers, Empirically

https://yakshed.com/books/
1•abound•36m ago•0 comments

Show HN: PrintReadyBook

https://printreadybook.com/
1•cboulio•39m ago•0 comments

I Built a 1 Petabyte Server from Scratch [video]

https://www.youtube.com/watch?v=vVI7atoAeoo
1•zdw•39m ago•0 comments

Q Source

https://en.wikipedia.org/wiki/Q_source
1•vinnyglennon•40m ago•0 comments

Lord's Prayer

https://en.wikipedia.org/wiki/Lord%27s_Prayer
1•vinnyglennon•41m ago•0 comments

Subagents, Commands and Skills Are Converging

https://vivekhaldar.com/articles/claude-code-subagents-commands-skills-converging/
1•gandalfgeek•41m ago•0 comments

Musk's X to open source new algorithm in seven days

https://www.reuters.com/business/media-telecom/musks-x-open-source-new-algorithm-seven-days-2026-...
1•maxloh•41m ago•0 comments

Show HN: UI testing using multimodal LLMs

https://kodefreeze.com
1•kodefreeze•45m ago•0 comments

San Jose Mayor: CA's proposed wealth tax push burden onto middle class families [video]

https://www.youtube.com/watch?v=muVVOjJsLG8
1•donsupreme•46m ago•0 comments

Max Payne – two decades later – Graphics Critique

https://darkcephas.blogspot.com/2021/07/max-payne-two-decades-later-graphics.html
2•davikr•51m ago•0 comments

Show HN: Just published a hard-SF novel Voyager1 returns with a quantum palantir

https://www.amazon.com/dp/B0GFSMP572
1•dufbugderopa•56m ago•0 comments

Orca: A New Architecture for Efficient AGI Through Parent-Teacher Learning

https://x.com/EricOmnigenius/article/2009656779945451932
2•ericspecullaas•58m ago•0 comments

A curated list of awesome explorable explanations

https://github.com/blob42/awesome-explorables
3•vitalnodo•1h ago•0 comments

The Declining Value of Personal Advice

https://www.gojiberries.io/the-declining-value-of-interpersonal-advice/
2•neehao•1h ago•1 comments

Show HN: Artdots: The benefits of creating a side project

https://artdots.co/blog/artdots-the-benefits-of-creating-a-side-project
1•veliona•1h ago•0 comments

Nvidia Announces Alpamayo Open-Source AI Models to Accelerate Reasoning-Based AV

https://nvidianews.nvidia.com/news/alpamayo-autonomous-vehicle-development
2•lateforwork•1h ago•0 comments

Ask HN: Before codebase review, replace all vars containing simple with complex?

1•gitprolinux•1h ago•0 comments