The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
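For anyone unsure what "reasoning paths based on A* search" could look like in practice, here is a minimal sketch in Python. It is not the paper's code: the grid encoding, the function name `astar_with_trace`, and the `("expand", cell)` trace format are all assumptions made for illustration. The idea is only that a solver can emit both a final answer (the path) and a checkable record of intermediate steps (the trace), so the two can be validated separately.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """Run A* on a 4-connected grid and return (path, trace).

    grid:  list of strings; '#' is a wall, any other character is free.
    path:  list of (row, col) cells from start to goal, or None if unreachable.
    trace: list of ("expand", cell) steps in expansion order. This format is
           hypothetical, a stand-in for "intermediate tokens".
    """
    def h(cell):
        # Manhattan distance: admissible and consistent on a unit-cost grid.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    frontier = [(h(start), 0, start)]  # entries are (f = g + h, g, cell)
    best_cost = {start: 0}
    parent = {start: None}
    closed = set()
    trace = []

    while frontier:
        _, cost, cell = heapq.heappop(frontier)
        if cell in closed:
            continue  # stale queue entry for an already expanded cell
        closed.add(cell)
        trace.append(("expand", cell))

        if cell == goal:
            # Walk parent links back to the start to rebuild the path.
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1], trace

        r, c = cell
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            in_bounds = 0 <= nr < len(grid) and 0 <= nc < len(grid[nr])
            if not in_bounds or grid[nr][nc] == "#":
                continue
            new_cost = cost + 1
            if new_cost < best_cost.get((nr, nc), float("inf")):
                best_cost[(nr, nc)] = new_cost
                parent[(nr, nc)] = cell
                heapq.heappush(
                    frontier, (new_cost + h((nr, nc)), new_cost, (nr, nc))
                )

    return None, trace


if __name__ == "__main__":
    maze = ["..#",
            "..#",
            "..."]
    path, trace = astar_with_trace(maze, (0, 0), (2, 2))
    print("answer:", path)
    print("trace steps:", len(trace))
```

With a setup like this, "correct trace" means the expansion record really is what the search did, and "correct answer" means the path is valid. Those are two separate checks, which is what lets the summary above talk about right answers arriving with wrong or messy steps.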
