frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•9mo ago

Comments

tocs3•9mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

DHS pushes social media giants to dox anonymous accounts critical of ICE

https://mashable.com/article/ai-hard-drive-hdd-shortages-western-digital-sold-out
1•jmward01•35s ago•0 comments

A Love Letter to Plastic Junk: Nintendo's Weirdest Inventions

https://www.itstheorbit.com/p/nintendos-beautifully-unnecessary
1•myth_drannon•7m ago•0 comments

Standard Notes – End-to-End Encrypted Notes App from Proton

https://standardnotes.com
2•throwoutway•20m ago•2 comments

The Rediscovery of 103 Hokusai Lost Sketches (2021)

https://japan-forward.com/eternal-hokusai-the-rediscovery-of-103-hokusai-lost-sketches/
1•debo_•22m ago•0 comments

Show HN: Katipo is a minimal alternative internet with a Vulkan based browser

https://github.com/mjdave/katipo
2•majicDave•23m ago•0 comments

Palo Alto Networks Expands Identity Security with CyberArk Deal and TASE Listing

https://finance.yahoo.com/news/palo-alto-networks-expands-identity-091230669.html
1•Khaine•23m ago•0 comments

India's pollution is becoming an economic roadblock

https://www.economist.com/asia/2026/02/15/indias-pollution-is-becoming-an-economic-roadblock
1•andsoitis•25m ago•0 comments

Show HN: OpenSlimedit – Cut AI coding token usage by 21-45% with zero config

https://github.com/ASidorenkoCode/openslimedit
1•aSidorenkoCode•27m ago•3 comments

Why do I not use "AI" at OSNews?

https://www.osnews.com/story/144405/why-do-i-not-use-ai-at-osnews/
2•cdvonstinkpot•27m ago•0 comments

Show HN: Image to Photo

https://imagetophoto.com
1•wangmao•29m ago•1 comments

A NEW Windows‑native SSH agent

https://github.com/Sanmilie/PKCS11SSHAgent
1•Sanmilie•32m ago•1 comments

EPA ends credits for automatic start-stop vehicle ignition

https://apnews.com/article/climate-zeldin-automakers-vehicles-consumers-dca74900298e45485987b87c3...
2•geox•33m ago•1 comments

ASUKA.md – The SOUL.md for Eva Asuka

https://asuka.md
1•jetsquirrel•37m ago•1 comments

Live Variables in the Verse Language

https://twitter.com/vukefn/status/2022809591051096233
1•DustinEchoes•43m ago•1 comments

Arm wants a bigger slice of the chip business

https://www.economist.com/business/2026/02/12/arm-wants-a-bigger-slice-of-the-chip-business
7•andsoitis•44m ago•2 comments

Michelangelo Made His First Masterpiece When He Was 12 Years Old

https://www.thisiscolossal.com/2026/01/michelangelo-first-painting-torment-of-saint-anthony/
1•andsoitis•48m ago•0 comments

I tried to prompt-engineer a writing style and got a psychoanalysis instead

https://executelater.substack.com/p/how-i-taught-claude-to-write-like
1•NarratorTD•54m ago•1 comments

2026, the Last Year of the AI Bubble

https://medium.com/predict/2026-the-last-year-of-the-bubble-the-ai-empire-begins-to-crumble-1bb5e...
3•WaitWaitWha•58m ago•2 comments

AI Is Getting Scary Good at Making Predictions

https://www.theatlantic.com/technology/2026/02/ai-prediction-human-forecasters/685955/
1•vinhnx•58m ago•1 comments

Getting the Main Thing Right

https://www.seangoedecke.com/getting-the-main-thing-right/
1•Garbage•59m ago•0 comments

Printing Films Archive

https://printingfilms.com
1•vinhnx•59m ago•0 comments

EpsteinDB – Making the Epstein Files More Searchable

https://epsteindb.com/
1•Lbesecker195•1h ago•1 comments

Show HN: Talk2Code – Text your codebase from your phone (~150 lines of Python)

https://github.com/dchisholm125/Talk2Code
1•dchisholm125•1h ago•1 comments

Show HN: Ls-f a fast, zero-dependency ls with Nerd Font icons (Rust rewrite)

https://github.com/swadhinbiswas/ls-f
1•0x0003r•1h ago•0 comments

I manage my Guix System configs

https://www.terracrypt.net/posts/guix-config.html
1•todsacerdoti•1h ago•0 comments

Test my live Tempest AI Metrics Dashboard on the web

http://davepl.dyns.org:8765/
2•davepl•1h ago•1 comments

Show HN: MultiWA - Open-source self-hosted WhatsApp API Gateway

https://github.com/ribato22/MultiWA
1•ribato•1h ago•0 comments

Trapped in the Hell of Social Comparison

https://www.noahpinion.blog/p/trapped-in-the-hell-of-social-comparison
1•herbertl•1h ago•0 comments

Show HN: Self-hosted alternative to Goodreads. Own your reading data

https://github.com/raghavan/BookSync
2•raghavankl•1h ago•1 comments

Artificial Intelligence and Magical Thinking

http://edwardfeser.blogspot.com/2019/03/artificial-intelligence-and-magical.html
2•b-man•1h ago•0 comments