frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•9mo ago

Comments

tocs3•9mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Claude Code Remote Control

https://code.claude.com/docs/en/remote-control
1•mfiguiere•1m ago•0 comments

Microsoft AutoRest Deprecation – July 1, 2026

https://github.com/Azure/autorest/issues/5175
1•cyptus•1m ago•0 comments

Learn Graph Theory Interactively with D3 Animations

https://d3gt.com
1•elviejo•1m ago•0 comments

Principles for independent research in a digital world

https://www.brookings.edu/articles/6-principles-for-independent-research-in-a-digital-world/
1•hn_acker•2m ago•0 comments

How to grow organically with SEO in 2026

https://www.kerplexir.com/
1•ajagatobby•3m ago•1 comments

Cognitive biases of talent scouts can undermine sports teams' success

https://phys.org/news/2026-02-cognitive-biases-talent-scouts-undermine.html
1•PaulHoule•4m ago•0 comments

Why Are American Passenger Trains So Slow?

https://americanaffairsjournal.org/2026/02/why-are-american-passenger-trains-slow/
1•idleplant•4m ago•0 comments

Show HN: TurkishSieve CPU/GPU prime sieve found errors in Nicely's tables

https://github.com/bilgisofttr/turkishsieve
1•bilgisoft•4m ago•0 comments

Show HN: ComfyUI for HuggingFace Spaces

https://huggingui.vercel.app/
2•SpyCoder77•6m ago•0 comments

Dream Recorder AI – a portal to your subconscious

https://dreamrecorder.ai/
2•level87•6m ago•0 comments

Retired EV Batteries Scored a New Gig: Bolstering Texas' Grid

https://insideclimatenews.org/news/17022026/retired-ev-batteries-bolster-texas-grid/
1•hn_acker•6m ago•0 comments

Ask HN: Is it worth avoiding AI while making a game?

1•2muchclout•7m ago•0 comments

Iready in Schools Problem

https://unherd.com/2026/02/why-your-kid-hates-learning-apps/?edition=us
1•marysminefnuf•8m ago•0 comments

Anthropic digs in heels in dispute with Pentagon

https://www.reuters.com/world/anthropic-digs-heels-dispute-with-pentagon-source-says-2026-02-24/
2•vforgione•9m ago•0 comments

Proposal: Add "AI generated" as a flag reason

https://lobste.rs/s/rkjpob/proposal_add_ai_generated_as_flag_reason
4•lr0•9m ago•1 comments

VibePad – Control Coding agents with a gamepad from your couch (macOS)

https://vibepad.now
1•ignatovv•9m ago•1 comments

Screen Power Saving in the Linux Console

https://changelog.complete.org/archives/42061-screen-power-saving-in-the-linux-console
1•iamnothere•11m ago•0 comments

Plastics recycled into acetic acid using sunlight

https://www.heise.de/en/news/Plastics-recycled-into-acetic-acid-using-sunlight-11188232.html
2•doener•11m ago•0 comments

Reached 330 stars on our open source agentic platform

https://github.com/coasty-ai/open-computer-use
1•nkov47as•13m ago•0 comments

Show HN: Mnemosyne – Cognitive memory OS for AI agents (zero LLM calls)

https://github.com/28naem-del/mnemosyne
3•mnemosy•13m ago•1 comments

Time-Locked Cryptography

https://farlow.dev/2023/05/23/when-you-read-this-ill-be-gone
2•lispybanana•14m ago•0 comments

DNB thinks it can become less dependent on American ICT giants within five years

https://nos.nl/artikel/2603848-in-vijf-jaar-minder-afhankelijk-van-amerikaanse-ict-reuzen-kan-den...
2•doener•17m ago•0 comments

Gitzy is now on TestFlight A modern, native iOS Git client

https://testflight.apple.com/join/SB16NCfr
3•marc0janssen•18m ago•1 comments

Maraapunisaurus § Disappearance of the specimen

https://en.wikipedia.org/wiki/Maraapunisaurus
1•quuxplusone•18m ago•1 comments

Show HN: Moonshine Open-Weights STT models – higher accuracy than WhisperLargev3

https://github.com/moonshine-ai/moonshine
3•petewarden•22m ago•0 comments

Pi – a minimal terminal coding harness

https://pi.dev
3•kristianpaul•22m ago•0 comments

A small tool I made for local LLMs: LLM-neofetch-plus

1•HFerrahoglu•22m ago•0 comments

Show HN: An "earned autonomy" architecture for AI agents using Subjective Logic

https://kenschachter.substack.com/p/earned-autonomy
1•ken_neth•23m ago•0 comments

Show HN: I solo-built Sovereign-Mohawk – FL with 500K nodes and 55% BFT

https://rwilliamspbg-ops.github.io/Sovereign-Mohawk-Proto/
1•rwilliamspbgops•23m ago•0 comments

Show HN: Recursively apply patterns for pathfinding

https://pattern-pathfinder.vercel.app/?fixtureId=%7B%22path%22%3A%22site%2Fexamples%2F_intro.fixt...
5•seveibar•25m ago•0 comments