frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•6mo ago

Comments

tocs3•6mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Effective harnesses for long-running agents

https://www.anthropic.com/engineering/effective-harnesses-for-long-running-agents
1•aratahikaru5•55s ago•0 comments

Show HN: Ghostty-Web – Ghostty in the Browser

https://github.com/coder/ghostty-web
1•jonayers_•2m ago•0 comments

The Moron Filter Effect

https://moonbearmusings.com/the-moron-filter-effect/
1•ronsor•3m ago•0 comments

The Griswold Effect: How the Holiday Economy Makes Joy Expensive

https://indieinvestor.substack.com/p/the-griswold-effect-how-the-holiday
1•IndieInvestor•4m ago•0 comments

AgentLens: The Future of Evaluation Is Agentic

https://contextual.ai/blog/agentlens-the-future-of-evaluation-is-agentic
3•shikib•7m ago•1 comments

Chinese researchers simulate large-scale electronic warfare against Starlink

https://www.scmp.com/news/china/science/article/3333523/chinese-researchers-simulate-large-scale-...
2•2OEH8eoCRo0•7m ago•0 comments

Is it disruption, or is it theft?

https://www.chrbutler.com/disruption-or-theft
5•delaugust•7m ago•0 comments

Show HN: We built a free tool to help game devs better set Steam regional prices

https://hushcrasher.com/tools/steam-regional-pricing/
1•juliebelz•8m ago•0 comments

One mile on bike is a 42¢ economic gain to society, a mile driving is a 20¢ loss

https://grist.org/biking/one-mile-on-a-bike-is-a-42-economic-gain-to-society-one-mile-driving-is-...
2•voxadam•9m ago•1 comments

Why Content Is King in Tech Events

https://substack.com/inbox/post/179317312
1•Lindsayterra•9m ago•0 comments

NASA Rover Makes a 'Shocking' Discovery: Lightning on Mars

https://www.nytimes.com/2025/11/26/science/mars-lightning-nasa.html
1•gnabgib•10m ago•0 comments

How stealth addresses work in Monero

https://www.johndcook.com/blog/2025/11/24/monero-stealth-addresses/
1•azhenley•11m ago•0 comments

A Brief History of Large Language Models

https://koenvangilst.nl/lab/brief-history-of-llms
1•vnglst•12m ago•0 comments

Rethinking Innovation in the Real World

https://www.fattonys.net/episodes/coastas-papaikonomou-rethinking-innovation-in-the-real-world
1•Incerto•13m ago•1 comments

Rey, the Admin of 'Scattered Lapsus$ Hunters'

https://krebsonsecurity.com/2025/11/meet-rey-the-admin-of-scattered-lapsus-hunters/
1•todsacerdoti•14m ago•0 comments

Show HN: pyproject – A linter for your Python project configuration

https://github.com/terror/pyproject
2•crap•14m ago•0 comments

Show HN: Aigit – AI-powered Git CLI for commit messages, branch names, and PRs

https://github.com/hardiksondagar/aigit
2•hardiksondagar•15m ago•0 comments

Deriving Reverse-Time Stochastic Differential Equations (SDEs)

https://jiha-kim.github.io/posts/deriving-reverse-time-stochastic-differential-equations-sdes/
1•ibobev•15m ago•0 comments

AlphaChip transformed computer chip design (2024)

https://deepmind.google/blog/how-alphachip-transformed-computer-chip-design/
1•karmakaze•15m ago•0 comments

Urgent Medical Device Recall: FreeStyle Libre 3 Plus Sensors

https://www.freestylecheck.com/ca-en/home.html
1•doener•16m ago•0 comments

Discrete Calculus

https://jiha-kim.github.io/posts/discrete-calculus/
1•ibobev•16m ago•0 comments

HP plans to save millions by laying off thousands, ramping up AI use

https://arstechnica.com/information-technology/2025/11/hp-plans-to-save-millions-by-laying-off-th...
1•stalfosknight•17m ago•1 comments

AI and Child Processes <3

https://realamanazad.substack.com/p/ai-child-processes-3
1•namadaza•17m ago•0 comments

LLM live model ranker in latency

https://metrik-dashboard.vercel.app/
1•mbouassa•19m ago•0 comments

Pozsar's Bretton Woods III: Three Years Later

https://philippdubach.com/2025/10/26/pozsars-bretton-woods-iii-three-years-later-2/2/
1•7777777phil•23m ago•1 comments

GrapheneOS is leaving France after receiving threats from law enforcement

https://grapheneos.social/@GrapheneOS/115606319562587450
2•a022311•25m ago•2 comments

DRAM prices are spiking, but I don't trust the industry's why

https://www.xda-developers.com/dram-prices-spiking-dont-trust-industry-reasons/
4•binarycrusader•26m ago•0 comments

USPTO issues revised inventorship guidance for AI-assisted inventions [pdf]

https://public-inspection.federalregister.gov/2025-21457.pdf
2•kjhughes•31m ago•0 comments

Plotnine – A Grammar of Graphics for Python

https://github.com/has2k1/plotnine
2•nothrowaways•33m ago•0 comments

Show HN: MightyGrep

https://ksylvestre.itch.io/mightygrep
6•zeeeeeebo•35m ago•0 comments