frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•1y ago

Comments

tocs3•1y ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Co-Founder Honch

https://www.honch.io/
1•Raeedzz•34s ago•0 comments

Hi HN: Loopy agent, meta-loop engineer my Claude Code and codex sessions

https://github.com/secretbuilds/loopy
1•secretbuilds•14m ago•1 comments

Pac-Man, but You're the Ghost

https://garrit.xyz/posts/2026-06-13-pac-man-but-you-re-the-ghost
1•mindracer•18m ago•0 comments

Ask HN: Do you buy the domain first or build first then domain?

1•akashwadhwani35•18m ago•0 comments

PeopleSoft 0-day affecting organizations steals gigabytes of data

https://arstechnica.com/security/2026/06/peoplesoft-0-day-affecting-hundreds-of-organizations-ste...
2•geoffbp•21m ago•0 comments

Track tokens usage and AI Subscriptions across major AI platforms

https://www.tokens4breakfast.app
1•1Kapish•27m ago•1 comments

Software Architecture Guide

https://martinfowler.com/architecture/
2•laxmena•29m ago•0 comments

Show HN: Winamp's Geiss and Milkdrop ported to WebGL

https://milkbar.fm/
2•vlbeta•40m ago•0 comments

OmniCloud is a full-stack cloud drive aggregation platform

https://github.com/dimartarmizi/OmniCloud
1•tonyhart7•43m ago•0 comments

Free SQL→ER diagram tool, runs in the browser, nothing uploaded

https://sqltoerdiagram.com/
2•robhati•53m ago•0 comments

Show HN: I created a simple searchable list of abandoned WordPress Plugins

https://vimsy.io/plugin-graveyard
2•arximughal•54m ago•0 comments

Running Out of Context? No More

https://github.com/shrey1110-dotcom/CLAUDE_API_SAVER
1•otto_api•54m ago•0 comments

AAD-50: multi-cycle NVMe sanitize with per-cycle hardware verification

https://github.com/yonasabeselom/aad50
1•yonasabeselom•1h ago•0 comments

UK announces $1.5B AI infrastructure plan

https://www.reuters.com/world/uk/uk-sets-out-15-billion-ai-hardware-plan-with-supercomputer-chip-...
3•Soumya_Max•1h ago•2 comments

What old technology do you still use regularly?

2•Soumya_Max•1h ago•4 comments

What happens if you click the first link on every Wikipedia article? [video]

https://www.youtube.com/watch?v=dpLG3DpfSlM
2•wilsonhobbs•1h ago•0 comments

Zero-knowledge SAT validation engine

https://ptsf-engine.vercel.app/
1•curio_Pol_curio•1h ago•0 comments

Type Theory Forall #62 – Dependent Haskell – Vladislav Zavialov [video]

https://www.youtube.com/watch?v=COBZZb6Iu2Q
5•matt_d•1h ago•0 comments

Automating my job away

https://austinhenley.com/blog/automatingmyjob.html
2•azhenley•1h ago•0 comments

The Redistribution of Housing Wealth Caused by Rent Control [pdf]

https://www.rhawa.org/file/secure/shs-the-impact-of-rent-control-in-st-paul.pdf
51•luu•1h ago•46 comments

Half-Life able to run on ReactOS

https://xcancel.com/reactos/status/2064839936059011207
3•zdw•1h ago•1 comments

Making Claude a Chemist

https://www.anthropic.com/research/making-claude-a-chemist
5•gmays•1h ago•0 comments

Life Evolved

https://github.com/harrisjerico30-dotcom/G4-construct-
4•jericoharris•1h ago•0 comments

Weave: Merging based on language structure and not lines

https://ataraxy-labs.github.io/weave/
7•rohanat•1h ago•1 comments

Show HN: Tabby – sleeps tabs based on RAM pressure, not fixed timers

https://meettabby.netlify.app/
3•justbuilding•1h ago•0 comments

Show HN: Bastion – isolated Linux VMs for background coding agents

https://bastion.computer/
3•almostlit•1h ago•0 comments

Thirty Years My Open Source Project: The ApeSDK/Noble Ape

https://apesdk.com/
1•barbalet•2h ago•1 comments

The Rise of Housing Nationalism in Canada and Transnational Ownership Patterns

https://open.library.ubc.ca/media/stream/pdf/52383/1.0438798/5
4•luu•2h ago•0 comments

I will put a talking persona on your website

https://www.usegoblin.xyz
2•Obi-•2h ago•6 comments

AUR Malware Attack: Do Not Update [video]

https://www.youtube.com/watch?v=WoxR7fGl4CI
1•kshri24•2h ago•1 comments