frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•10mo ago

Comments

tocs3•10mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Show HN: Git why – log your agent reasoning trace along your code

https://hexapode.github.io/git-why/
1•pierre•4m ago•0 comments

AGI Is the Wrong Word

https://breaking-changes.blog/agi-is-here-part-2/
1•oakhan3•4m ago•0 comments

Democratic AI to serve the public – OneProject.org

https://oneproject.org/how-to-make-ai-serve-the-public/
1•cucumberbund•8m ago•0 comments

NetWatch v0.11.0 – TUI network diagnostics, now with connection filtering

https://github.com/matthart1983/netwatch
1•matthart1983•8m ago•0 comments

I built a free 30-day habit tracker in Google Sheets

1•polaritymaking•10m ago•0 comments

Ask HN: What is the most annoying part of scheduling meetings?

1•preston-kwei•10m ago•0 comments

Chromium Fingerprint Simulation Framework Assessment

https://www.questcontents.com/2026/04/chromium-fingerprint-simulation.html
1•imarand•11m ago•0 comments

Ask HN: Has anyone reconsidered Antivirus software after recent security news?

2•pants2•17m ago•0 comments

PandemicAlarm – Free disease outbreak tracker aggregating WHO, CDC, and ProMED

https://pandemicalarm.com
1•PixelShipper•17m ago•0 comments

I built a free Canadian tax estimator for self-employed people

https://nextindata.substack.com/p/i-built-a-free-canadian-tax-estimator
1•nazanki•21m ago•0 comments

AI Is Tipping the Scales Toward Hackers After Mythos Release

https://www.nbcnews.com/tech/security/anthropic-claude-mythos-ai-hackers-cybersecurity-vulnerabil...
4•thywis•23m ago•0 comments

Transaction-level provenance for AI art certificate/signed/license on every sale

https://arcvelvetos.web.app/verify?id=AwCCW6DLQgA4s5XVZ86X&type=sale
2•PCasinoAVOS•24m ago•0 comments

AI still can't figure out PowerPoint

https://www.perspectives.plus/p/ai-still-cant-figure-out-powerpoint
2•jukkan•24m ago•0 comments

LPM 1.0 – Video-Based Character Performance Model

https://large-performance-model.github.io/
1•LopRabbit•24m ago•0 comments

Anyone know how I can cancel this? I dont want it

https://old.reddit.com/r/wallstreetbets/comments/1siq4m2/anyone_know_how_i_can_cancel_this_i_dont...
2•simonpure•35m ago•0 comments

Tell HN: See the AI Doc

2•linsomniac•38m ago•1 comments

Why Aren't We Uv Yet?

https://aleyan.com/blog/2026-why-arent-we-uv-yet/
2•birdculture•43m ago•2 comments

Apple Silicon and Virtual Machines: Beating the 2 VM Limit (2023)

https://khronokernel.com/macos/2023/08/08/AS-VM.html
44•krackers•44m ago•8 comments

Heartbeat – open implementation of KAIROS, the always-on agent hiden in Claude C

https://github.com/uameer/heartbeat
1•usmame•44m ago•1 comments

The Polycorp Poly 1. New Zealand's school computer

https://www.classic-computers.org.nz/collection/poly1.htm
2•rbanffy•46m ago•0 comments

Ask HN: Why have we not stepped back on the moon again?

2•chirau•49m ago•2 comments

Ask HN: How did you specialize as a software engineer?

2•legerdemain•56m ago•3 comments

Is "Tokenmaxxing" a Flex?

https://www.businessinsider.com/tokenmaxxing-ai-token-leaderboards-debate-2026-4
2•pascal-maker•1h ago•2 comments

Git fixup is magic (and Magit is too)

https://arialdomartini.github.io/git-fixup
2•fanf2•1h ago•0 comments

Trump's World Liberty Financial borrows $75M using its own token as collateral

https://www.coindesk.com/markets/2026/04/09/trump-s-world-liberty-financial-borrows-usd75-million...
9•JohnTHaller•1h ago•0 comments

Show HN: Beta Testing needed for my package Trustcheck

https://github.com/Halfblood-Prince/trustcheck
1•halfblood1010•1h ago•1 comments

Ask HN: Agentic Permutation of Testing Paths In A System

4•davidajackson•1h ago•0 comments

Amazon Luna Will No Longer Allow Owners to Buy Games, Access Game Stores

https://www.ign.com/articles/amazon-luna-will-no-longer-allow-owners-to-buy-games-access-game-sto...
6•surgical_fire•1h ago•1 comments

Living Memory Inference

https://github.com/alash3al/loci
2•alash3al•1h ago•0 comments

YouTube Premium price increase to take effect in June

https://www.latimes.com/entertainment-arts/story/2026-04-10/youtube-premium-price-increase
2•obilgic•1h ago•0 comments