frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•11mo ago

Comments

tocs3•11mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Show HN: Userscript that shows HN submission count for any page

https://gist.github.com/overflowy/75b2df8a5349e51c5f83b5a2719d80d1
1•overflowy•5s ago•0 comments

Show HN: Navox Agents – 8 AI Agents for Claude Code with HITL Checkpoints

https://github.com/navox-labs/agents
1•nahrin•26s ago•0 comments

Quantum 'Jamming' Explores the Fundamental Principles of Nature

https://www.quantamagazine.org/quantum-jamming-explores-the-truly-fundamental-principles-of-natur...
1•lschueller•41s ago•0 comments

Books I Like: SICP

https://users.cms.caltech.edu/~mvanier/blog/sicp/
1•tehnub•1m ago•0 comments

The Computers of WarGames – What Was Real? What Was Fake? [video]

https://www.youtube.com/watch?v=y7thb-SC09g
1•LatencyKills•1m ago•0 comments

The Road to Responsive IntelliJ-Based IDEs

https://blog.jetbrains.com/platform/2026/04/road-to-responsive-ides/
2•saikatsg•3m ago•0 comments

Show HN: RepoGauge – save token costs and compare agents on your own repos

https://repogauge.org
1•siliconc0w•5m ago•0 comments

For founders who feel like nobody's watching

https://autheona.com/tools/audience-visualizer/
1•lasgawe•7m ago•1 comments

Show HN: Smith – AI Agent Orchestrator

https://getsmith.dev/
1•netnameus•7m ago•0 comments

Shift-Left Code Quality: Inside DebtDrone CLI 2.0.0 and a Dual-Mode Architecture

https://www.endrilickollari.com/blog/debtdrone-cli-2-release
1•endrilickollari•7m ago•0 comments

Discover More of the Fediverse with Tags.pub

https://activitypub.blog/2026/04/02/discover-more-of-the-fediverse-with-tags-pub/
1•ZacnyLos•9m ago•0 comments

Iceland Just Got Its First Mosquitoes

https://gizmodo.com/iceland-just-got-its-first-mosquitoes-scientists-arent-ready-for-what-comes-n...
1•speckx•10m ago•0 comments

KGB Building and Cells

https://www.dark-tourism.com/index.php/941-riga-kgb-building
1•jruohonen•11m ago•0 comments

Finance ministers and top bankers raise serious concerns about Mythos model

https://www.bbc.com/news/articles/c2ev24yx4rmo
1•reconnecting•13m ago•1 comments

Ask HN: How do you search the web programmatically these days?

1•coreyp_1•13m ago•1 comments

Show HN: web-pinentry: a pinentry program for decrypting server-side passwords

https://codeberg.org/seanhly/web-pinentry
1•seanhly•14m ago•0 comments

AI in Onboarding

https://app.soon.works/create
1•Olaf_Soon•14m ago•0 comments

A List of Zettelkasten Resources

https://github.com/fhoehl/awesome-zettelkasten
1•marukodo•15m ago•0 comments

Tesla tells HW3 owner to 'be patient' after 7 years of waiting for FSD

https://electrek.co/2026/04/17/tesla-hw3-owners-be-patient-7-years-fsd/
3•breve•16m ago•0 comments

Show HN: Arc Browser UI Skeleton (Next.js)

https://arc-ui-skeleton.vercel.app/
2•SpyCoder77•16m ago•0 comments

Ask HN: I built a Blox Fruits trading tool – feedback?

1•BFV•16m ago•0 comments

Year of the IPv6 Overlay Network

https://www.defined.net/blog/year-of-the-ipv6-overlay-network/
1•stock_toaster•17m ago•0 comments

Choosing Entrepreneurship over a Corporate Career

https://levels.io/choosing-entrepreneurship/
2•tylerdane•18m ago•1 comments

Smoglandia: Smog was killing L.A., and a Caltech chemist found the murder weapon

https://www.latimes.com/california/story/2026-03-26/smoglandia-smog-was-killing-l-a-caltech-chemi...
1•PaulHoule•18m ago•0 comments

Optionality Curse

https://www.karanjanthe.me/posts/optionality-curse/
1•KMJ-007•18m ago•1 comments

Retro Is the Future

https://blog.absurdpirate.com/retro-is-the-future/
1•speckx•18m ago•0 comments

Easy code and work AI agent system: auto, asynchronous, concurrency, efficiently

https://github.com/vcaesar/codg
2•veni0•19m ago•1 comments

Tesla Cybertruck sales inflated: SpaceX bought 1,279 units

https://electrek.co/2026/04/16/tesla-cybertruck-spacex-1279-q4-sales-inflated/
2•doener•20m ago•0 comments

Django Admin–style panel for NestJS (TypeORM/Prisma, filters, actions)

https://github.com/xtrinch/nestjs-dj-admin
1•xtrinch•20m ago•0 comments

European Directory and Alternatives

1•EuropeanSwitch•21m ago•0 comments