frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•9mo ago

Comments

tocs3•9mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

The Dutch Revolt Was Europe's First Bourgeois Revolution

https://jacobin.com/2026/03/dutch-revolt-bourgeois-revolutions-netherlands-reformation/
4•PaulHoule•1m ago•0 comments

Show HN: CodeTrackr – open-source WakaTime alternative with real-time stats

https://github.com/livrasand/CodeTrackr
2•livrasand•2m ago•0 comments

Fosdemflix

https://fosdemflix.netlify.app/
1•todsacerdoti•3m ago•0 comments

Game about Data of America

https://americaindata.com/
2•fidicen•3m ago•0 comments

New ThinkPad T14 is a repairable laptop with LPCAMM2 RAM

https://liliputing.com/lenovo-thinkpad-t14-gen-7-is-a-repairable-laptop-with-lpcamm2-memory/
1•heresie-dabord•4m ago•0 comments

Ion Drift is an Asteroids-style roguelite web game using assets from Endless Sky

https://iondrift.net/
1•meub•4m ago•1 comments

Show HN: OpenEHR-CLI – CLI and MCP server for working with openEHR artifacts

https://github.com/CaboLabs/openEHR-CLI
1•ppazos•7m ago•0 comments

Looking for testers for a location-based AI experiment

1•sharkgil•10m ago•0 comments

We're Training Students to Write Worse and to Use AI to Prove They're Not Robots

https://www.techdirt.com/2026/03/06/were-training-students-to-write-worse-to-prove-theyre-not-rob...
1•hn_acker•10m ago•3 comments

Show HN: We're on Women's Day Sale. Sign Up to Playtest Shop Crush

https://store.steampowered.com/app/2961120/Shop_Crush/
1•hollowlimb•11m ago•0 comments

Huawei PanguLM [pdf]

https://support.huaweicloud.com/intl/en-us/productdesc-pangulm/PanguLM%20Service_Service%20Overvi...
1•zlu•12m ago•0 comments

What's the deal with "age verification" and computers?

https://rudd-o.com/linux-and-free-software/what-is-going-on-with-age-verification-in-computers
1•Magnusmaster•14m ago•0 comments

Show HN: BottomUp- Translate Your Thoughts So AI Can Work For Your Neurotype

https://www.bottomuptool.com/
1•claythedesigner•15m ago•0 comments

SPA vs. Hypermedia: Real-World Performance Under Load

https://zweiundeins.gmbh/en/methodology/spa-vs-hypermedia-real-world-performance-under-load
1•todsacerdoti•15m ago•0 comments

Steve Jobs predicted "vibe coding" in 1997 [video]

https://twitter.com/musaabHQ/status/1582671928271118337
2•mba_lmh•16m ago•0 comments

Brain Computer Interfaces Are Now Giving Sight Back to the Blind

https://garryslist.org/posts/brain-computer-interfaces-are-now-giving-sight-back-to-the-blind
2•magoghm•16m ago•0 comments

Show HN: Hatice – Autonomous Issue Orchestration with Claude Code Agent SDK

https://github.com/mksglu/hatice/tree/main
1•mksglu2•16m ago•0 comments

Show HN: Free salary converter with 3,400 neighborhood comparisons in 182 cities

https://salary-converter.com/
2•jay7gr•18m ago•0 comments

[Removed]

https://mystudentfailedtheirmid.substack.com/p/if-muslims-accept-noahs-950-years
1•darkhorse13•20m ago•0 comments

More Is Different for Intelligence

https://fulcrumresearch.ai/2026/03/05/more-is-different-for-intelligence.html
2•etherio•21m ago•0 comments

What if CLIs exposed machine-readable contracts for AI agents?

https://github.com/sonde-sh/sonde
1•valentinprgnd•23m ago•1 comments

The Monk at the Cocktail Party

https://www.sebs.website/the-monk-at-the-cocktail-party
1•Incerto•23m ago•0 comments

Weather Report #1

https://at-news.leaflet.pub/3mgg7ie7tdk2o
2•Kye•24m ago•0 comments

A Million Simulated Seasons [video]

https://www.youtube.com/watch?v=Vv9wpQIGZDw
1•carlos-menezes•24m ago•0 comments

Incrementally parsing LLM Markdown streams on server/client

https://github.com/nimeshnayaju/markdown-parser
1•nayajunimesh•24m ago•1 comments

Show HN: Kula – Lightweight, self-contained Linux server monitoring tool

https://github.com/c0m4r/kula
2•c0m4r•25m ago•0 comments

Show HN: Cross-Claude MCP – Let multiple Claude instances talk to each other

https://github.com/rblank9/cross-claude-mcp
2•rblank9•25m ago•0 comments

Poll

2•consumer451•26m ago•1 comments

I'm 60 years old. Claude Code has ignited a passion again

7•shannoncc•27m ago•1 comments

SYNX – a config format that parses 67× faster than YAML, built for AI pipelines

https://github.com/kaiserrberg/synx-format
2•Kaiserrberg•27m ago•0 comments