The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•10mo ago

Comments

tocs3•10mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps, and surprisingly, they still perform about as well as, and sometimes even better than, those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
