The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•10mo ago

Comments

tocs3•10mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps, and surprisingly, they still perform about the same as, and sometimes even better than, those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
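
For anyone unfamiliar with the setup, here is a rough sketch of what "reasoning steps based on A* search" could look like. This is only an illustration under my own assumptions, not the paper's code: the grid encoding, the function name `astar_with_trace`, and the `create`/`close` trace labels are all made up for the example. The point is that a formal solver emits a step-by-step record alongside the final path, so both the trace and the answer can be checked mechanically.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """A* on a 4-connected grid; returns (path, trace).

    grid is a list of strings where '#' marks a wall.
    Trace entries are (op, cell, g, h) tuples with op in {'create', 'close'}.
    This trace format is a hypothetical stand-in for a solver's "thoughts".
    """
    def h(cell):
        # Manhattan distance: admissible for unit-cost 4-connected moves.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    trace = [("create", start, 0, h(start))]
    frontier = [(h(start), 0, start)]  # heap of (f, g, cell)
    best_g = {start: 0}
    parent = {start: None}
    closed = set()

    while frontier:
        _, g, cell = heapq.heappop(frontier)
        if cell in closed:
            continue  # stale heap entry for an already expanded cell
        closed.add(cell)
        trace.append(("close", cell, g, h(cell)))

        if cell == goal:
            # Walk parent links back to the start to recover the path.
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1], trace

        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue  # off the grid
            if grid[nr][nc] == "#":
                continue  # wall
            ng = g + 1
            if ng < best_g.get(nxt, float("inf")):
                best_g[nxt] = ng
                parent[nxt] = cell
                heapq.heappush(frontier, (ng + h(nxt), ng, nxt))
                trace.append(("create", nxt, ng, h(nxt)))

    return None, trace  # goal unreachable


if __name__ == "__main__":
    path, trace = astar_with_trace(["..#", "...", "#.."], (0, 0), (2, 2))
    print("path:", path)
    for step in trace:
        print(step)
```

In the experiment described above, a model would be trained on pairs like (trace, path). Swapping the trace for an unrelated or corrupted one while keeping the correct path is the kind of manipulation the summary says barely hurt performance.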
