frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•9mo ago

Comments

tocs3•9mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Hedystia 1.10 – Type Mastery is here

https://docs.hedystia.com
1•Zastinian•51s ago•0 comments

Laid off? Start a small business. Save hours of research

https://startinstates.com/guides
1•GKakhiani•1m ago•0 comments

Apple's App Store in China gets lower 25% commission to appease regulators

https://appleinsider.com/articles/26/03/13/apples-app-store-in-china-gets-lower-25-commission-to-...
1•spenvo•1m ago•0 comments

Leaderboard of Leaderboards – A Real-Time Meta-Ranking of AI Benchmarks

https://huggingface.co/posts/mayafree/802385854425752
1•seawolf2357•3m ago•0 comments

Glimpse: Native macOS micro-UI for scripts and agents

https://github.com/hazat/glimpse
1•rahimnathwani•4m ago•0 comments

List of animals awarded human credentials

https://en.wikipedia.org/wiki/List_of_animals_awarded_human_credentials
1•staticshock•5m ago•0 comments

Pi-generative-UI: Claude.ai's generative UI reverse-engineered, rebuilt for pi

https://github.com/Michaelliv/pi-generative-ui
1•rahimnathwani•7m ago•0 comments

Stout: A drop-in replacement for Homebrew CLI that's 10-100x for most operations

https://github.com/neul-labs/stout
2•sea-gold•11m ago•0 comments

Tome: Open-source documentation platform with Markdown

https://github.com/vxcozy/tome
1•noleary•12m ago•0 comments

Pie Day at the Massachusetts Institute of Tasteology

https://mitadmissions.org/blogs/entry/pi-day-2026-food-institute/
1•d0able•15m ago•0 comments

The exploitation paradox in open source

https://lwn.net/Articles/1058031/
1•signa11•16m ago•0 comments

Show HN: DarkMatter – P2P mesh networking protocol for AI agents

https://loseylabs.ai/docs
1•DanielJLosey•18m ago•0 comments

xAI/SpaceX Poach Two Cursor Leaders

https://twitter.com/elonmusk/status/2032179883858932101
2•bkls•19m ago•0 comments

Microsoft AI Launches Copilot Health

https://microsoft.ai/news/introducing-copilot-health/
1•alexmorley•22m ago•1 comments

We Automated RL Environment Engineering for $10

https://arxiv.org/abs/2603.12145
1•milkkarten•22m ago•0 comments

You can't escape coordination costs by throwing more AI agents at a problem

https://chatbotkit.com/reflections/coordination-has-limits
1•_pdp_•25m ago•0 comments

Show HN: Collected 1000 real business problems that don't have good software yet

https://painsignal.net/
1•gzoo•28m ago•0 comments

Loss of U.S. KC-135 Over Iraq

https://www.centcom.mil/MEDIA/PRESS-RELEASES/Press-Release-View/Article/4432850/loss-of-us-kc-135...
1•beedeebeedee•28m ago•1 comments

FDA-Approved Seizure Drug May Stop Alzheimer's Before It Starts

https://scitechdaily.com/fda-approved-seizure-drug-may-stop-alzheimers-before-it-starts/
2•bilsbie•34m ago•0 comments

Trump Is "Strongly Considering" Pardoning Julian Assange and Edward Snowden

https://twitter.com/BBMagaMom/status/2031894649472758268
2•karp773•37m ago•4 comments

Japan's Comeback: The Race to Build the 2-Nanometer Chip [video]

https://www.youtube.com/watch?v=JzmU5X0R0I8
2•mgh2•40m ago•0 comments

Golden Sets: Regression Engineering for Probabilistic Systems

https://heavythoughtcloud.com/knowledge/designing-a-golden-set
1•ryan-s•42m ago•0 comments

My Website's API Was Flagged as Phishing–and I Still Don't Know Why

https://www.chrisvogt.me/meta/gcp-firebase-api-suspended/
2•valentinemsmith•44m ago•2 comments

Western AI models "fail spectacularly" in farms and forests abroad

https://restofworld.org/2026/ai-agriculture-local-data/
1•i7l•47m ago•0 comments

The War Trump Doesn't Want to Talk About

https://www.newyorker.com/news/letter-from-trumps-washington/the-war-trump-doesnt-want-to-talk-about
3•petethomas•52m ago•1 comments

The State of the Culture (2024)

https://www.honest-broker.com/p/the-state-of-the-culture-2024
1•dgudkov•53m ago•0 comments

"This Is Not the Computer for You"

https://samhenri.gold/blog/20260312-this-is-not-the-computer-for-you/
5•MBCook•55m ago•2 comments

AI Killed My Job: Educators

https://www.bloodinthemachine.com/p/if-ai-is-writing-the-work-and-ai
2•cdrnsf•56m ago•0 comments

Harvey Weinstein gives first interview in six years, says prison life is "hell"

https://www.nme.com/news/film/harvey-weinstein-gives-first-interview-in-six-years-says-prison-lif...
1•tartoran•56m ago•0 comments

AI is great at writing code. It's terrible at making decisions

https://untangle.work/blog/ai-writes-code-terrible-at-decisions/
6•kdbgng•59m ago•0 comments