frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•11mo ago

Comments

tocs3•11mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Is Product Design another casualty of AI?

https://twitter.com/gokulr/status/2048132579099062313
1•regera•3m ago•0 comments

Sergey Brin Confronted Gavin Newsom

https://www.bloomberg.com/news/articles/2026-04-26/how-google-s-sergey-brin-helped-fuel-a-politic...
1•arunsivadasan•4m ago•0 comments

CachyOS Introduces New Default GUI Package Manager, Kyber for NVMe I/O Scheduler

https://www.phoronix.com/news/CachyOS-April-2026
2•Bender•6m ago•0 comments

I built a hiring platform that watches engineers work in a real CAD tool

3•mind_uncapped•8m ago•0 comments

Thermoacoustic heat pumps on the verge of commercial breakthrough

https://www.pv-magazine.com/2026/04/23/thermoacoustic-heat-pumps-on-the-verge-of-commercial-break...
2•simonebrunozzi•9m ago•0 comments

White House Correspondents' Dinner gunman Cole Allen's full anti-Trump manifesto

https://nypost.com/2026/04/26/us-news/read-whcd-gunman-cole-allens-full-anti-trump-manifesto/
2•Bender•9m ago•1 comments

Near-instantly aborting the worst pain imaginable with psychedelics

https://psychotechnology.substack.com/p/near-instantly-aborting-the-worst
3•maxutility•10m ago•0 comments

How OpenAI Kills Oracle

https://www.wheresyoured.at/how-openai-kills-oracle/
1•napolux•11m ago•0 comments

Who's developing Golden Dome's orbital interceptors if theyre ever built

https://arstechnica.com/space/2026/04/this-is-whos-developing-golden-domes-orbital-interceptors-i...
2•Bender•12m ago•0 comments

Why do note-taking apps store everything but connect nothing?

https://dotreader.info
3•efecerre•13m ago•1 comments

A.I. is creating engineers who can't think without it

https://www.koshyjohn.com/blog/ai-should-elevate-your-thinking-not-replace-it/
35•koshyjohn•13m ago•15 comments

If You Stop Hiring Juniors, Your Senior Engineers Own You

https://evalcode.com/posts/if-you-stop-hiring-juniors-your-seniors-own-you/
6•milkglass•14m ago•0 comments

The Shoe That Broke Running [video]

https://www.youtube.com/watch?v=pfIWxFIVP_Y
2•downboots•15m ago•0 comments

FreePG Project

https://freepg.org/
7•xeeeeeeeeeeenu•17m ago•0 comments

San Francisco, AI capital of the world, is an economic laggard

https://www.economist.com/finance-and-economics/2026/04/26/san-francisco-ai-capital-of-the-world-...
6•andsoitis•17m ago•0 comments

DevOps Is a Culture, Not a Team: What I've Learned Building at Scale

https://austinxyz.github.io/blogs/blog/2026/04/26/devops-at-scale
4•milkglass•18m ago•0 comments

What it's like to drive Route 66 in an EV

https://www.bbc.com/travel/article/20260424-what-its-like-to-drive-route-66-in-an-ev
3•mooreds•20m ago•0 comments

Have you tried Clean Architecture as foundation for your AI project?

83•esmelazy•21m ago•0 comments

Marx vs. the Robots (2017)

https://www.jstor.org/stable/45134296?seq=1
4•mooreds•22m ago•0 comments

Glyph: A sub-millisecond prompt-injection detector

https://github.com/enkryptai/glyph
4•divyanshusingh•22m ago•0 comments

RangeFlow: A different way to pick date ranges

https://rangeflow.raminmousavi.dev/
4•ramin2nt2•22m ago•0 comments

The Impacts of Parole Supervision

https://bfi.uchicago.edu/insights/the-impacts-of-parole-supervision/
4•mooreds•22m ago•0 comments

1,350 Days with Logseq

https://ianreppel.org/goodbye-logseq/
2•Brajeshwar•22m ago•0 comments

Show HN: Stop Destroying Your Charging Cables

https://www.bbc.com/future/article/20260421-your-bad-habits-are-destroying-your-charging-cables
6•wasimsk•24m ago•0 comments

Great Minds Should Not Think Alike, They Should Think Together

https://docs.eventsourcingdb.io/blog/2026/04/27/great-minds-should-not-think-alike-they-should-th...
2•goloroden•27m ago•0 comments

Why Start a Company Instead of Working in Aid

https://indevelopmentmag.com/exporters-without-borders-why-you-should-start-a-company-instead-of-...
3•paulpauper•29m ago•0 comments

Do these pictures prove tennis is dead?

https://bigthink.com/strange-maps/do-these-pictures-prove-tennis-is-dead/
3•paulpauper•29m ago•0 comments

Aging Gracefully in the Tech Industry

https://petersobot.com/blog/aging-gracefully-in-the-tech-industry/
6•itunpredictable•30m ago•0 comments

The case of missing American mushrooms

https://sftw.substack.com/p/the-case-of-missing-american-mushrooms
2•paulpauper•30m ago•0 comments

Browser as an Interactive Disassembly Exploration Tool (2015)

https://mrale.ph/blog/2015/03/29/browser-as-an-interactive-disassembler.html
4•downbad_•32m ago•1 comments