frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•11mo ago

Comments

tocs3•11mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Valgrind-3.27.0 Is Available

https://sourceforge.net/p/valgrind/mailman/message/59324626/
1•paulf38•2m ago•0 comments

Crystal Now Has Official Linux ARM64 Builds

https://crystal-lang.org/2026/04/07/official-linux-arm64-builds/
1•TheWiggles•5m ago•0 comments

The AI revolution – spamming 680PRs in 442 GitHub repos in 21 days in April

https://github.com/SAY-5
1•ddorian43•6m ago•1 comments

The first neural interface that transforms your thoughts into text

https://sabi.com/
1•filippofinke•11m ago•0 comments

Indent Is All You Need

https://blog.est.im/2026/stdin-11
1•est•15m ago•0 comments

The arrogant superbanker whose hubris brought Britain to its knees

https://inews.co.uk/opinion/arrogant-superbanker-hubris-brought-britain-knees-4331457
1•robtherobber•16m ago•0 comments

Making the Rails Default Job Queue Fiber-Based

https://paolino.me/solid-queue-doesnt-need-a-thread-per-job/
1•earcar•17m ago•0 comments

The Dirty Little Secret of AI (On a 1979 PDP-11) [video]

https://www.youtube.com/watch?v=OUE3FSIk46g
1•KnuthIsGod•22m ago•0 comments

HappyHorse AI – AI-Powered Equestrian Training

https://www.runhappyhorse.net
1•danielmateo773•23m ago•1 comments

Master of chaos wins $3M math prize for 'blowing up' equations

https://www.scientificamerican.com/article/master-of-chaos-wins-usd3m-math-prize-for-blowing-up-e...
1•signa11•23m ago•0 comments

Why the Original Task Manager Was Under 80K and Insanely Fast [video]

https://www.youtube.com/watch?v=OyN4LGyPwxc
2•KnuthIsGod•23m ago•0 comments

Influencers Are Spinning Nicotine as a 'Natural' Health Hack

https://www.nytimes.com/2026/04/20/well/nicotine-health-maha.html
2•SockThief•23m ago•2 comments

Details that make interfaces feel better

https://jakub.kr/writing/details-that-make-interfaces-feel-better
1•dg-ac•24m ago•0 comments

Watch a 200 Pound, 14" Drive from the 80s Boot Unix [video]

https://www.youtube.com/watch?v=kpC_9EmStAE
1•KnuthIsGod•25m ago•0 comments

My billing system, it could be useful to some

https://github.com/peterretief/billing-v2
2•peter_retief•27m ago•1 comments

ConvertHook – White-label widget that shows where brands rank in ChatGPT

https://converthook.com
1•joefromcomkey•29m ago•0 comments

Palantir manifesto reads like the ramblings of a comic book villain

https://www.engadget.com/big-tech/palantir-posted-a-manifesto-that-reads-like-the-ramblings-of-a-...
1•robtherobber•29m ago•0 comments

SUSE and Nvidia reveal a turnkey AI factory for sovereign enterprise workloads

https://thenewstack.io/suse-nvidia-ai-factory/
1•CrankyBear•29m ago•0 comments

Curlew conservation scheme makes breakthrough in Fermanagh

https://www.rte.ie/news/ireland/2026/0421/1569263-curlew-conservation/
1•austinallegro•30m ago•0 comments

Modern Front end Complexity: essential or accidental?

https://binaryigor.com/modern-frontend-complexity.html
1•birdculture•31m ago•0 comments

Show HN: WeTransfer Alternative for Developers

https://dlvr.sh/
3•mariusbolik•38m ago•0 comments

Keeping code quality high with AI agents

https://locastic.com/blog/keeping-code-quality-high-with-ai-agents
1•locastica•39m ago•0 comments

The MACL Extended Attribute

https://eclecticlight.co/2026/04/21/the-macl-extended-attribute/
1•frizlab•41m ago•0 comments

Mother Earth Mother Board

https://efdn.notion.site/Mother-Earth-Mother-Board-WIRED-a8ff97e460bc4ac1b4a7b87f3503a55c
1•thunderbong•43m ago•0 comments

US recession probabilities implied by the yield curve

https://www.stlouisfed.org/on-the-economy/2023/sep/what-probability-recession-message-yield-spreads
1•latentframe•47m ago•1 comments

Show HN: AnyHabit – A minimalist habit tracker for Raspberry Pi and Docker

https://github.com/Sparths/AnyHabit
1•bebedi•50m ago•0 comments

Highlights from Git 2.54

https://github.blog/open-source/git/highlights-from-git-2-54/
1•tux3•52m ago•0 comments

Enhancing Sporting Organisation Efficiency with Generative AI

https://sinankprn.com/posts/enhancing-sporting-organisation-efficiency-with-generative-ai/
1•sminchev•53m ago•0 comments

Reconstructing a Vue and Three.js app from a single Webpack bundle

1•YufanZhang•53m ago•0 comments

Show HN: Tiltbump – another game in a single HTML file

https://tiagosimoes.github.io/tiltbump/
2•eropatori•55m ago•0 comments