frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•10mo ago

Comments

tocs3•10mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

List of Common Misconceptions

https://en.wikipedia.org/wiki/List_of_common_misconceptions
1•thedrexster•2m ago•0 comments

Human brain operates near, but not at, the critical point

https://phys.org/news/2026-03-human-brain-critical.html
1•yoquan•5m ago•0 comments

Fedora 44 will automatically make your Windows games run faster

https://www.xda-developers.com/fedora-44-will-automatically-make-your-windows-games-run-faster-no...
1•Alupis•7m ago•0 comments

WTO reforms talks stalled amid U.S.-India digital services taxation deadlock

https://www.reuters.com/world/india/wto-talks-stalled-going-into-final-day-amid-us-india-e-commer...
2•alephnerd•10m ago•0 comments

Hertz and Hearts – PC HRV biofeedback for chest-strap ECG (OpenHRV fork)

https://github.com/JoelAtHome/HertzAndHearts
1•J_Kobe•13m ago•0 comments

Apple issues urgent lock screen warnings for unpatched iPhones and iPads

https://securityaffairs.com/190109/security/apple-issues-urgent-lock-screen-warnings-for-unpatche...
2•WaitWaitWha•16m ago•0 comments

Emergency Microsoft, Oracle patches point to wider cyber issues

https://www.computerweekly.com/news/366640648/Emergency-Microsoft-Oracle-patches-point-to-wider-c...
1•smurda•18m ago•0 comments

Pentagon prepares for weeks of ground operations in Iran

https://www.washingtonpost.com/national-security/2026/03/28/trump-iran-ground-troops-marines/
2•Jimmc414•18m ago•1 comments

ReadyPC – open-source Rust PC Optomizer

https://github.com/Gloom-Team/ReadyPC/releases/tag/Latest
1•asdadaZ•19m ago•0 comments

Improving My C Build System with Zig

https://louislefebvre.net/tech/zig-gcc-replace/
1•louislef299•19m ago•0 comments

OpenYak – An open-source Cowork that runs any model and owns your filesystem

https://github.com/openyak/desktop
3•wangzhangwu•26m ago•1 comments

The Fastest Man Alive? [video]

https://www.youtube.com/shorts/R7OoEXaOVY0
1•SilentM68•26m ago•0 comments

How to Do Any Work

https://drive.google.com/uc?id=1wurJsO1vZYiynrTxDLroiQX2fBnKmldo&export=download
1•waseyjamal•30m ago•1 comments

Generalized Linear Model

https://en.wikipedia.org/wiki/Generalized_linear_model
1•azhenley•31m ago•0 comments

Data Centers Under Fire: A Systemic Security Challenge

https://www.datacenterknowledge.com/physical-security/data-centers-under-fire-a-growing-critical-...
1•WaitWaitWha•31m ago•0 comments

Mark Zuckerberg texted Elon Musk to offer help with DOGE

https://techcrunch.com/2026/03/28/mark-zuckerberg-texted-elon-musk-to-offer-help-with-doge/
2•toomanyrichies•36m ago•0 comments

Thinking in the Margins

https://theamericanscholar.org/thinking-in-the-margins/
1•SegfaultSeagull•50m ago•0 comments

The Revenge of the Data Scientist

https://hamel.dev/blog/posts/revenge/
1•prabal97•51m ago•0 comments

Eval-Driven Development: Applying TDD Principles to AI Agent Prompts

https://iris-eval.com/blog/eval-driven-development
1•iparent•52m ago•0 comments

Vanilla Claude vs. GitAuto Test Generation

https://gitauto.ai/blog/vanilla-claude-vs-gitauto-test-generation
1•nishiohiroshi•55m ago•0 comments

Show HN: Phantom – Let AI use your API keys without leaking them

https://github.com/ashlrai/phantom-secrets
1•masonwyatt23•57m ago•0 comments

Wikipedia officially bans AI-generated content

https://nypost.com/2026/03/28/tech/wikipedia-officially-bans-ai-generated-encyclopedia-entries/
8•1vuio0pswjnm7•57m ago•0 comments

Can You Guess What Tests a Calculator Needs?

https://gitauto.ai/blog/can-you-guess-what-tests-a-calculator-needs
1•nishiohiroshi•57m ago•0 comments

What Are Adversarial Tests and Why We Run Them

https://gitauto.ai/blog/what-are-adversarial-tests
1•nishiohiroshi•58m ago•0 comments

Indonesia Starts First Southeast Asia Social Media Ban for Kids

https://www.bloomberg.com/news/articles/2026-03-28/indonesia-starts-first-southeast-asia-social-m...
1•1vuio0pswjnm7•59m ago•0 comments

Indonesia's social media curbs for under 16s take set to start

https://www.reuters.com/business/media-telecom/indonesias-social-media-curbs-kids-set-saturday-fe...
1•1vuio0pswjnm7•1h ago•0 comments

Nice Graphs – Text to chart in one click

https://nice-graphs.com/pt
1•domiuau•1h ago•1 comments

Elon Musk's last co-founder reportedly leaves xAI

https://techcrunch.com/2026/03/28/elon-musks-last-co-founder-reportedly-leaves-xai/
4•SilverElfin•1h ago•3 comments

A n00B PM's guide to vibe coding kernels from scratch

https://www.ddmckinnon.com/2026/03/28/a-n00b-pms-guide-to-vibe-coding-kernels-from-scratch/
2•dmckinno•1h ago•0 comments

Aging Is a Software Problem

https://twitter.com/davidasinclair/status/2037966418453410024
1•dschol•1h ago•0 comments