frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•6mo ago

Comments

tocs3•6mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

America's post-apocalyptic maps reveal eerily familiar fault lines

https://bigthink.com/strange-maps/america-after-the-fall/
1•Brajeshwar•35s ago•0 comments

You don't need complex prompts, you need mechanical sympathy and honesty

https://renormalize.substack.com/p/the-best-prompt-engineering-is-just
1•getnormality•1m ago•0 comments

Show HN: Open-source customizable AI voice dictation built on Pipecat

https://github.com/kstonekuan/tambourine-voice
1•kstonekuan•3m ago•0 comments

FFmpeg Lands Initial Support for JPEG-XS

https://www.phoronix.com/news/FFmpeg-Merges-JPEG-XS
1•Bender•3m ago•0 comments

Die Yield Calculator

https://semianalysis.com/die-yield-calculator/
1•gmays•4m ago•0 comments

Wendelstein 7-X sets World record for long plasma triple product

https://euro-fusion.org/eurofusion-news/wendelstein-7-x-sets-world-record-for-long-plasma-triple-...
1•mpweiher•6m ago•0 comments

Update Now: iOS 26.2 Fixes 20 Security Vulnerabilities, 2 Actively Exploited

https://www.macrumors.com/2025/12/12/ios-26-2-security-vulnerabilities/
3•akyuu•10m ago•0 comments

Power Plant Underwater

1•vanguardeco•12m ago•1 comments

Price of a bot army revealed across online platforms

https://www.cam.ac.uk/stories/price-bot-army-global-index
1•teleforce•14m ago•0 comments

Beware: PayPal subscriptions abused to send fake purchase emails

https://www.bleepingcomputer.com/news/security/beware-paypal-subscriptions-abused-to-send-fake-pu...
1•fleahunter•17m ago•0 comments

Linux GPIB Drivers Declared Stable – 53 Years After HP Introduced the Bus

https://www.phoronix.com/news/GPIB-De-Staged-Linux-6.19
1•croes•19m ago•0 comments

Home-Schooled Kids Are Not All Right

https://www.nytimes.com/2025/12/14/opinion/home-school-isolation.html
3•ripe•19m ago•0 comments

Show HN: GitPow a cross-platform, open-source Git GUI with unique features

https://github.com/markrai/gitpow
1•markrai•23m ago•0 comments

Learnt Father

https://johnocens.com/soothfare/learntfather
1•wonderbar•24m ago•0 comments

How AI coding agents handle file editing

https://wuu73.org/aiguide/infoblogs/coding_file_edits/index.html
2•radio879•27m ago•0 comments

Software Architecture as a Cognitive Structure

https://medium.com/@dmitriy.kirenkin/software-architecture-as-a-cognitive-structure-3c3363be07d0
1•ideamod•27m ago•0 comments

In one of the driest places on Earth, recurrent floods have become deadlier

https://www.washingtonpost.com/weather/interactive/2025/oman-dubai-flooding/
1•lisper•27m ago•0 comments

Trump's $1M 'Gold Card' immigration application plan launches

https://abcnews.go.com/US/trumps-1-million-gold-card-immigration-application-plan/story?id=128320787
3•throw0101a•29m ago•0 comments

Show HN: Wax On, Python' – learn Python dojo-style

https://waxonpython.com
1•d416•31m ago•0 comments

Ultrafinitism

https://www.infinitelymore.xyz/p/ultrafinitism
1•FillMaths•33m ago•0 comments

Show HN: I made a live UK Bus Map from open data (and you can, too)

https://github.com/adamjames/busmap
1•mrwizrd•34m ago•0 comments

Private Equity Finds a New Source of Profit: Volunteer Fire Departments

https://www.nytimes.com/2025/12/14/us/fire-department-software-private-equity.html
7•7402•34m ago•3 comments

Theia IDE – AI-Native Open-Source Cloud and Desktop IDE

https://theia-ide.org/
3•janandonly•35m ago•2 comments

There Is One Clear Winner in the Corn vs. Solar Battle

https://cleantechnica.com/2025/04/26/there-is-one-clear-winner-in-the-corn-vs-solar-battle/
1•xbmcuser•35m ago•0 comments

AI was not invented, it arrived

https://andrewarrow.dev/2025/12/ai-was-not-invented-it-arrived/
2•fcpguru•36m ago•0 comments

CopperSpice – A Modern Cross-Platform C++ GUI Library

https://www.copperspice.com/
1•giancarlostoro•37m ago•1 comments

Claude Code's DX is too good. And that's a problem

https://www.bharath.sh/writing/claude-code-dx
3•lnbharath•40m ago•1 comments

When LICM [Loop-Invariant Code Motion] Fails Us

https://xania.org/202512/14-licm-when-it-doesnt
2•brewmarche•40m ago•0 comments

Let's Talk about GitHub Actions – The GitHub Blog

https://github.blog/news-insights/product-news/lets-talk-about-github-actions/
1•ossusermivami•42m ago•0 comments

Zmij: Faster floating point double-to-string conversion

https://vitaut.net/posts/2025/faster-dtoa/
1•fanf2•42m ago•0 comments