frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•1y ago

Comments

tocs3•1y ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Muxy – A lightweight terminal multiplexer for Mac

https://muxy.app/
1•eustoria•24s ago•0 comments

The Science Behind 'Project Hail Mary'

https://science.nasa.gov/the-science-behind-project-hail-mary/
1•cstever•26s ago•0 comments

Mediabunny – A complete JavaScript media toolkit for the browser

https://mediabunny.dev/
1•eustoria•57s ago•0 comments

Show HN: Jynx, a matchmaking app to find gaming teammates

https://jynx.app/
1•akiro____•2m ago•0 comments

Coordinated, Until It Isn't: Moksha's 89-vuln XAPI drop

https://cje.io/2026/05/17/coordinated-until-it-isnt/
1•pavel_lishin•3m ago•0 comments

To access your creativity, start playing

https://bigthink.com/books/to-access-your-creativity-start-playing/
1•cstever•5m ago•0 comments

Using safe-area-inset to build mobile-safe layouts

https://polypane.app/blog/using-safe-area-inset-to-build-mobile-safe-layouts/
1•eustoria•5m ago•0 comments

Researchers let AI models run a simulated society; Claude safest, Grok extinct

https://tech.yahoo.com/ai/claude/articles/researchers-let-ai-models-run-070300865.html
1•spankibalt•6m ago•0 comments

Replace GnuPG with Sequoia-PGP (& Actively Warn Against GnuPG)

https://discuss.privacyguides.net/t/replace-gnupg-with-sequoia-pgp-actively-warn-against-gnupg/38238
2•Cider9986•8m ago•0 comments

Join the Independent Science Society! (A New Kinda Science Society)

https://chillphysicsenjoyer.substack.com/p/join-the-independent-science-society
1•crescit_eundo•10m ago•0 comments

JLink JTAG Access on the Pinecil

https://danielmangum.com/posts/jlink-jtag-pinecil/
2•hasheddan•10m ago•0 comments

NixOS 26.05 Released

https://nixos.org/blog/announcements/2026/nixos-2605/
1•trulyrandom•11m ago•0 comments

Problematic TF Providers

https://newsletter.masterpoint.io/p/problematic-tf-providers
1•mooreds•11m ago•0 comments

Why the U.S. cattle herd is at a 75-year low

https://text.npr.org/nx-s1-5719511
1•mooreds•12m ago•0 comments

Airlines Can't Charge You for What You Wear

https://voyagecoat.com/
2•mooreds•12m ago•1 comments

Spatial IDE's for agentic coding workflows

2•Imbiss•13m ago•1 comments

Lossless URL Compressor – Piss.zip

https://piss.zip
1•Erenay09•15m ago•0 comments

AI demand absorbs wafer capacity, crushing budget PC segment

https://www.trendforce.com/presscenter/news/20260529-13068.html
2•Ember_Wipe•19m ago•0 comments

Microsoft slaps new coat of paint on Copilot, buries annoying button

https://www.theregister.com/ai-ml/2026/05/29/microsoft-slaps-new-coat-of-paint-on-copilot-buries-...
1•Bender•22m ago•0 comments

Neuro-Bayesian architecture in economic modeling

https://romankurnovskii.com/en/research/neuro-bayesian-architecture-in-economic-modeling/
1•djangofree•24m ago•0 comments

The software rebound is real, but not every big name is back

https://finance.yahoo.com/news/the-software-rebound-is-real-but-not-every-big-name-is-back-chart-...
1•jaynate•25m ago•0 comments

SSH authorized_keys command restriction to isolate container access

https://vxlabs.com/2026/05/30/ssh-command-restriction-container-to-host/
2•kobieps•26m ago•0 comments

Replimune's Drug Got Third Chance After White House Intervention

https://www.wsj.com/health/pharma/how-replimunes-drug-got-third-chance-after-white-house-interven...
3•impish9208•27m ago•1 comments

US mass layoffs tracker via WARN ACT notices

https://layoffs.kadoa.com/
3•ck2•28m ago•1 comments

Online Sleuthing Helped Catch the 'Google Insider' on Polymarket

https://www.wsj.com/finance/currencies/how-online-sleuthing-helped-catch-the-google-polymarket-tr...
1•1vuio0pswjnm7•30m ago•1 comments

Why the Next Datacenter Should Be Sized for a Village, Not a City

https://nicolabortignon.com/posts/community-datacenter-demand-shaping/
3•snickmy•31m ago•0 comments

Show HN: SnapState – Native Swift window manager for macOS

https://getsnapstate.com
1•soulsniper•31m ago•0 comments

Text Extraction from Images via Curl

https://softweavers.net/image-to-text.html
1•noboruma•32m ago•1 comments

Tech companies desperately want to film you doing chores

https://www.theverge.com/ai-artificial-intelligence/940007/ai-companies-will-pay-for-robot-traini...
1•1vuio0pswjnm7•33m ago•1 comments

AI and Taste

https://twitter.com/joulee/status/2054275672563175834
1•sanj•33m ago•0 comments