The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
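
For anyone who wants to picture the setup the summary describes, here is a minimal sketch of how an A* run can yield both a step-by-step trace and a final plan that are each checkable. It assumes a small 4-connected grid maze with unit step costs; the function names (`astar_with_trace`, `to_example`) and the token format are hypothetical illustrations, not the paper's actual code or data format.

```python
import heapq

def astar_with_trace(grid, start, goal):
    """A* on a 4-connected grid (0 = free, 1 = wall). Returns (trace, path)."""
    def h(cell):
        # Manhattan distance: admissible and consistent for unit-cost grid moves.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    trace = []                          # the search steps, in order
    frontier = [(h(start), 0, start)]   # entries are (f = g + h, g, cell)
    best_cost = {start: 0}
    parent = {start: None}
    closed = set()
    while frontier:
        _, cost, cell = heapq.heappop(frontier)
        if cell in closed:
            continue                    # stale heap entry, already expanded
        closed.add(cell)
        trace.append(("close", cell, cost))
        if cell == goal:
            path = []
            while cell is not None:     # walk parents back to the start
                path.append(cell)
                cell = parent[cell]
            return trace, path[::-1]
        r, c = cell
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue
            if grid[nr][nc] == 1:
                continue
            new_cost = cost + 1
            if new_cost < best_cost.get((nr, nc), float("inf")):
                best_cost[(nr, nc)] = new_cost
                parent[(nr, nc)] = cell
                heapq.heappush(frontier, (new_cost + h((nr, nc)), new_cost, (nr, nc)))
                trace.append(("create", (nr, nc), new_cost))
    return trace, None                  # goal unreachable

def to_example(trace, path):
    """Serialize trace and plan into one training string (hypothetical format)."""
    steps = " ".join(f"{op} {r} {c} c{cost}" for op, (r, c), cost in trace)
    plan = " ".join(f"plan {r} {c}" for r, c in path)
    return f"{steps} | {plan}"

# Example maze: the middle row is walled off except for the right-hand column.
grid = [[0, 0, 0],
        [1, 1, 0],
        [0, 0, 0]]
trace, path = astar_with_trace(grid, (0, 0), (2, 0))
print(to_example(trace, path))
```

Because the trace comes from a known algorithm, it can be validated separately from the plan. Under the same assumptions, a "corrupted" training example would simply pair one maze's plan with a trace that does not belong to it.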

Show HN: Daily-web: your daily-updates web feed

https://github.com/garyo/daily-web
1•darkstarsys•1m ago•0 comments

Tired of messy GitHub PRs? Chrome extensions enforce descriptions and size limit

https://chromewebstore.google.com/detail/pr-description-guard/idfeaafjnjnfknjbfbpnlgphjhohfpah
1•afrasiyabhaider•2m ago•1 comments

Show HN: 21st.fund, an AI tool to discover grants and non-dilutive funding

https://www.21st.fund/
1•udit_50•3m ago•0 comments

Simple Browser AI

https://simplebrowserai.pagedrop.io/
1•antidotumagen•5m ago•0 comments

Email obfuscation: What works in 2025?

https://spencermortensen.com/articles/email-obfuscation/
1•blackstache•8m ago•0 comments

At least 5k dead in Iran unrest, official says

https://www.reuters.com/business/media-telecom/iranian-official-says-verified-deaths-iran-protest...
1•wslh•8m ago•0 comments

How AI makes for better software (& companies)

https://gmays.com/how-ai-makes-for-better-software-companies/
2•gmays•11m ago•0 comments

Our top Core Web Vitals recommendations for 2023

https://web.dev/articles/top-cwv
1•Tomte•12m ago•0 comments

Fear and Loathing of the English Passive (2010)

https://www.lel.ed.ac.uk/~gpullum/passive_loathing.html
2•Tomte•12m ago•1 comments

Show HN: Keepthat.link – rudimentary, no-frills bookmarks

https://www.keepthat.link/
1•e_xyz•14m ago•0 comments

Childhood Neighbors Influence Occupation Choice [pdf]

https://drive.google.com/file/d/17Pq41ZzfwEdm-YrmWCMkvU0E4T-SXzPp/view
1•elsewhen•15m ago•0 comments

Show HN: Zsweep – Play Minesweeper using only Vim motions

https://zsweep.com
1•oug-t•18m ago•4 comments

Nuclear Weapons Are Now ESG Compliant

https://news.slashdot.org/story/26/01/14/144240/nuclear-weapons-are-now-esg-compliant
1•7777777phil•19m ago•0 comments

The Truth Architecture – Why Web3 Is the Only Way Out

https://aegistrail.github.io/posts/Why-Web3-is-the-only-way-out/
2•patronage•20m ago•0 comments

Humans are taking our jobs!

https://humanthreat.xyz/
2•modinfo•21m ago•0 comments

Predator Spyware Turns Failed Attacks into Intelligence for Future Exploits

https://www.securityweek.com/predator-spywares-granular-anti-analysis-features-exposed/
1•smurda•22m ago•0 comments

Engineering a reusable insulin patch pump

2•u-pump•23m ago•0 comments

The Harvesting of Lettuce

https://sftw.substack.com/p/310-to-yuma
2•HR01•24m ago•0 comments

Seamless codebase-relevant context enrichment for prompts

https://github.com/arterialist/magic-prompt
1•Arterialist•24m ago•0 comments

Is Sienna Rose AI? All Signs Point to 'Yes'

https://www.rollingstone.com/music/music-news/sienna-rose-ai-artist-real-1235499068/
1•geox•25m ago•0 comments

With AI coding we can just make our own editors

https://github.com/posix4e/minivim
2•alexnewman•30m ago•3 comments

Show HN: StayUp – a background desktop app for activity-based time trackers

1•delusdev•31m ago•0 comments

How to Build an AI Agent Declaratively with Terraform

https://chatbotkit.com/tutorials/how-to-build-an-ai-agent-declaratively-with-terraform
1•_pdp_•31m ago•0 comments

Perelman's Proof of the Poincaré Conjecture: A Nonlinear PDE Perspective

https://arxiv.org/pdf/math/0610903
2•tzury•37m ago•0 comments

Show HN: SMath Units, RCPC Initiative

https://github.com/JTRSoftware/Project_RCPC/tree/main/ReadyToShare/sMath
2•jtr87•39m ago•0 comments

Blue on X: "unrot your brain"

https://twitter.com/bluewmist/status/2012755834636533893
2•bilsbie•40m ago•0 comments

Show HN: Open-source confusion matrix generator for ML models

1•pareshrnayak•40m ago•1 comments

Ljudmila

https://wiki.ljudmila.org/Main_Page
3•jruohonen•40m ago•0 comments

The real technical debt is semantic decay and only platforms can stop it

https://unvarnishedgrady.substack.com/p/on-platforms-iii-the-physics-of-meaning
4•ecurb•40m ago•0 comments

Show HN: 13MB full-text site search

https://www.asciimx.com/log/site-search/
1•kovac•41m ago•0 comments