The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•9mo ago

Comments

tocs3•9mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised by the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
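For anyone wondering what a "step-by-step reasoning path based on A* search" could look like in practice, here is a minimal sketch. It is not the paper's code or trace format: the grid, the function name `astar_with_trace`, and the "create"/"close" event names are all assumptions made up for illustration. It runs A* on a tiny maze and logs every node it creates or expands, which is roughly the kind of intermediate trace a model could be trained on alongside the final path.

```python
import heapq

def astar_with_trace(grid, start, goal):
    """A* on a 4-connected grid ('#' = wall). Returns (path, trace).

    The trace format is hypothetical: tuples of (event, cell, g, h).
    """
    def h(p):
        # Manhattan distance: admissible and consistent for unit-cost moves
        return abs(p[0] - goal[0]) + abs(p[1] - goal[1])

    trace = [("create", start, 0, h(start))]
    frontier = [(h(start), 0, start)]   # entries are (f, g, cell)
    best_g = {start: 0}
    parent = {start: None}

    while frontier:
        f, g, node = heapq.heappop(frontier)
        if g > best_g[node]:
            continue  # stale queue entry, a cheaper route was found later
        trace.append(("close", node, g, h(node)))
        if node == goal:
            # Walk parent links back to the start to rebuild the path
            path = []
            while node is not None:
                path.append(node)
                node = parent[node]
            return path[::-1], trace
        r, c = node
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            inside = 0 <= nr < len(grid) and 0 <= nc < len(grid[0])
            if inside and grid[nr][nc] != "#":
                ng = g + 1
                if ng < best_g.get((nr, nc), float("inf")):
                    best_g[(nr, nc)] = ng
                    parent[(nr, nc)] = node
                    heapq.heappush(frontier, (ng + h((nr, nc)), ng, (nr, nc)))
                    trace.append(("create", (nr, nc), ng, h((nr, nc))))
    return None, trace  # goal unreachable

# Hypothetical 3x3 maze, start top-left, goal bottom-right
grid = ["..#",
        "...",
        "#.."]
path, trace = astar_with_trace(grid, (0, 0), (2, 2))
```

Here `path` is the final answer and `trace` is the step-by-step record. Because both come from the same solver, either one can be checked independently, which is the property the summary says the authors rely on when comparing answer correctness with trace correctness.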
