frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•9mo ago

Comments

tocs3•9mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Apple Reportedly Agrees to 100% Price Hike on Samsung Memory Chips

https://www.macrumors.com/2026/02/26/apple-agrees-100-price-hike-samsung-ram/
1•tosh•35s ago•0 comments

Someone just exposed a whole lot of API keys

https://gist.github.com/noctonic/0576b930fca24c33faf90fd2a8935443
1•CraterD•1m ago•0 comments

Chinese industry is beating Germany at its own game. Cue panic

https://www.economist.com/europe/2026/02/19/how-germany-fell-out-of-love-with-china
1•alecco•3m ago•1 comments

What breaks first when you try to run AI agents on a 1–2 MB memory budget?

https://github.com/nullclaw/nullclaw
2•NULLCLAW•4m ago•1 comments

As ye clone(), so shall ye AUTOREAP

https://lwn.net/SubscriberLink/1059673/57cdcdf73570ac75/
1•jwilk•5m ago•0 comments

Mitch Bradley: Sun Microsystems, Firmware, Forth, OLPC (2008)

https://web.archive.org/web/20120118132847/http://howsoftwareisbuilt.com/2008/03/27/interview-wit...
1•tosh•5m ago•0 comments

Humans.md

https://www.jerpint.io/blog/2026-02-25-humans-md/
2•jerpint•5m ago•0 comments

Temporary processing loops as a sometimes replacement for background threads

https://notebook.drmaciver.com/posts/2026-02-23-17:10.html
1•sebg•7m ago•0 comments

Querying 3B Vectors

https://vickiboykis.com/2026/02/21/querying-3-billion-vectors/
1•sebg•7m ago•0 comments

Power from the Sun: Its Future (1968)

https://www.science.org/doi/10.1126/science.162.3856.857
1•abracos•8m ago•0 comments

Yes, building AI chat is still hard

https://getlago.com/blog/building-ai-is-hard
1•FinnLobsien•9m ago•0 comments

I Priced My Dotfiles Syncing App Wrong (and Other Lessons)

https://david.coffee/configmesh-1-1/
1•speckx•12m ago•0 comments

Police Officer Accused of Tracking Partner via Flock Camera License Plate Reader

https://www.nytimes.com/2026/02/25/us/milwaukee-police-officer-charged-flock-camera.html
1•bonsai_spool•13m ago•1 comments

Show HN: NSENS – AI decision governance with Prolog and adversarial review

https://github.com/maciejjankowski/nsens-framework
1•mjankowski•14m ago•0 comments

A New Home for React Hosted by the Linux Foundation

https://react.dev/blog/2026/02/24/the-react-foundation
1•pimterry•15m ago•0 comments

libreDSSP: A GPL Licensed DSSP Interpreter

https://github.com/mechaniputer/libreDSSP
1•tosh•16m ago•0 comments

Get ready for takeoff with Uber and Joby

https://www.uber.com/newsroom/uber-air/
2•falcor84•17m ago•0 comments

Artificial Intelligence and the Economy. Myths, Realities and the Future of Work

https://roblesnotes.com/blog/ai-economy-future-of-work/
1•Findeton•17m ago•0 comments

Building virtual iPhone using VPHONE600AP component of recent PCC firmware

https://github.com/wh1te4ever/super-tart-vphone-writeup
1•Gander5739•17m ago•0 comments

Internet routing as supply chain risk

https://blog.apnic.net/2026/02/26/internet-routing-as-supply-chain-risk/
1•speckx•19m ago•0 comments

A Man with the Plan

https://reason.com/1998/01/01/the-man-with-the-plan/
2•eamag•21m ago•0 comments

Git in Postgres

https://nesbitt.io/2026/02/26/git-in-postgres.html
2•todsacerdoti•23m ago•0 comments

Programming in K

https://github.com/JohnEarnest/ok/blob/gh-pages/docs/Programming.md
1•tosh•24m ago•0 comments

You Want to Visit the UK? You Better Have a Google Play or App Store Account

https://www.heltweg.org/posts/you-want-to-visit-the-uk-you-better-have-a-google-play-or-app-store...
59•rhazn•25m ago•44 comments

'Futuristic' Unison functional language debuts

https://www.infoworld.com/article/4100673/futuristic-unison-functional-language-debuts.html
1•mpweiher•28m ago•0 comments

The Coming Middle-Class Existential Crisis

https://d1gesto.blogspot.com/2026/02/the-coming-middle-class-existential.html
1•voxleone•28m ago•0 comments

Comparing manual vs. AI requirements gathering: 2 sentences vs. 127-point spec

1•thesssaism•29m ago•0 comments

The Edge of Mathematics

https://www.theatlantic.com/technology/2026/02/ai-math-terrance-tao/686107/
1•danielmorozoff•30m ago•0 comments

China's robot dance for German Chancellor

https://twitter.com/MKuefner/status/2026928081538265378
1•harscoat•31m ago•0 comments

Mako: A simple virtual game console

https://github.com/JohnEarnest/Mako
1•tosh•32m ago•0 comments