The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•6mo ago

Comments

tocs3•6mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised by the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
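To make the "check both the final answers and the reasoning steps" idea concrete, here is a minimal sketch of A* on a small grid that records a step-by-step trace, plus two separate checkers: one for the final plan and one for the trace. The grid, the `create`/`close` trace format, and every function name below are illustrative assumptions, not the paper's actual setup or code.

```python
import heapq

def astar_with_trace(grid, start, goal):
    """A* on a 4-connected grid with unit step cost; returns (path, trace)."""
    def h(cell):
        # Manhattan distance: admissible and consistent on this grid.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    open_heap = [(h(start), 0, start)]
    best_g = {start: 0}
    parent = {start: None}
    closed = set()
    trace = []  # hypothetical trace format: (operation, cell, cost so far)
    while open_heap:
        _, g, cell = heapq.heappop(open_heap)
        if cell in closed:
            continue  # stale heap entry
        closed.add(cell)
        trace.append(("close", cell, g))
        if cell == goal:
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1], trace
        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue
            if grid[nr][nc] == "#" or nxt in closed:
                continue
            ng = g + 1
            if ng < best_g.get(nxt, float("inf")):
                best_g[nxt] = ng
                parent[nxt] = cell
                heapq.heappush(open_heap, (ng + h(nxt), ng, nxt))
                trace.append(("create", nxt, ng))
    return None, trace

def plan_is_valid(grid, start, goal, path):
    """Check only the final answer: adjacent, wall-free moves from start to goal."""
    if not path or path[0] != start or path[-1] != goal:
        return False
    for (r1, c1), (r2, c2) in zip(path, path[1:]):
        if abs(r1 - r2) + abs(c1 - c2) != 1 or grid[r2][c2] == "#":
            return False
    return True

def trace_is_valid(trace, start):
    """Check only the steps: a cell must be created before it is closed,
    and may be closed at most once."""
    created, closed = {start}, set()
    for op, cell, _ in trace:
        if op == "create":
            created.add(cell)
        elif op == "close":
            if cell not in created or cell in closed:
                return False
            closed.add(cell)
        else:
            return False
    return True

GRID = ["..#.",
        "..#.",
        "...."]
START, GOAL = (0, 0), (0, 3)

path, trace = astar_with_trace(GRID, START, GOAL)
corrupted_trace = list(reversed(trace))  # same tokens, meaningless order

print(plan_is_valid(GRID, START, GOAL, path), trace_is_valid(trace, START))
print(plan_is_valid(GRID, START, GOAL, path), trace_is_valid(corrupted_trace, START))
```

The second line of output is the interesting case: the plan still checks out while the accompanying trace does not, which is the kind of mismatch between answer and "reasoning" the summary describes.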
