frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•7mo ago

Comments

tocs3•7mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

'$100 Steam Machine' uses a cut-down PS5 APU with Bazzite

https://www.tomshardware.com/pc-components/gpus/usd100-steam-machine-uses-a-cut-down-ps5-apu-with...
1•throwaway270925•42s ago•0 comments

Epstein Files Browser

https://epstein-files-browser.vercel.app/
1•helloplanets•48s ago•0 comments

Sam Altman's New Brain Venture, Merge Labs, Will Spin Out of a Nonprofit

https://www.wired.com/story/sam-altman-brain-computer-interface-merge-labs-spin-out-nonprofit-for...
1•danielmorozoff•5m ago•0 comments

America and China Are Racing to Different AI Futures [video]

https://www.youtube.com/watch?v=qDNFaAz3_Cw
1•hunglee2•9m ago•0 comments

The post-GeForce era: What if Nvidia abandons PC gaming?

https://www.pcworld.com/article/3013044/the-post-geforce-era-what-if-nvidia-abandons-pc-gaming.html
2•taubek•14m ago•0 comments

Ask HN: What is the most complex software you've built single handedly?

2•chistev•14m ago•0 comments

New Design for the Official Ruby Website

https://www.ruby-lang.org/en/
1•thunderbong•14m ago•0 comments

The Gerrit code review iceberg

https://www.haiku-os.org/blog/pulkomandy/2025-11-24-the_gerrit_pending_review_iceberg
2•birdculture•17m ago•0 comments

Show HN: A minimalist, high-quality Text-to-Speech Chrome extension

https://chromewebstore.google.com/detail/qariyo-text-to-speech/bdnmnapejclcgkgljpkddbnjfcplkhoj
1•abagh999•23m ago•0 comments

Ask HN: What are the most convincing resources about climate change?

1•eimrine•23m ago•0 comments

Em-admin: Open-source tool to read-write WaterStar watermeter radio parameters

https://github.com/hn/em-admin
1•hn___•24m ago•0 comments

Compiler Design (Summer 2025)

https://symbolaris.com/course/compiler.html
1•waldarbeiter•25m ago•0 comments

What Does a Database for SSDs Look Like?

https://brooker.co.za/blog/2025/12/15/database-for-ssd.html
2•charleshn•26m ago•0 comments

Show HN: Thufir – Claude Code plugin to solve production issues

https://github.com/evangelosmeklis/thufir
2•twelvechess•28m ago•1 comments

How to Watch the Ursids Winter Solstice Meteor Shower

https://www.nytimes.com/2025/12/20/science/ursids-meteor-shower-how-to-watch.html
1•quapster•33m ago•0 comments

The Intergiro Story

https://intergiro.com/
1•breton•34m ago•0 comments

How to Add Your Pathmind Course Certificate to LinkedIn

https://pathmind.app/landing/
1•WebToolsCaE•41m ago•1 comments

Reflections on Building a Pixel-Perfect UI Pipeline in JUCE Applications

https://playfultones.com/blog/reflections-on-building-a-pixel-perfect-ui-pipeline-in-juce-applica...
1•playfultones•41m ago•0 comments

Show HN:macOS Memory Benchmark for Apple Silicon (cache, bandwidth, latency)

https://github.com/timoheimonen/macOS-memory-benchmark
1•user_timo•45m ago•2 comments

From "I'll set up monitoring later" to 50 paying customers

https://www.catops.app/
1•honley•46m ago•1 comments

Google's boomerang year: 20% of AI engineers hired in 2025 were ex-employees

https://www.cnbc.com/2025/12/19/google-boomerang-year-20percent-ai-software-devs-hired-2025-ex-em...
1•arberavdullahu•52m ago•0 comments

Jax-JS: an ML library for the web

https://ekzhang.substack.com/p/jax-js-an-ml-library-for-the-web
1•samuel246•52m ago•0 comments

From Gridlock to Grid Power: The Promise of Superconducting Cables

https://ee.eng.cam.ac.uk/index.php/2025/09/22/from-gridlock-to-grid-power-the-promise-of-supercon...
1•zeristor•57m ago•0 comments

Monorepos: Please Don't

https://medium.com/@mattklein123/monorepos-please-dont-e9a279be011b
1•fanf2•57m ago•1 comments

Reflections on AI at the End of 2025

https://antirez.com/news/157
38•danielfalbo•1h ago•27 comments

Robocop – The Future of Copy Protection

https://hoffman.home.blog/2025/12/18/robocop-the-future-of-copy-protection/
2•neitsa•1h ago•0 comments

Len Sassaman and Satoshi: A Cypherpunk History

https://evanhatch.medium.com/len-sassaman-and-satoshi-e483c85c2b10
4•chistev•1h ago•0 comments

TOML 1.1.0 Released

https://github.com/toml-lang/toml/releases/tag/1.1.0
1•birdculture•1h ago•0 comments

Ask HN: Morning gravy train discussing AI, here is my opinion

2•gitprolinux•1h ago•2 comments

Subscription with a 3-year trial period

https://play.google.com/store/apps/details?id=com.typexai.bardo&hl=en_US
1•Typexex•1h ago•1 comments