frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•10mo ago

Comments

tocs3•10mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Lattice-Based Cryptography and Formal Verification

https://mayckongiovani.substack.com/p/pqc-engineering-series-deep-dive-dba
1•doomhammerhell•2m ago•0 comments

How to make Firefox builds 17% faster

https://blog.farre.se/posts/2026/04/10/caching-webidl-codegen/
1•mbitsnbites•2m ago•0 comments

Brunost: The Nynorsk Programming Language

https://lindbakk.com/blog/introducing-brunost
1•atomfinger•2m ago•0 comments

Péter Magyar: Hungary's next leader energised voters but is 'dark horse'

https://www.theguardian.com/world/2026/apr/12/peter-magyar-hungary-next-leader-profile
2•mooreds•4m ago•0 comments

Misanthropic

https://privatebank.jpmorgan.com/nam/en/o/eotm/misanthropic
2•baddash•4m ago•0 comments

New credit card will tap into borrowers' fossil-fuel rights

https://www.americanbanker.com/news/new-credit-card-will-tap-into-borrowers-fossil-fuel-rights
1•petethomas•5m ago•0 comments

New Player Control for YouTube

https://chromewebstore.google.com/detail/accent-converter-for-yout/lnmbdamplioghdakbfnbbofeipjlbgpk
1•astipili•6m ago•0 comments

The Age of Dinosaurs

https://longreads.com/2026/03/31/age-of-dinosaurs-parenting-history-museum/
2•mooreds•7m ago•0 comments

AI ran into the cold hard reality of the legal profession

https://www.theregister.com/2026/04/13/ai_attorneys/
3•blackcoffeerain•9m ago•0 comments

Laundry folding floor lamp for $1500

https://syncere.com/product
1•ageofattention•10m ago•0 comments

Building a Web Page That Edits Itself

https://www.patrickweaver.net/blog/one-pager-self-editing-html/
1•evakhoury•10m ago•0 comments

Anthropic's Mythos Preview and Project Glasswing

https://www.schneier.com/blog/archives/2026/04/on-anthropics-mythos-preview-and-project-glasswing...
2•speckx•10m ago•0 comments

New Mexico governor signs nation's first universal child care law

https://www.governor.state.nm.us/2026/03/10/governor-lujan-grisham-signs-nations-first-universal-...
2•eatonphil•10m ago•0 comments

AI Frontier Model Tracker with API

https://www.demandsphere.com/research/ai-frontier-model-tracker/
1•rgrieselhuber•11m ago•1 comments

Show HN: RememberMap

https://remembermap.com
1•sameg14•12m ago•0 comments

Show HN: Soulhunt – your digital twin is loose. capture it or someone else will

https://soulhunt.ai
2•tormine1•13m ago•1 comments

Building a Robust Documentation Agent with DigitalOcean Gradient AI Platform

https://www.digitalocean.com/blog/documentation-agent
2•gabes•14m ago•0 comments

The Age-Old Urge to Destroy Technology

https://www.newyorker.com/culture/infinite-scroll/the-age-old-urge-to-destroy-technology
3•mitchbob•14m ago•2 comments

We're Using So Much AI That Computing Firepower Is Running Out

https://www.wsj.com/tech/ai/ai-is-using-so-much-energy-that-computing-firepower-is-running-out-15...
3•NN88•19m ago•1 comments

Breaking Rohde and Schwarz AMIQ License Keys – The Hard and the Easy Way

https://tomverbeure.github.io/2026/04/12/AMIQ-License-Key-Generation.html
2•Eduard•20m ago•0 comments

Drawbridge: What SQL Server on Linux is built on (2021)

https://threedots.ovh/blog/2021/01/drawbridge-what-sql-server-on-linux-is-built-on/
2•my123•20m ago•0 comments

Building a Grow-Only Counter on a Sequentially Consistent KV Store

https://brunocalza.me/blog/2026/04/13/building-a-grow-only-counter-on-a-sequentially-consistent-k...
2•brunocalza•21m ago•0 comments

Breathing pattern is as unique as a fingerprint

https://www.psypost.org/your-breathing-pattern-is-as-unique-as-a-fingerprint/
3•lentoutcry•22m ago•0 comments

Dummy Client

https://news.ycombinator.com/news
2•alchemy97•22m ago•0 comments

Austerity Creates Fascism

https://pluralistic.net/2026/04/12/always-great/
12•Refreeze5224•23m ago•1 comments

Why Context Switching Kills Deep Work and How to Fix It on Mac

https://www.brnsft.com/blog/why-context-switching-kills-deep-work-and-how-to-fix-it-on-mac
2•robertohanas•23m ago•1 comments

Show HN: Type-level Fibonacci with a while loop in stable Rust (no const)

https://gist.github.com/aluqas/c7209b8990762db72620a87200f3e2aa
2•saqula•23m ago•0 comments

From Fossil to Fact: The Denisova Discovery as Science in Action [pdf]

https://www.diva-portal.org/smash/get/diva2:1632719/FULLTEXT01.pdf
2•larve•25m ago•0 comments

Serenely Fast I/O Buffer (With Benchmarks) – SereneDB

https://blog.serenedb.com/io-buffer
2•PaulHoule•25m ago•0 comments

Visualizing CPU Pipelining (2024)

https://timmastny.com/blog/visualizing-cpu-pipelining/
3•flipacholas•25m ago•0 comments