The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•11mo ago

Comments

tocs3•11mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
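For anyone wondering what "step-by-step reasoning paths based on A* search" could look like in practice, here is a minimal sketch. It assumes a toy 4-connected grid maze and treats the order in which A* expands cells as the "trace", with the final path as the "answer". The function name `astar_with_trace`, the grid encoding (0 = free, 1 = wall), and the trace format are all made up for illustration; the summary above does not say how the paper actually serializes its traces.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """Run A* on a 4-connected grid and return (trace, path).

    trace: cells in the order A* expanded them (stand-in for intermediate tokens).
    path:  cells from start to goal, or None if the goal is unreachable.
    Both formats are illustrative assumptions, not the paper's encoding.
    """
    def h(cell):
        # Manhattan distance: admissible and consistent for unit-cost grid moves.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    rows, cols = len(grid), len(grid[0])
    frontier = [(h(start), 0, start)]  # entries are (f = g + h, g, cell)
    parent = {start: None}
    cost = {start: 0}
    closed = set()
    trace = []

    while frontier:
        _, g, cell = heapq.heappop(frontier)
        if cell in closed:
            continue  # stale queue entry for an already-expanded cell
        closed.add(cell)
        trace.append(cell)
        if cell == goal:
            # Walk parent links back to the start to recover the path.
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return trace, path[::-1]
        r, c = cell
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if 0 <= nr < rows and 0 <= nc < cols and grid[nr][nc] == 0:
                ng = g + 1
                if ng < cost.get((nr, nc), float("inf")):
                    cost[(nr, nc)] = ng
                    parent[(nr, nc)] = cell
                    heapq.heappush(frontier, (ng + h((nr, nc)), ng, (nr, nc)))
    return trace, None


if __name__ == "__main__":
    maze = [
        [0, 0, 0],
        [1, 1, 0],
        [0, 0, 0],
    ]
    trace, path = astar_with_trace(maze, (0, 0), (2, 0))
    print("trace:", trace)
    print("path: ", path)
```

Under this framing, a "correct" training example would pair a maze with its real trace and path, while the "random and incorrect reasoning steps" condition would keep the path but swap in a trace that does not match the maze.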

Cisco open sources toolkit for tracing AI model lineage

https://blogs.cisco.com/ai/model-provenance-kit
1•hsanthan•4m ago•0 comments

Swival: A coding agent for any model

https://swival.dev/
1•handfuloflight•4m ago•0 comments

Quantifying Voter Biases in Online Platforms: An Instrumental Variable Approach

https://arxiv.org/abs/1910.00757
1•smooke•5m ago•0 comments

Steep fertilizer and fuel prices could squeeze US farmers for months to come

https://www.wpr.org/news/steep-fertilizer-fuel-prices-squeeze-us-farmers-months-come
1•_tk_•7m ago•0 comments

Show HN: Vanilla-scroll-sky: CSS-only modern scroll-driven storytelling sections

https://github.com/ulrischa/vanilla-scroll-sky
1•ulrischa•7m ago•0 comments

Migrating from Supabase

https://blog.val.town/blog/migrating-from-supabase/
2•gurjeet•10m ago•0 comments

Do we even need a better GitHub?

https://www.aviator.co/blog/do-we-even-need-a-better-github/
2•tonkkatonka•10m ago•0 comments

Stable Specialization in Rust

https://goldstein.lol/posts/stable-specialization/
1•PaulHoule•10m ago•0 comments

Claude will use all SpaceX Colossus datacenter capacity

https://twitter.com/NVIDIAAI/status/2052082412994383936
4•kristianpaul•12m ago•1 comments

When do we know someone has died

https://blog.computationalcomplexity.org/2026/05/when-do-we-know-someone-has-died.html
1•speckx•13m ago•0 comments

Multipath Reliable Connection spec published

https://www.opencompute.org/documents/ocp-mrc-1-0-pdf
1•jabl•13m ago•0 comments

Olaf: Bringing an Animated Character to Life in the Physical World

https://arxiv.org/abs/2512.16705
1•programd•13m ago•0 comments

Bell Laboratories Record (August 1941) [pdf]

https://www.worldradiohistory.com/Archive-Bell-Laboratories-Record/40s/Bell-Laboratories-Record-1...
2•zuhayeer•15m ago•0 comments

MIT’s virtual violin offers luthiers a new design tool

https://arstechnica.com/science/2026/05/mits-virtual-violin-offers-luthiers-a-new-design-tool/
2•smushy•15m ago•0 comments

Free and Simple Chess Analysis

https://www.g6chess.com/
1•mantegna•15m ago•1 comments

Supercomputer networking to accelerate large scale AI training

https://openai.com/index/mrc-supercomputer-networking/
1•dataking•16m ago•0 comments

xAI will be dissolved as a separate company

https://twitter.com/elonmusk/status/2052105373621121284
1•break_the_bank•17m ago•0 comments

Learning Advanced JavaScript (2008)

https://johnresig.com/apps/learn/
1•downbad_•19m ago•1 comments

Gypsy Woman Hardware Live Jam (2023) [video]

https://www.youtube.com/watch?v=_SSXALxZ3Hs
1•elvis70•20m ago•0 comments

Mainframe modernization is no longer optional for the AI-driven enterprise

https://thenewstack.io/open-mainframe-enterprise-modernization/
2•rbanffy•21m ago•0 comments

Let's Get EFF to Accept Monero Donations

https://monerocoalition.org/lets-get-eff-to-accept-monero-donations/
5•Cider9986•22m ago•0 comments

You can make more money buying MTG cards than the lottery

https://meadow.cafe/blog/0073-you-can-make-more-money-buying-mtg-cards-than-the-lottery/
2•speckx•25m ago•0 comments

Go-joker – a much faster Clojure interpreter written in Go and WASM

https://rcarmo.github.io/projects/go-joker/
7•rcarmo•26m ago•0 comments

ZAYA1-8B: Frontier intelligence density, trained on AMD

https://www.zyphra.com/post/zaya1-8b
3•mseri•27m ago•0 comments

Shadow – find which prompt change broke your AI agent

https://github.com/manav8498/Shadow
2•manav8498•27m ago•0 comments

Upcoming El Niño: The World Is About to Get a Preview of Life in 2035

https://www.nytimes.com/2026/05/06/opinion/el-nino-climate.html
5•puttycat•30m ago•0 comments

Planting Trees and Dreaming of Software

https://jerodsanto.net/2026/05/planting-trees-software-dreams/
1•herbertl•30m ago•0 comments

Hackers Hate AI Slop More Than You Do

https://www.wired.com/story/cybercriminals-are-complaining-about-ai-slop-flooding-their-forums/
9•aledevv•32m ago•1 comments

An aggregate of payment usage data released by businesses that accept Monero

https://monerostats.org/
1•Cider9986•32m ago•0 comments

A Fundamental FX Factor Model

https://dm13450.github.io/2026/04/19/A-Fundamental-FX-Factor-Model.md.html
1•dm13450•32m ago•0 comments