The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•7mo ago

Comments

tocs3•7mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
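For readers unfamiliar with the setup the summary mentions, here is a minimal Python sketch of what "step-by-step reasoning paths based on A* search" could look like: a grid solver that logs every node it creates or closes, alongside the final path. The grid encoding, the `astar_with_trace` name, and the `("create" | "close", cell, g, h)` trace format are all assumptions made for illustration, not the paper's actual data format.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """A* on a 4-connected grid (1 = wall, 0 = free) with unit step cost.

    Returns (path, trace). The trace is a hypothetical "reasoning" log of
    the search operations; the path is the final answer. Path is None if
    the goal is unreachable.
    """
    def h(cell):
        # Manhattan distance: admissible and consistent on this grid.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    trace = [("create", start, 0, h(start))]
    open_heap = [(h(start), 0, start)]  # entries are (f, g, cell)
    best_g = {start: 0}
    parent = {start: None}
    closed = set()

    while open_heap:
        _, g, cell = heapq.heappop(open_heap)
        if cell in closed:
            continue  # stale heap entry; a better one was already expanded
        closed.add(cell)
        trace.append(("close", cell, g, h(cell)))

        if cell == goal:
            # Walk parent pointers back to the start to recover the path.
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1], trace

        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue
            if grid[nr][nc] == 1:
                continue
            ng = g + 1
            if ng < best_g.get(nxt, float("inf")):
                best_g[nxt] = ng
                parent[nxt] = cell
                heapq.heappush(open_heap, (ng + h(nxt), ng, nxt))
                trace.append(("create", nxt, ng, h(nxt)))

    return None, trace
```

Under these assumptions, a training example would pair the problem with the trace and the path, so the trace can be checked against the solver independently of the final answer. The "random and incorrect reasoning steps" variant described in the summary would correspond to swapping in a trace that does not match the problem while keeping the correct path.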
