The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps, and surprisingly, they still perform about the same as, and sometimes even better than, those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
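
For anyone unfamiliar with the setup, here is a rough sketch of what a "reasoning path based on A* search" could look like. It is only an illustration under my own assumptions: the grid world, the `astar_with_trace` name, and the `("create" / "close", cell, cost)` trace format are hypothetical and not taken from the paper. The idea is that the solver logs every intermediate operation, and that log (plus the final path) is the kind of step-by-step data a model could be trained on and later checked against.

```python
import heapq

def astar_with_trace(grid, start, goal):
    """A* on a 4-connected grid (0 = free, 1 = wall).

    Returns (path, trace). The trace is a list of
    ("create" | "close", cell, cost) tuples; this format is an
    illustrative assumption, not the paper's exact encoding.
    """
    def h(cell):
        # Manhattan distance: admissible and consistent on a unit-cost grid.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    rows, cols = len(grid), len(grid[0])
    frontier = [(h(start), 0, start)]   # entries are (f, g, cell)
    best_cost = {start: 0}
    parent = {start: None}
    closed = set()
    trace = []

    while frontier:
        _, cost, cell = heapq.heappop(frontier)
        if cell in closed:
            continue                    # stale queue entry, already expanded
        closed.add(cell)
        trace.append(("close", cell, cost))

        if cell == goal:
            # Walk parent pointers back to the start to recover the path.
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1], trace

        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if 0 <= nr < rows and 0 <= nc < cols and grid[nr][nc] == 0:
                new_cost = cost + 1
                if new_cost < best_cost.get(nxt, float("inf")):
                    best_cost[nxt] = new_cost
                    parent[nxt] = cell
                    heapq.heappush(frontier, (new_cost + h(nxt), new_cost, nxt))
                    trace.append(("create", nxt, new_cost))

    return None, trace                  # goal unreachable
```

Because the trace comes from a known algorithm, it can be replayed and verified independently of the final answer, which is what makes it possible to ask whether a correct answer was accompanied by a correct trace.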
