frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•9mo ago

Comments

tocs3•9mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Agentic Email

https://martinfowler.com/bliki/AgenticEmail.html
1•jeffkumar•1m ago•1 comments

Ask HN: Any AI browswer that I can control by Claude Code?

1•johnnyfeng•2m ago•0 comments

AI found us before Google did

1•faruk_tugtekin•2m ago•0 comments

The web is bearable with RSS, Cory Doctorow

https://pluralistic.net/2026/03/07/reader-mode/
1•verisimi•3m ago•0 comments

Israel and the FBI manipulated assassination plots to goad Trump into Iran

https://thegrayzone.com/2026/03/06/israel-fbi-assassination-plots-trump-iran-war/
1•O1111OOO•3m ago•0 comments

Braided Essays

https://www.writersdigest.com/write-better-nonfiction/what-is-a-braided-essay-in-writing
1•marysminefnuf•3m ago•1 comments

Show HN: Pappardelle, a TUI for Multi-Clauding

https://github.com/chardigio/pappardelle
1•chardigio•5m ago•0 comments

Ask HN: Why do most analytics tools show what happened but not why?

1•HPSimulator•5m ago•0 comments

Death of the Flow State

https://1984commitlog.substack.com/p/week-10-the-death-of-flow-state
1•mdp•6m ago•1 comments

Show HN: Human Psychology Simulator – AI website conversion psychology

https://human-psychology-simulator.thequantumgrove.io/
1•HPSimulator•8m ago•0 comments

The Great AI Arbitrage

https://www.dodgycoder.net/2026/03/the-great-ai-arbitrage.html
1•damian2000•9m ago•0 comments

Show HN: A command center for CS students drowning in recruiting season

https://interviewtrackr.com
1•princierKevin•9m ago•0 comments

NASA Asteroid Observations Eliminate Chance of 2032 Lunar Impact

https://science.nasa.gov/blogs/planetary-defense/2026/03/05/new-nasa-asteroid-observations-elimin...
1•geox•10m ago•0 comments

Nexvira – a personal writing archive instead of a traditional blog

https://nexpul.blogspot.com/
1•Anomatrix•11m ago•1 comments

Show HN: I automated DJ Screw's chopped and screwed technique with Python+FFmpeg

https://github.com/samuelfrench/dj-screw-video-generator
1•samuelfrench9•14m ago•0 comments

Ask HN: Github Account Recovery after a 2fa loss

https://imgur.com/a/kJtR8U3
1•PonyoSunshine•18m ago•1 comments

Show HN: A dynamic, crowdsourced benchmark for AI agents

https://clawdiators.ai
1•shalinmehtaaa•27m ago•0 comments

Show HN: SiClaw – Open-source AIOps with a hypothesis-driven diagnostic engine

https://github.com/scitix/siclaw
2•SherryWong•30m ago•1 comments

The New Apple Begins to Emerge

https://parkerortolani.blog/2026/03/07/the-new-apple-finally-begins.html
1•arto•31m ago•0 comments

AirLLM optimizes inference memory usage

https://github.com/lyogavin/airllm
1•nreece•32m ago•0 comments

Give Up GitHub – Software Freedom Conservancy

https://sfconservancy.org/GiveUpGitHub/
2•nreece•34m ago•1 comments

AI Project Handoff Format

https://github.com/yy4uic-ai/ai-handoff-forma
1•yy4uic•38m ago•1 comments

Commit What You Know of Iran to the Flames

https://www.bloomberg.com/opinion/articles/2026-03-06/oil-shock-commit-what-you-know-of-iran-to-t...
1•petethomas•39m ago•1 comments

Show HN: DailyDefense – Daily tower defense for agents or humans

https://www.dailydefense.ai
1•pj4533•40m ago•0 comments

OpenAI robotics lead Caitlin Kalinowski quits in response to Pentagon deal

https://techcrunch.com/2026/03/07/openai-robotics-lead-caitlin-kalinowski-quits-in-response-to-pe...
3•SilverElfin•40m ago•1 comments

MonoGame: A .NET framework for making cross-platform games

https://github.com/MonoGame/MonoGame
1•azhenley•42m ago•0 comments

A23a was once the biggest in the world iceberg. Now it has just weeks left

https://www.bbc.co.uk/news/resources/idt-20f878f1-f4af-4022-9f62-b0515b9f4b20
1•reconnecting•42m ago•0 comments

Show HN: Too many AI SaaS launching every day so we built Arena where they fight

https://glad-ia-tor.com/
1•GiornoJojo•44m ago•0 comments

Show setup modal with confetti on coverage page when no CI data exists

1•nishiohiroshi•45m ago•0 comments

XC-BASIC3 Space Invaders (Pet Programming Part 3)

https://retrogamecoders.com/xcbasic3-spaceinvaders/
1•ibobev•50m ago•0 comments