frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•1y ago

Comments

tocs3•1y ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

My AI coding flow was burning tokens to do things code should do

https://geerttheys.substack.com/p/i-agent-deterministic-coding-flow
1•toadi•6m ago•1 comments

Hosting My Own Newsletter

https://endler.dev/2026/newsletter-setup/
1•yakkomajuri•6m ago•0 comments

Ask HN: Encouraging a child's gaming PC build despite fear of gaming addiction?

1•marttt•17m ago•0 comments

Over $5M in donations flowed in after the Lapu-Lapu Day attack. Where it went

https://www.cbc.ca/news/canada/lapu-lapu-donations-analysis-9.7207684
1•wolpoli•24m ago•0 comments

Huawei Unveils Tau (τ) Scaling Law for Transistor and System Breakthroughs

https://www.huawei.com/en/news/2026/5/ieee-iscas-tau-scaling
1•CalmStorm•25m ago•0 comments

Mecha Comet is an open-source hardware, modular Linux handheld computer

https://www.cnx-software.com/2026/01/25/mecha-comet-is-an-open-source-hardware-modular-linux-hand...
1•walterbell•27m ago•0 comments

Companies Are Just a Graph of Algorithms

https://danielmiessler.com/blog/companies-graph-of-algorithms
5•samuel246•28m ago•0 comments

CanYouCalculate

https://canyoucalculate.com
1•sauhard121•31m ago•0 comments

Porting Ytdlp to Bun (Ytdlb)

https://yamada-blog.pages.dev/blog/0007/
1•curliness•32m ago•0 comments

Software supply-chain attacks are no longer rare events

https://www.wired.com/story/teampcp-software-supply-chain-attack-spree-github/
2•latentframe•32m ago•1 comments

What is Git made of? (2022)

https://zserge.com/posts/git/
1•vinhnx•34m ago•0 comments

RLS sounds great until it isn't

https://planetscale.com/blog/rls-sounds-great-until-it-isntp
1•eigenBasis•34m ago•0 comments

Jira Is Turing-Complete

https://seriot.ch/computation/jira.html
2•vinhnx•34m ago•0 comments

Show HN: Live AI music sequencing agent

https://pretzel.shukant.com/?nickname=Anonymous&role=stage
1•shukantpal•36m ago•0 comments

A Cattle Ranch Is Doing What Ivy League Colleges Can't

https://www.nytimes.com/2026/05/20/opinion/deep-springs-college-ivy-league-education.html
1•gmays•37m ago•0 comments

The Eternal Sloptember

https://geohot.github.io//blog/jekyll/update/2026/05/24/the-eternal-sloptember.html
2•razin•41m ago•0 comments

My friend found idle NAT gateways his team said didn't exist

https://getnable.com/
1•chaandannn•47m ago•1 comments

AI Interpretability Is a Revolutionary Skill

https://www.outcryai.com/research/the-dark-between-the-stars
1•micahwhite•47m ago•1 comments

High-efficiency multi-scale holographic volumetric 3Dprinting with a phase light

https://www.nature.com/articles/s41377-026-02331-4
1•anikoghosyan•49m ago•0 comments

PaaS Platfrom to Deploy Apps

https://nept.cloud
1•nazmussamir•56m ago•0 comments

Command A+: Making sovereign agentic capabilities available to all

https://cohere.com/blog/command-a-plus
1•offbyone42•58m ago•0 comments

Splinter Cell veteran says realistic modern lighting has screwed up stealth game

https://www.rockpapershotgun.com/splinter-cell-veteran-says-realistic-modern-lighting-has-screwed...
3•Tomte•59m ago•0 comments

Weight loss drugs could save airlines money on fuel as Americans slim down

https://www.cbsnews.com/news/weight-loss-drugs-glp1s-airlines-fuel-costs/
1•mattas•1h ago•2 comments

Everlane Finalizes Sale to Shein

https://www.nytimes.com/2026/05/22/style/shein-everlane-fast-fashion-sustainability.html
1•lxm•1h ago•0 comments

Robotaxis Aren't as Autonomous as They Seem

https://junkoyoshidaparis.substack.com/p/robotaxis-arent-as-autonomous-as
2•mattas•1h ago•0 comments

Kids are Graduating Without Being Able to Read [video][34 mins]

https://www.youtube.com/watch?v=PcSApLcxpYc
1•Bender•1h ago•0 comments

Workspace Orchestration

https://hyperspeed.work
2•Asadsangabi•1h ago•1 comments

DeepSeek-V4 KV Cache Explained: Why 1M Context Uses Less VRAM

https://knightli.com/en/2026/05/18/deepseek-v4-kv-cache-compressed-attention/
1•vinhnx•1h ago•0 comments

Lynote Humanize Text – Open-source AI text humanization toolkit

https://github.com/lynote-ai/humanize-text
2•Danny6969•1h ago•0 comments

MetalBench – Benchmark for Apple Silicon's Metal Shading Lang

https://github.com/Lazarus-931/MetalBench
1•AlazarManakelew•1h ago•1 comments