frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•1y ago

Comments

tocs3•1y ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Building a safe, effective sandbox to enable Codex on Windows

https://openai.com/index/building-codex-windows-sandbox/
1•gmays•4m ago•0 comments

Show HN: Visual composer for Claude Code multi-agent workflows

https://github.com/fayzan123/claude-workflow-composer
1•FayzanMalik•5m ago•0 comments

Show HN: I turned my personal website into a bash shell (with Vim)

https://darrikonn.com
2•darrikonn•6m ago•0 comments

'Major chemical explosion' kills, injures multiple people

https://www.usatoday.com/story/news/nation/2026/05/26/washington-chemical-explosion-nippon-dynawa...
2•SilverElfin•6m ago•1 comments

AI agents imperiled by critical vulnerability in open source package

https://arstechnica.com/information-technology/2026/05/millions-of-ai-agents-imperiled-by-critica...
1•ylk•6m ago•0 comments

Docker on Windows Server Felt Easier After I Tried VisualDock Server

https://www.virtualizationhowto.com/2026/05/docker-on-windows-server-finally-felt-easier-after-i-...
4•chemodax•8m ago•0 comments

The Misuse of Memory

https://www.krezl.com/essays/the-misuse-of-memory
1•prad_r•9m ago•0 comments

On the Laser-Fusion Milestone (2022)

https://inference-review.com/article/on-the-laser-fusion-milestone
1•tensegrist•10m ago•0 comments

The Vibe Coding Era: Why AI Won't Replace Software Engineers [video]

https://www.youtube.com/watch?v=xYU7zaaRjmE
1•regus•11m ago•0 comments

Turning You into a Power User with Hybrid Memory and Claude

https://medium.com/@vektormemory/turning-you-into-a-power-user-hybrid-memory-ssh-cloak-and-passwo...
2•vektormemory•15m ago•0 comments

Soft Serve – Self-hostable Git server for the command line

https://github.com/charmbracelet/soft-serve
1•helloplanets•17m ago•0 comments

Made (By Humans) in California

https://carette.xyz/posts/made_by_humans_in_california/
2•LucidLynx•19m ago•0 comments

PyCon US 2026 Packaging Summit Recap

https://bernat.tech/posts/pycon-us-2026-packaging-summit-recap/
1•nodivbyzero•20m ago•0 comments

AI agents are scrambling power users' brains

https://www.axios.com/2026/04/04/ai-agents-burnout-addiction-claude-code-openclaw
2•jjtang1•21m ago•0 comments

Life Is Short (2016)

https://paulgraham.com/vb.html
1•chistev•21m ago•0 comments

Grimm Brothers' Children's and Household Tales (Grimms' Fairy Tales)

https://sites.pitt.edu/~dash/grimmtales.html
1•Anthony-G•22m ago•0 comments

Hello, Ordinary Blog

https://ordinary.blog/posts/hello-ordinary-blog/
2•seanwatters•23m ago•0 comments

Phloto for My Photo Flow

https://cceckman.com/writing/phloto/
1•evakhoury•29m ago•0 comments

MiniMax teased M3 Sparse Attention: 9.7x prefilling, 15.6x decoding at 1M

https://twitter.com/SkylerMiao7/status/2059285750458544561
2•rebekkamikkoa•31m ago•0 comments

My First Web App for Monetization

https://engine.redsystem.dev
1•crlapples•32m ago•0 comments

Iran president ends Internet blackout, orders access to be restored

https://thehill.com/policy/international/5896061-internet-access-restored-iran/
3•Animats•32m ago•2 comments

A Dose of Hope for the Future

https://productnow.ai/blogs/a-dose-of-hope-for-the-future
1•kadhirvelm•34m ago•0 comments

Ask HN: Has AI affected negatively the job market for devs?

2•adinhitlore•39m ago•1 comments

Google I/O 2026: Sundar Pichai's opening keynote

https://blog.google/innovation-and-ai/sundar-pichai-io-2026/
2•gmays•39m ago•1 comments

Ask HN: Sudden spike in web traffic 19-21 May?

1•haemdahl•41m ago•0 comments

Pieces

https://bitcointalk.org/index.php?topic=5584120.0
1•johndebord•41m ago•0 comments

Scientists Found a New Type of Crystal Formed by the First Nuclear Explosion

https://www.popularmechanics.com/science/a71305388/crystal-formed-by-nuclear-explosion/
1•danielmorozoff•41m ago•0 comments

Huawei touts chip design breakthrough in bid to defy U.S. sanctions

https://www.nbcnews.com/world/asia/chinas-huawei-touts-chip-design-breakthrough-bid-defy-us-sanct...
6•billybuckwheat•42m ago•0 comments

An old interview of Dijkstra (1985)

https://www.cs.utexas.edu/~EWD/misc/vanVlissingenInterview.html
1•rajveerb•43m ago•1 comments

Workspace Intelligence

https://workspace.google.com/blog/product-announcements/introducing-workspace-intelligence
1•kamphey•43m ago•1 comments