The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•11mo ago

Comments

tocs3•11mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised by the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
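
To make the setup in the summary a bit more concrete, here is a rough Python sketch of what "training on A* traces" versus "training on random traces" could look like as data. It is only an illustration under my own assumptions: the grid world, the `create`/`close` token format, and the helper names `astar_with_trace` and `random_trace` are all hypothetical and are not taken from the paper. The first function runs ordinary A* and logs its steps; the second produces a meaningless trace of the same length that could be paired with the same correct answer.

```python
import heapq
import random


def astar_with_trace(grid, start, goal):
    """Run A* on a 4-connected grid and log the search as tokens.

    grid is a list of strings where '#' marks a wall. Returns (path, trace),
    where trace is a list of hypothetical "create r c" / "close r c" tokens.
    """
    def h(cell):
        # Manhattan distance: admissible and consistent for unit-cost moves.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    open_heap = [(h(start), 0, start)]
    parent = {start: None}
    best_g = {start: 0}
    closed = set()
    trace = [f"create {start[0]} {start[1]}"]

    while open_heap:
        _, g, cell = heapq.heappop(open_heap)
        if cell in closed:
            continue  # stale heap entry for an already expanded cell
        closed.add(cell)
        trace.append(f"close {cell[0]} {cell[1]}")
        if cell == goal:
            # Walk parent links back to the start to recover the path.
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1], trace
        r, c = cell
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue
            if grid[nr][nc] == "#":
                continue
            ng = g + 1
            if ng < best_g.get((nr, nc), float("inf")):
                best_g[(nr, nc)] = ng
                parent[(nr, nc)] = cell
                heapq.heappush(open_heap, (ng + h((nr, nc)), ng, (nr, nc)))
                trace.append(f"create {nr} {nc}")
    return None, trace  # goal unreachable


def random_trace(length, rows, cols, rng):
    """Build a trace of the same length with no relation to any search."""
    ops = ("create", "close")
    return [
        f"{rng.choice(ops)} {rng.randrange(rows)} {rng.randrange(cols)}"
        for _ in range(length)
    ]


grid = ["..#.",
        "..#.",
        "...."]
path, trace = astar_with_trace(grid, (0, 0), (0, 3))
noise = random_trace(len(trace), len(grid), len(grid[0]), random.Random(0))

# Two training targets with the same final answer but different "thoughts".
correct_example = " ".join(trace) + " | " + str(path)
noisy_example = " ".join(noise) + " | " + str(path)
```

As I read the summary, the surprising part is that a model trained on something like `noisy_example` reportedly does about as well as one trained on `correct_example`, even though only the latter's intermediate tokens describe a real search.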
