The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•1y ago

Comments

tocs3•1y ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised by the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
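To make the "check both the final answers and the reasoning steps" part concrete, here is a rough Python sketch. It is not the paper's code: the grid, the `("create" | "close", cell, cost)` trace format, and all function names are assumptions chosen for illustration. It runs A* on a tiny maze, records a trace, and then validates the plan and the trace separately, so a correct answer can be paired with a trace that fails its own check.

```python
import heapq

# Hypothetical toy maze: "#" is a wall, "." is free space.
GRID = ["..#.",
        "..#.",
        "...."]

def astar_with_trace(grid, start, goal):
    """A* on a 4-connected grid. Returns (plan, trace).

    The trace is a list of (op, cell, g) steps with op in {"create", "close"}.
    This format is an assumption for illustration only.
    """
    def h(cell):  # Manhattan distance heuristic
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    trace = [("create", start, 0)]
    frontier = [(h(start), 0, start)]
    parent, best_g, closed = {start: None}, {start: 0}, set()
    while frontier:
        _, g, cell = heapq.heappop(frontier)
        if cell in closed:
            continue  # stale queue entry
        closed.add(cell)
        trace.append(("close", cell, g))
        if cell == goal:
            plan = []
            while cell is not None:
                plan.append(cell)
                cell = parent[cell]
            return plan[::-1], trace
        r, c = cell
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue
            if grid[nr][nc] == "#":
                continue
            ng = g + 1
            if ng < best_g.get((nr, nc), float("inf")):
                best_g[(nr, nc)] = ng
                parent[(nr, nc)] = cell
                heapq.heappush(frontier, (ng + h((nr, nc)), ng, (nr, nc)))
                trace.append(("create", (nr, nc), ng))
    return None, trace

def plan_is_valid(grid, start, goal, plan):
    """Check the final answer only: a wall-free path of unit steps."""
    if not plan or plan[0] != start or plan[-1] != goal:
        return False
    for (r1, c1), (r2, c2) in zip(plan, plan[1:]):
        if abs(r1 - r2) + abs(c1 - c2) != 1:
            return False
        if not (0 <= r2 < len(grid) and 0 <= c2 < len(grid[0])):
            return False
        if grid[r2][c2] == "#":
            return False
    return True

def trace_is_valid(trace):
    """Check the intermediate steps only: every close must follow a matching create."""
    created, closed = set(), set()
    for op, cell, g in trace:
        if op == "create":
            created.add((cell, g))
        elif op == "close":
            if (cell, g) not in created or cell in closed:
                return False
            closed.add(cell)
        else:
            return False
    return True

if __name__ == "__main__":
    plan, trace = astar_with_trace(GRID, (0, 0), (0, 3))
    print(plan_is_valid(GRID, (0, 0), (0, 3), plan))  # answer check
    print(trace_is_valid(trace))                      # trace check
    print(trace_is_valid(trace[::-1]))                # same answer, scrambled trace
```

Reversing the trace leaves the plan untouched but makes the trace check fail, because the first step then closes a cell that was never created. That is the sense in which answer correctness and trace correctness can be measured independently.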
