The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised by the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
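For readers who want a concrete picture of a "step-by-step reasoning path" derived from A* search, here is a minimal sketch. It is not the paper's code. The grid encoding, the `astar_with_trace` name, and the `(op, cell, g, h)` trace format are all assumptions made for illustration. It runs A* on a small grid and records each frontier insertion and node expansion, which is the kind of intermediate trace a model could be trained on alongside the final path.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """Run A* on a 4-connected grid and return (path, trace).

    grid is a list of equal-length strings where '#' marks a wall.
    trace is a list of (op, cell, g, h) tuples with op in
    {"create", "close"}; this format is hypothetical, not the paper's.
    """
    def h(cell):
        # Manhattan distance: admissible and consistent with unit step costs.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    trace = [("create", start, 0, h(start))]
    best_g = {start: 0}
    parent = {start: None}
    frontier = [(h(start), 0, start)]  # entries are (f, g, cell)
    closed = set()

    while frontier:
        _, g, cell = heapq.heappop(frontier)
        if cell in closed:
            continue  # stale entry; a cheaper one was already expanded
        closed.add(cell)
        trace.append(("close", cell, g, h(cell)))

        if cell == goal:
            # Walk parent links back to the start to recover the path.
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1], trace

        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            inside = 0 <= nr < len(grid) and 0 <= nc < len(grid[0])
            if not inside or grid[nr][nc] == "#":
                continue
            ng = g + 1
            if ng < best_g.get(nxt, float("inf")):
                best_g[nxt] = ng
                parent[nxt] = cell
                heapq.heappush(frontier, (ng + h(nxt), ng, nxt))
                trace.append(("create", nxt, ng, h(nxt)))

    return None, trace  # goal unreachable


grid = ["...",
        ".#.",
        "..."]
path, trace = astar_with_trace(grid, (0, 0), (2, 2))
print(path)
print(len(trace), "trace steps")
```

Under these assumptions, a "correct" training example would pair the problem with `trace` and `path`, while a "random" one would pair the same problem and `path` with a trace taken from an unrelated problem. That is only one reading of the setup as summarized above.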
