frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•9mo ago

Comments

tocs3•9mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

NYC considers increasing minimum wage to $30 by 2030

https://ny1.com/nyc/all-boroughs/mornings-on-1/2026/03/10/city-council-considers-increasing-minim...
2•geox•2m ago•0 comments

Show HN: Readhn – AI-Native Hacker News MCP Server (Discover, Trust, Understand)

https://github.com/xodn348/readhn
1•xodn348•3m ago•0 comments

Curly Prompt – An AI Prompting Language with Local LLM Powered AI Agent

https://github.com/jimthunderbird/curlyprompt
1•jimthunderbird•6m ago•0 comments

Saudi Arabia declares 2026 as the Year of AI

https://twitter.com/spagov/status/2031365339376660682
1•sibilsalim•8m ago•1 comments

The Creator of Wordle Tries to Solve the Cryptic Crossword

https://www.newyorker.com/culture/the-lede/the-creator-of-wordle-tries-to-solve-the-cryptic-cross...
1•duck•10m ago•0 comments

Microsoft Patch Tuesday, March 2026 Edition

https://krebsonsecurity.com/2026/03/microsoft-patch-tuesday-march-2026-edition/
1•todsacerdoti•11m ago•0 comments

Embrace the Uncertainty

https://brittanyellich.com/embrace-the-uncertainty/
2•todsacerdoti•12m ago•0 comments

AEO: What happens when AI answers instead of linking (4-part series)

https://xergioalex.com/blog/series/aeo-from-invisible-to-cited/
1•xergioalex•13m ago•0 comments

Datafly – data agent that automatically understands any database you connect

https://github.com/dkeviv/datafly
1•aichelles•17m ago•1 comments

Apple's New MacBooks Have a Keyboard Change You Might Have Missed

https://www.macrumors.com/2026/03/10/macbook-keyboard-change/
2•samsolomon•18m ago•0 comments

Show HN: Principled Agentic Software Development

https://www.joegaebel.com/articles/principled-agentic-software-development/
2•joegaebel•20m ago•0 comments

Lynas locks Japanese rare earths deal

https://www.australianmining.com.au/lynas-locks-japanese-rare-earths-deal/
1•dwd•26m ago•2 comments

Ask HN: Getting Someone at Facebook to Listen

1•ccleve•29m ago•2 comments

Ohio judge rules that Kalshi is sports betting and must adhere to state law

https://www.nbcnews.com/news/us-news/ohio-judge-rules-kalshi-sports-betting-must-adhere-state-law...
2•petethomas•31m ago•1 comments

The Beginning of History

https://www.wheresyoured.at/the-beginning-of-history/
1•elorant•35m ago•1 comments

Game Modding with GenAI: A Case Study of Stardew Valley Character Maker

https://arxiv.org/abs/2507.13951
2•azhenley•43m ago•0 comments

Cybertruck Tried to Drive 'Straight Off an Overpass' Attorney Claims

https://www.404media.co/cybertruck-tried-to-drive-straight-off-an-overpass-attorney-claims/
6•malshe•44m ago•1 comments

The History of Stoner.com

https://ron.stoner.com/The_History_Of_Stoner_._com/
1•tinkelenberg•46m ago•0 comments

Wero announces the launch of its ecommerce solution in

https://epicompany.eu/media-insights/wero-announces-launch-ecommerce-in-belgium
1•absqueued•46m ago•0 comments

Building Kepler

https://www.astronomer.io/blog/building-kepler-astronomer-internal-data-assistant/
1•tayloramurphy•48m ago•0 comments

A 1,300-pound NASA spacecraft to re-enter Earth's atmosphere

https://www.bbc.com/news/articles/cd9gwdgg38vo
1•reconnecting•50m ago•0 comments

At what level of deep context engineering does AI output become human-crafted?

1•svstoyanovv•52m ago•0 comments

State of AI 2026: The $600B inference subsidy, energy bottlenecks, and labor

https://lostframe.ai/research
1•willtaubenheim•54m ago•1 comments

Tell HN: Vertical tabs has arrived (behind a flag) in Chrome stable

3•crummy•55m ago•0 comments

Ask HN: Is Starlink still being jammed in Iran?

1•Jblx2•55m ago•1 comments

RoqueOS – an OS to control your homelab (now on the Apple App Store)

https://roqueos.com.br/
1•roqueribeiro•56m ago•1 comments

SSH Is the Agent Internet

https://rolandsharp.com/ssh-is-the-agent-internet/
1•epscylonb•1h ago•0 comments

Show HN: Mumpix – Local-first AI infrastructure and $1B developer grant

https://mumpixdb.com/mumpix-billion-program.html#claim
1•carreraellla•1h ago•2 comments

MPs give ministers powers to restrict Internet

https://www.openrightsgroup.org/press-releases/mps-give-ministers-powers-to-restrict-entire-inter...
2•Jigsy•1h ago•0 comments

Amazon Cognito and FusionAuth Comparison

https://fusionauth.io/blog/amazon-cognito-and-fusionauth-comparison
1•mooreds•1h ago•0 comments