The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•1y ago

Comments

tocs3•1y ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised by the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
