frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•6mo ago

Comments

tocs3•6mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

OPenn

https://openn.library.upenn.edu/
1•gone35•18s ago•0 comments

Show HN: Dialed – A Radial Calendar App for iOS

https://apps.apple.com/us/app/dialed-radial-day-planner/id6755455859
1•sirkaiwade•1m ago•0 comments

Venture Capital Investments by U.S. Academic Medical Centers

https://www.nejm.org/doi/full/10.1056/NEJMp2508860
1•0xWTF•4m ago•0 comments

Why sci-fi novelist Iain M. Banks was an 'astounding' world-builder

https://www.newscientist.com/article/2506129-why-sci-fi-novelist-iain-m-banks-was-an-astounding-w...
2•petethomas•4m ago•0 comments

How wealth dies

https://surplusenergyeconomics.wordpress.com/2025/11/02/314-how-wealth-dies/
2•martinlaz•5m ago•0 comments

Neural CA for Self-Assembly – Decentralized, Self-Repairing (0.003 MSE)

https://github.com/windstorm12/neural-CA-Automata
1•windstorm12•6m ago•1 comments

Public GitLab repositories exposed more than 17,000 secrets

https://www.bleepingcomputer.com/news/security/public-gitlab-repositories-exposed-more-than-17-00...
1•dangtony98•9m ago•1 comments

Study Reveals How Tattoo Ink Affects the Immune System

https://www.swissinfo.ch/eng/various/tattoos-influence-the-defence-system-of-mice/90522006
4•croh•10m ago•0 comments

DuckDB 1.4.2 LTS

https://duckdb.org/2025/11/12/announcing-duckdb-142
2•kermatt•11m ago•0 comments

U.S. may eliminate income tax by end of current admin

https://www.axios.com/2025/11/28/trump-tariffs-income-taxes
4•hereme888•12m ago•2 comments

France Creates Voluntary Military Service as Europe Faces Russian Threat

https://www.nytimes.com/2025/11/27/world/europe/france-military-service.html
4•bookofjoe•16m ago•1 comments

Show HN: Cooltechblogs POC: local-first blog post aggregator

https://www.cooltechblogs.com/
1•phillvdm•19m ago•0 comments

The Battle over Africa's Great Untapped Resource: IP Addresses

https://www.wsj.com/business/telecom/africa-ip-addresses-china-3e543b9d
3•ajuhasz•19m ago•0 comments

Imgur Geo-Blocked the UK, So I Geo-Unblocked My Network

https://blog.tymscar.com/posts/imgurukproxy/
21•tymscar•20m ago•2 comments

Staff at Irish Meta client firm told 400 jobs at risk

https://www.thejournal.ie/meta-job-losses-6887604-Nov2025/
7•lawlessone•21m ago•0 comments

The Hidden Language of Our Body's Rhythms

https://thereader.mitpress.mit.edu/the-hidden-language-of-our-bodys-rhythms/
3•the-mitr•29m ago•0 comments

Quanta Convert and Quantize AI Models

https://github.com/Mainframework/Quanta
1•trilogic•33m ago•1 comments

Poll HN: What operating system do you primarily develop on?

49•dennis-tra•33m ago•68 comments

28M Hacker News comments as vector embedding search dataset

https://clickhouse.com/docs/getting-started/example-datasets/hackernews-vector-search-dataset
33•walterbell•34m ago•6 comments

Mystery of the Quincunx's Missing Quincunx

https://blog.plover.com/history/quincunx.html
2•masfuerte•35m ago•0 comments

Understand your funnel with Micro Conversions

https://cleancommit.io/blog/micro-conversions/
1•mrkaluzny•36m ago•0 comments

Show HN: Encryptalotta – Free client-side PGP encryption tool for files

https://encryptalotta.com/
2•hireclay•37m ago•0 comments

Intel Secures Apple as Foundry Customer for Future M-Series Chips

https://winbuzzer.com/2025/11/28/report-intel-secures-apple-as-foundry-customer-for-future-m-seri...
5•walterbell•40m ago•1 comments

"Vibe coder" on X claims SaaS makes $60k while app is trivially insecure

https://twitter.com/jp_kiser/status/1994450874207236206
6•johnpaulkiser•41m ago•3 comments

Your Loneliness Was a Design Decision Made by Your Enemy

https://margaretkilljoy.substack.com/p/you-loneliness-was-a-design-decision
7•sandboxdev•41m ago•0 comments

Rock Paper Scissors Solitaire

https://klezlab.it/rock-paper-scissors-solitaire.html
5•klez•42m ago•1 comments

Ask HN: Which cloud provider do you like best and why?

7•trio8453•42m ago•3 comments

Tips for effective prototyping with Rails 8 and Claude Code

https://www.wyeworks.com/blog/2025/11/26/tips-for-effective-prototyping-rails-claude-code/
4•wyeworks•42m ago•0 comments

Show HN: Made a thing to use AI with intervals.icu

https://intervals.pro
3•maxrev17•43m ago•0 comments

This tool might beat NotebookLM at its own game

https://www.xda-developers.com/tool-might-beat-notebooklm/
1•vidyesh•47m ago•0 comments