The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•11mo ago

Comments

tocs3•11mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps, and surprisingly, they still perform about as well as, and sometimes even better than, those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
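
For readers who have not seen a search trace before, the setup summarized above can be sketched roughly as follows. This is a minimal illustration, not the paper's code: the grid, the `("create" | "close", cell, g, h)` trace format, and the helper names `astar_with_trace` and `plan_is_valid` are all assumptions made for this example. The point it shows is that the final plan can be checked on its own, separately from whatever trace precedes it.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """Run A* on a 4-connected grid ('#' marks a wall).

    Returns (trace, plan). The trace is a list of hypothetical
    ("create" | "close", cell, g, h) records; the plan is the list of
    cells from start to goal, or None if the goal is unreachable.
    """
    def h(cell):
        # Manhattan distance, admissible and consistent on a unit-cost grid.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    trace = [("create", start, 0, h(start))]
    frontier = [(h(start), 0, start)]
    best_g = {start: 0}
    parent = {start: None}
    closed = set()

    while frontier:
        _, g, cell = heapq.heappop(frontier)
        if cell in closed:
            continue  # stale queue entry for an already expanded cell
        closed.add(cell)
        trace.append(("close", cell, g, h(cell)))
        if cell == goal:
            plan = []
            while cell is not None:
                plan.append(cell)
                cell = parent[cell]
            return trace, plan[::-1]
        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue
            if grid[nr][nc] == "#":
                continue
            ng = g + 1
            if ng < best_g.get(nxt, float("inf")):
                best_g[nxt] = ng
                parent[nxt] = cell
                heapq.heappush(frontier, (ng + h(nxt), ng, nxt))
                trace.append(("create", nxt, ng, h(nxt)))
    return trace, None


def plan_is_valid(grid, start, goal, plan):
    """Check a plan without looking at any trace at all."""
    if not plan or plan[0] != start or plan[-1] != goal:
        return False
    rows, cols = len(grid), len(grid[0])
    for r, c in plan:
        if not (0 <= r < rows and 0 <= c < cols) or grid[r][c] == "#":
            return False
    # Consecutive cells must be exactly one step apart.
    return all(
        abs(r1 - r2) + abs(c1 - c2) == 1
        for (r1, c1), (r2, c2) in zip(plan, plan[1:])
    )


# Illustrative 3x3 maze with one wall in the middle (made up for this sketch).
GRID = ["...",
        ".#.",
        "..."]
trace, plan = astar_with_trace(GRID, (0, 0), (2, 2))
print(len(trace), "trace records;", "plan valid:", plan_is_valid(GRID, (0, 0), (2, 2), plan))
```

Because `plan_is_valid` never reads the trace, a training example could pair a correct plan with an unrelated or corrupted trace and the answer would still score as correct, which is the kind of mismatch the summary describes.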

One Startup Is Gambling. Ten Is Mathematics

https://www.mynameisfeng.com/blog/one-startup-is-gambling-ten-is-mathematics
1•edward8628•2m ago•0 comments

Emergent Quantization from a Dynamic Vacuum

https://journals.aps.org/prresearch/abstract/10.1103/l8y7-r3rm
1•davedx•5m ago•0 comments

What Is BusyBox?

https://specular.fi/post/what-is-busybox
1•maxloh•5m ago•0 comments

Social Media Is Now Parasocial Media

https://journals.sagepub.com/doi/10.1177/20563051261437487
1•john-doe•6m ago•0 comments

Too many people have peed in the pool (2016)

https://www.stephenfry.com/2016/02/peedinthepool/
1•downbad_•6m ago•0 comments

Redis and the Cost of Ambition

https://charlesleifer.com/blog/redis-and-the-cost-of-ambition/
1•maxloh•6m ago•0 comments

Notes on the Physics of Startups by Rob Snyder [video]

https://www.youtube.com/watch?v=6ZJOruNRdpo
1•marttilaine•7m ago•0 comments

Free AI hiring toolkit for startup founders

https://hirelikeapro.app/
1•MihaiVR•10m ago•0 comments

Math reveals the one game of chance you should always accept

https://www.scientificamerican.com/article/math-reveals-the-one-game-of-chance-you-should-always-...
1•beardyw•10m ago•1 comment

The Stream Virtual Machine [pdf]

https://web.archive.org/web/20140921203922/http://metagraph.org/papers/stream_virtual_machine.pdf
1•tosh•12m ago•0 comments

Largest survey of physicists puts Standard Model of cosmology under scrutiny

https://phys.org/news/2026-05-largest-survey-physicists-standard-cosmology.html
1•mtdewcmu•13m ago•0 comments

Utah's 'hyperscale' data center could create heat island near Great Salt Lake

https://www.sltrib.com/news/environment/2026/05/07/utahs-data-center-could-create/
1•xyzal•14m ago•0 comments

WireGuard: Fast, modern, secure VPN tunnel

https://www.wireguard.com/
1•janandonly•18m ago•0 comments

Lumo Chat Export

1•carlostkd•20m ago•0 comments

England Runestones

https://en.wikipedia.org/wiki/England_runestones
1•cl3misch•20m ago•0 comments

Resolving Neighborhood Info with HTTP Range Requests

https://github.com/kevmo314/browser-district
1•kevmo314•28m ago•0 comments

Execs admit AI makes them value human workers less

https://www.theregister.com/ai-ml/2026/05/13/execs-admit-ai-makes-them-value-human-workers-less/5...
3•beardyw•29m ago•0 comments

The AI Tribunal of Truth

https://objection.ai/
1•pretext•32m ago•0 comments

The Unethical Guide to Surviving AI Layoffs [video]

https://www.tiktok.com/@atmoio/video/7638649825382190350
1•theletterf•38m ago•0 comments

Why I Left the Network

https://projects.propublica.org/why-i-left-the-network/
1•mynameisash•40m ago•0 comments

2001: A Space Odyssey

https://typesetinthefuture.com/2014/01/31/2001-a-space-odyssey/
1•andsoitis•43m ago•0 comments

C++26: Standard Library Hardening

https://www.sandordargo.com/blog/2026/05/13/cpp26-library-hardening
2•ingve•43m ago•2 comments

Zerobrew

https://github.com/lucasgelfond/zerobrew
1•zeristor•44m ago•0 comments

"will I be okay?"

https://arstechnica.com/tech-policy/2026/05/will-i-be-ok-teen-died-after-chatgpt-pushed-deadly-mi...
3•yawpitch•50m ago•2 comments

Keep Claude working toward a goal

https://code.claude.com/docs/en/goal
2•pretext•51m ago•0 comments

pg_DuckDB: DuckDB-powered Postgres for high performance apps and analytics

https://github.com/duckdb/pg_duckdb
1•tosh•53m ago•0 comments

Don't Hold My Data Hostage – A Case for Client Protocol Redesign (2017)

https://duckdb.org/library/dont-hold-my-data-hostage/
1•tosh•58m ago•0 comments

gpustats: GPU Library for Statistical Computing in Python (2011) [pdf]

https://proceedings.scipy.org/articles/Majora-ebaa42b7-003.pdf
1•tosh•1h ago•0 comments

The Aesthetic Problem of Namespacing

https://www.gingerbill.org/article/2026/05/13/aesthetic-namespacing/
2•thdr•1h ago•0 comments

AI for Practical Longevity

https://github.com/forever-healthy/AI4L
1•negura•1h ago•1 comment