The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•11mo ago

Comments

tocs3•11mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
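
For anyone who wants to see what "reasoning steps based on A* search" could look like in practice, here is a minimal sketch. It assumes a small grid maze with unit step costs and a Manhattan-distance heuristic. The `create`/`close` trace format and every name in it (`astar_with_trace`, the grid encoding) are illustrative assumptions, not the paper's actual data format. Each search operation is logged, so the log can serve as the "intermediate tokens" and the returned path as the final answer.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """Run A* on a 4-connected grid (0 = free, 1 = wall).

    Returns (path, trace). The trace is a list of
    (op, cell, g, h) tuples, a hypothetical stand-in for the
    step-by-step tokens a model could be trained on.
    """
    def h(cell):
        # Manhattan distance: admissible and consistent for unit costs.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    trace = [("create", start, 0, h(start))]
    open_heap = [(h(start), 0, start)]  # entries are (f, g, cell)
    best_g = {start: 0}
    parent = {start: None}
    closed = set()

    while open_heap:
        _, g, cell = heapq.heappop(open_heap)
        if cell in closed:
            continue  # stale duplicate entry
        closed.add(cell)
        trace.append(("close", cell, g, h(cell)))

        if cell == goal:
            # Walk parent links back to the start to rebuild the path.
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1], trace

        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue  # off the grid
            if grid[nr][nc] == 1:
                continue  # wall
            ng = g + 1
            if ng < best_g.get(nxt, float("inf")):
                best_g[nxt] = ng
                parent[nxt] = cell
                heapq.heappush(open_heap, (ng + h(nxt), ng, nxt))
                trace.append(("create", nxt, ng, h(nxt)))

    return None, trace  # goal unreachable


# A 3x3 maze whose middle row is walled off except the right column.
grid = [
    [0, 0, 0],
    [1, 1, 0],
    [0, 0, 0],
]
path, trace = astar_with_trace(grid, (0, 0), (2, 0))
```

Because both the path and the trace come from the same run, each can be checked separately: the path by validating it against the maze, the trace by replaying it against A*. Under the same assumptions, the "random reasoning steps" condition from the summary would amount to pairing each maze with a trace that was not produced by solving that maze.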
