The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•9mo ago

Comments

tocs3•9mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps, and surprisingly, they still perform about the same as, and sometimes even better than, those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
