frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•9mo ago

Comments

tocs3•9mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Tiny Terminal

https://meimakes.com/tiny-terminal/
1•the-mitr•2m ago•0 comments

Production query plans without production data

https://boringsql.com/posts/portable-stats/
1•todsacerdoti•3m ago•0 comments

Russia targets Signal and WhatsApp accounts in cyber campaign

https://english.aivd.nl/latest/news/2026/03/09/russia-targets-signal-and-whatsapp-accounts-in-cyb...
1•HelloUsername•4m ago•0 comments

Mahjong Mentor

https://mj-mentor.lovable.app/
1•dpzl•6m ago•1 comments

Fear Is Destroying Your Org

https://yanivpreiss.com/2026/03/08/fear-is-destroying-your-org/
2•PretzelFisch•6m ago•0 comments

Show HN: Yawn – Yet Another Worktree Navigator (CLI, Pipes into Fzf)

https://github.com/ComeBertrand/yawn
1•ComeBertrand•9m ago•0 comments

Show HN: Unpinched – open-source PinchTab and CDP bridge detector

https://github.com/Helixar-AI/Unpinched
1•Siri_D•9m ago•0 comments

API Traffic Analyzer for Kubernetes

https://kubeshark.com/
1•l1am0•12m ago•0 comments

Hermes Agent

https://github.com/NousResearch/hermes-agent
1•tosh•13m ago•0 comments

Show HN: commitgen-cc – Generate Conventional Commit message locally with Ollama

https://github.com/Eaglemann/commitgen-cc
1•eagleman•13m ago•1 comments

Ask HN: What is your oldest living presence on the World Wide Web?

2•dhruv3006•16m ago•0 comments

The Economy of Loneliness

https://app.dateseriously.com/the-economy-of-loneliness.html
1•skanderbm•16m ago•0 comments

Show HN: Think Better – Inject Decision Frameworks into Claude and Copilot

https://github.com/HoangTheQuyen/think-better
1•hoangthequyen01•20m ago•0 comments

35 days in. 28 training cycles across 3 base models

https://forgeintelligence.substack.com/p/forge-intelligence-edition-5
1•beakmull•20m ago•0 comments

Returning to Rails in 2026

https://www.markround.com/blog/2026/03/05/returning-to-rails-in-2026/
4•todsacerdoti•21m ago•2 comments

Russian hackers breach Signal and WhatsApp accounts officials, Netherlands warns

https://www.reuters.com/world/europe/russia-backed-hackers-breach-signal-whatsapp-accounts-offici...
2•repelsteeltje•21m ago•0 comments

Gyro-Claw – Secure execution runtime for AI agents

4•gyroscape•22m ago•0 comments

Building GREMLIN's Lair

https://peebs.org/building-gremlins-lair/
2•nemesisj•22m ago•0 comments

Every language should have a UUID type

https://nafees.bearblog.dev/every-language-should-have-a-uuid-type/
2•mnafees•28m ago•0 comments

Building a Stripe dashboard with an ESP32 desktop clock and Rust

https://duggan.ie/posts/building-a-stripe-dashboard-with-an-esp32-desktop-clock-and-rust
1•duggan•28m ago•1 comments

Show HN: VS Code Agent Kanban: Task Management for the AI-Assisted Developer

https://www.appsoftware.com/blog/introducing-vs-code-agent-kanban-task-management-for-the-ai-assi...
2•gbro3n•29m ago•0 comments

Show HN: SubstanceWiki – Open-source encyclopedia of psychoactive substances

https://substancewiki.org
2•toprak123•34m ago•0 comments

What if you never had to get an API key ever again?

https://stevekrouse.com/x402
2•Tiberium•34m ago•0 comments

A willingness to look stupid is the most underrated moat in doing creative work

https://sharif.io/looking-stupid
2•Samin100•35m ago•0 comments

Why use F# for scripting and automation?

https://iev.ee/blog/why-use-fsharp/
2•nsm•38m ago•2 comments

Custom Agents in Visual Studio

https://devblogs.microsoft.com/visualstudio/custom-agents-in-visual-studio-built-in-and-build-you...
1•ankitg12•40m ago•1 comments

Advice for Operating a Public-Facing API (2023)

https://jcs.org/2023/07/12/api
2•wrxd•40m ago•0 comments

Show HN: TemplUI v1.7.0 – UI components for Go and templ, now with import mode

https://github.com/templui/templui
1•axadrn•40m ago•0 comments

Gemini Exporter – Save Gemini to PDF, Word, Google Docs and Notion

https://chromewebstore.google.com/detail/gemini-exporter-save-gemi/lgipeakgdkcgnkdljeagconfbfeolidj
3•backrun•41m ago•2 comments

Ireland shuts last coal plant, becomes 15th coal-free country in Europe

https://www.pv-magazine.com/2025/06/20/ireland-coal-free-ends-coal-power-generation-moneypoint/
4•robin_reala•42m ago•0 comments