frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•9mo ago

Comments

tocs3•9mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Personal Side Project: Open-Sourcing My VPS Security Toolkit

https://github.com/jaymunshi/vps-sentinel
1•jaymunshi•2m ago•1 comments

Memory in Coding Agents

https://nicoritschel.com/writing/memex/
1•nicoritschel•3m ago•0 comments

Show HN: Instagram auto-poster skill for AI agents (bypasses bot detection)

https://github.com/virixlabs/instagram-poster
1•virixlabs•3m ago•0 comments

Show HN: schematra-app skill (bootstrap your scheme web app using agents)

1•funkaster•7m ago•0 comments

Sidemantic: Universal Metrics Layer

https://github.com/sidequery/sidemantic
1•nicoritschel•9m ago•0 comments

Red Hat takes on Docker Desktop with its enterprise Podman Desktop build

https://thenewstack.io/red-hat-enters-the-cloud-native-developer-desktop-market/
2•CrankyBear•11m ago•0 comments

Did a prize-winning novelist steal a woman's life story?

https://www.theguardian.com/books/2026/feb/17/did-a-prize-winning-novelist-steal-a-woman-life-sto...
1•randycupertino•13m ago•0 comments

Ask HN: Is the original iPhone SE just a brick now?

1•stared•15m ago•1 comments

Novel bond coat material enables thermal barrier coatings to operate at 1,200°C

https://techxplore.com/news/2026-02-bond-coat-material-enables-thermal.html
2•PaulHoule•17m ago•0 comments

Spain has blocked access to freedom.gov

https://twitter.com/Pirat_Nation/status/2025643188321714642
3•akyuu•20m ago•0 comments

Bending Time: Retracing Timezones Off Lines

https://reconnaissance.robincoenen.de/bending-time/
1•leonat•20m ago•0 comments

Intermittent errors in skills-related functionality

https://status.claude.com/incidents/5pr1d63fdjml
1•taoh•20m ago•0 comments

Distribution Is the New Engineering

https://sagivo.com/blog/distribution-is-the-new-engineering
1•sagivo•22m ago•0 comments

Training AI Without the Data You Don't Have

https://docs.eventsourcingdb.io/blog/2026/02/23/training-ai-without-the-data-you-dont-have/
1•goloroden•23m ago•0 comments

Show HN: Skill Kit – Local-first analytics for AI agent skills

https://github.com/crafter-station/skill-kit
1•Hunter17•26m ago•1 comments

Pentagi: Autonomous AI Agents for complex penetration testing tasks

https://github.com/vxcontrol/pentagi
1•nateb2022•26m ago•0 comments

Dear researchers: Is AI all you've got?

https://austinhenley.com/blog/dearresearchers.html
2•nomemory•26m ago•0 comments

Ask HN: Share your workflow with AI developer tools

1•fsto•27m ago•0 comments

New algorithm is designed to obey the laws of physics

https://actu.epfl.ch/news/new-ai-algorithm-is-designed-to-obey-the-laws-of-p/
2•geox•28m ago•0 comments

Japanese Death Poems

https://www.secretorum.life/p/japanese-death-poems-part-3
1•NaOH•29m ago•0 comments

Minnesota court justice quietly negotiated deal over ICE enforcement in courts

https://www.startribune.com/white-house-minnesota-supreme-court-chief-justice-quietly-negotiated-...
2•hn_acker•32m ago•1 comments

Bending the CLOS Mop for Java-Style Single Dispatch

https://atgreen.github.io/repl-yell/posts/clos-mop-dispatch/
1•atgreen•33m ago•1 comments

Play CSS-defined animations with JavaScript – KeyframeKit

https://keyframekit.berryscript.com/
1•barhatsor•35m ago•0 comments

The Mythology of Conscious AI

https://www.noemamag.com/the-mythology-of-conscious-ai/
1•MindGods•42m ago•0 comments

The Tears of Donald Knuth

https://cacm.acm.org/opinion/the-tears-of-donald-knuth/
2•todsacerdoti•43m ago•0 comments

ChatGPT Sees the World

https://twitter.com/elonmusk/status/2025265181266153606
1•anonymousiam•43m ago•1 comments

Show HN: Aeterna – Self-hosted dead man's switch

https://github.com/alpyxn/aeterna
2•alpyxn•44m ago•0 comments

'Peanut butter' pay raises could cost companies their top performers

https://www.cnbc.com/2026/02/22/peanut-butter-pay-raises-could-cost-companies-their-top-performer...
7•cebert•44m ago•2 comments

Show HN: GitHub Issues in the Terminal

https://github.com/JayanAXHF/gitv
2•frxgfa•45m ago•0 comments

Robots, Grannies and Meaning-Adjusted Work Days

https://twitter.com/notevenwrongg/status/2025656572458746156
2•georgestrakhov•48m ago•0 comments