The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•1y ago

Comments

tocs3•1y ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised by the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
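To make the summary's distinction between a correct answer and a correct reasoning trace concrete, here is a rough Python sketch. It is not the paper's setup: the toy grid, the trace format (the list of cells in the order A* expands them), and the helper names `astar`, `path_is_valid` and `trace_is_valid` are all assumptions made for illustration. The point is only that, once traces come from a known solver, the final answer and the intermediate steps can be checked separately.

```python
import heapq

def is_free(grid, cell):
    """True if cell is inside the grid and not a wall (0 = free, 1 = wall)."""
    r, c = cell
    return 0 <= r < len(grid) and 0 <= c < len(grid[0]) and grid[r][c] == 0

def adjacent(a, b):
    """True if two cells are 4-connected neighbours."""
    return abs(a[0] - b[0]) + abs(a[1] - b[1]) == 1

def astar(grid, start, goal):
    """A* on a 4-connected unit-cost grid.

    Returns (path, trace). path is the list of cells from start to goal,
    or None if the goal is unreachable. trace is the list of cells in the
    order they were expanded (a hypothetical stand-in for a reasoning trace).
    """
    def h(cell):
        # Manhattan distance, admissible for unit-cost 4-connected moves.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    frontier = [(h(start), 0, start)]
    parent = {start: None}
    cost = {start: 0}
    trace = []
    while frontier:
        _, g, cell = heapq.heappop(frontier)
        if g > cost[cell]:
            continue  # stale queue entry, a cheaper route was found later
        trace.append(cell)
        if cell == goal:
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1], trace
        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if is_free(grid, nxt) and g + 1 < cost.get(nxt, float("inf")):
                cost[nxt] = g + 1
                parent[nxt] = cell
                heapq.heappush(frontier, (g + 1 + h(nxt), g + 1, nxt))
    return None, trace

def path_is_valid(grid, start, goal, path):
    """Check the final answer only: a connected, wall-free path start -> goal."""
    if not path or path[0] != start or path[-1] != goal:
        return False
    if not all(is_free(grid, cell) for cell in path):
        return False
    return all(adjacent(a, b) for a, b in zip(path, path[1:]))

def trace_is_valid(grid, start, trace):
    """Check the intermediate steps only (an assumed, deliberately loose rule):
    the trace starts at start, visits only free cells, and each expanded cell
    neighbours some cell expanded earlier."""
    if not trace or trace[0] != start:
        return False
    if not all(is_free(grid, cell) for cell in trace):
        return False
    return all(any(adjacent(cell, prev) for prev in trace[:i])
               for i, cell in enumerate(trace) if i > 0)

if __name__ == "__main__":
    grid = [[0, 0, 0],
            [1, 1, 0],
            [0, 0, 0]]
    path, trace = astar(grid, (0, 0), (2, 0))
    scrambled = trace[::-1]  # same cells, but no longer a legal expansion order
    print(path_is_valid(grid, (0, 0), (2, 0), path))
    print(trace_is_valid(grid, (0, 0), trace))
    print(trace_is_valid(grid, (0, 0), scrambled))
```

Pairing the valid `path` with the `scrambled` trace gives an example of the situation the summary describes: the final answer passes its check while the accompanying steps do not. Whether a model trained on such pairs still performs well is the paper's empirical claim, and nothing in this sketch tests it.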
