The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•11mo ago

Comments

tocs3•11mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised by the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
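
For anyone who wants to see what a "step-by-step reasoning path from A* search" might look like in practice, here is a minimal sketch. It is not the paper's code: the grid world, the `create`/`close` event names, and the function names `a_star_with_trace` and `plan_is_valid` are all assumptions made for illustration. The point it tries to show is that the final answer (the plan) can be checked on its own, separately from the trace of intermediate steps.

```python
import heapq


def a_star_with_trace(walls, size, start, goal):
    """Run A* on a size x size 4-connected grid; return (trace, plan).

    trace: list of ("create" | "close", cell) events in the order they occur
           (hypothetical event names, chosen for this sketch).
    plan:  list of cells from start to goal, or None if the goal is unreachable.
    """
    def h(cell):
        # Manhattan distance: admissible and consistent with unit step costs.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    trace = [("create", start)]
    frontier = [(h(start), 0, start)]  # entries are (f, g, cell)
    parent = {start: None}
    cost = {start: 0}
    closed = set()
    while frontier:
        _, g, cell = heapq.heappop(frontier)
        if cell in closed:
            continue  # stale heap entry, already expanded via a cheaper path
        closed.add(cell)
        trace.append(("close", cell))
        if cell == goal:
            plan = []
            while cell is not None:  # walk parent links back to the start
                plan.append(cell)
                cell = parent[cell]
            return trace, plan[::-1]
        x, y = cell
        for nxt in ((x + 1, y), (x - 1, y), (x, y + 1), (x, y - 1)):
            in_bounds = 0 <= nxt[0] < size and 0 <= nxt[1] < size
            if not in_bounds or nxt in walls:
                continue
            if nxt not in cost or g + 1 < cost[nxt]:
                cost[nxt] = g + 1
                parent[nxt] = cell
                heapq.heappush(frontier, (g + 1 + h(nxt), g + 1, nxt))
                trace.append(("create", nxt))
    return trace, None


def plan_is_valid(walls, size, start, goal, plan):
    """Check the final answer alone, without looking at the trace."""
    if not plan or plan[0] != start or plan[-1] != goal:
        return False
    for a, b in zip(plan, plan[1:]):
        in_bounds = 0 <= b[0] < size and 0 <= b[1] < size
        adjacent = abs(a[0] - b[0]) + abs(a[1] - b[1]) == 1
        if not (adjacent and in_bounds) or b in walls:
            return False
    return True


# Tiny example: a 3x3 grid with a two-cell wall between start and goal.
walls = {(1, 0), (1, 1)}
trace, plan = a_star_with_trace(walls, 3, (0, 0), (2, 0))
print(plan_is_valid(walls, 3, (0, 0), (2, 0), plan))
```

Because the plan check never reads the trace, you could swap in a shuffled or unrelated trace and the same plan would still validate, which is roughly the gap between "right answer" and "right reasoning" that the summary describes.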
