frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Blogs.hn

https://blogs.hn
1•surprisetalk•42s ago•0 comments

A Light from the Periphery

https://aeon.co/essays/why-satyendra-nath-bose-was-more-than-einsteins-sidekick
1•rifish•1m ago•0 comments

Technological dependence on American software and cloud services

https://www.cigref.fr/technological-dependence-on-american-software-and-cloud-services-an-assessm...
1•DyslexicAtheist•2m ago•0 comments

The 12,000-Year Solar Cycle and other Space Weather – Stefan Burns [video]

https://www.youtube.com/watch?v=HxsIZ4vVImo
1•keepamovin•2m ago•0 comments

Show HN: How a Hacker News User's AI Posts and Comments Have Evolved Over Time

https://hnai.vercel.app/
1•skydiver7373•3m ago•0 comments

Nearly all Epstein files still unreleased a month after Congress deadline

https://www.theguardian.com/us-news/2026/jan/19/jeffrey-epstein-files-unreleased-trump-doj
1•treadump•3m ago•0 comments

Reader Scores and Commenting

https://pitchfork.com/news/a-new-era-for-pitchfork-introducing-reader-scores-and-commenting/
1•pentagrama•6m ago•0 comments

The reality of trying to make US manufacturing great again

https://www.ft.com/content/33eae8e0-e724-4161-8ed9-7a0ff7d816b8
1•youngtaff•6m ago•1 comments

A renewed commitment to strengthening the United Nations

https://blogs.microsoft.com/on-the-issues/2026/01/20/a-renewed-commitment-to-strengthening-the-un...
1•chmaynard•7m ago•0 comments

OSS ChatGPT WebUI – 530 Models, Tools, MCP, Gemini RAG, Image/Audio Gen

https://llmspy.org
1•mythz•7m ago•0 comments

Guide to designing and testing memory for AI agents

https://theevalloop.substack.com/p/testing-ai-agent-memory-guide
1•dbult•9m ago•0 comments

Show HN: Ocrbase – pdf → .md/.json document OCR and structured extraction API

https://github.com/majcheradam/ocrbase
1•adammajcher•10m ago•0 comments

Show HN: Modo – Describe hardware, get a buildable prototype

https://modo.is
1•Beefin•10m ago•1 comments

Show HN: Repere – Local-first SQL data explorer, no uploads needed

https://repere.ai
1•mattismegevand•10m ago•0 comments

Ham Radio Crackdown in Belarus: Amateurs Face Death

https://steanlab.medium.com/mayday-389f5713fee4
2•DyslexicAtheist•11m ago•0 comments

Nats.io JetStream with PyTorch: lightweight compositional learning

https://docs.nats.io/nats-concepts/jetstream
1•northlondoner•13m ago•1 comments

A difficult case: Diagnosis made by hallucinatory voices

https://www.bmj.com/content/315/7123/1685
2•Anon84•13m ago•0 comments

An A.I. Startup Says It Wants to Empower Workers, Not Replace Them

https://www.nytimes.com/2026/01/20/technology/humans-ai-anthropic-xai.html
1•smoser•14m ago•0 comments

Show HN: Claude skill that scores X posts using X's open-source algorithm

https://github.com/tonkotsuboy/x-impact-checker
1•tonkotsuboy_com•16m ago•0 comments

Front End Architecture Has Reached Its Reasoning Moment

https://spynejs.com/blog/frontend-architecture-has-met-its-reasoning-moment
1•nybatista•16m ago•0 comments

Tech Lead Antipatterns

https://newsletter.terminalprompt.com/p/tech-lead-antipatterns
1•joaoqalves•18m ago•0 comments

Generate a Video from Every Link

https://twitter.com/yonatanbd/status/2013556587911119143
1•yonatan06•20m ago•0 comments

Building an Awesome-Compliance List

https://awesome-compliance.com/
1•ArthurMx•21m ago•0 comments

Netflix Upgrades Warner Bros. Deal to All Cash

https://variety.com/2026/tv/news/netflix-warner-bros-deal-all-cash-shareholder-vote-1236635142/
1•geox•22m ago•0 comments

Ask HN: Call for open community parallel verification system for desktop apps

1•zameermfm•22m ago•0 comments

Show HN: PiRecall – Pi digits memorization game

https://pirecall.netlify.app/
1•ZpJuUuNaQ5•23m ago•0 comments

OpenTelemetry Tracing in 200 lines of code (2024)

https://jeremymorrell.dev/blog/minimal-js-tracing/
1•tosh•23m ago•0 comments

Show HN: AgentCommander - workflow engine for evolutionary code optimization

https://github.com/mx-Liu123/AgentCommander
1•mx-Liu123•24m ago•2 comments

Ask HN: How do solo founders handle security?

2•massi24•24m ago•0 comments

Zohran Mamdani Needs to Create Popular Assemblies

https://jacobin.com/2025/12/mamdani-popular-assemblies-democratic-socialism/
1•PaulHoule•24m ago•1 comments