frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•6mo ago

Comments

tocs3•6mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Opencoil – appropriating inductive charging pads in the wild (2020) [video]

https://media.ccc.de/v/rc3-11575-opencoil_a_roaming_speedshow
1•thenthenthen•31s ago•0 comments

Against the Federal Moratorium on State-Level Regulation of AI

https://www.schneier.com/blog/archives/2025/12/against-the-federal-moratorium-on-state-level-regu...
1•hn_acker•2m ago•0 comments

FBI Foils New Year's Eve Attacks in Southern California

https://www.cnn.com/2025/12/15/politics/fbi-los-angeles-turtle-island-bomb-plot
1•puttycat•2m ago•0 comments

Pregnant with Monsters

https://www.lrb.co.uk/the-paper/v47/n22/terry-eagleton/pregnant-with-monsters
1•Caiero•2m ago•0 comments

The Modern Software Developer

https://themodernsoftware.dev/
1•United857•5m ago•0 comments

The Deadweight Loss of Entertainment

https://moultano.wordpress.com/2025/12/09/the-dead-weight-loss-of-entertainment/
1•moultano•5m ago•0 comments

A public API for LLM response time tracking

https://metrik-dashboard.vercel.app/docs/api
1•meh_bouassamii•8m ago•0 comments

Texas universities deploy AI tools to review how courses discuss race and gender

https://www.texastribune.org/2025/12/15/texas-universities-ai-course-audits/
1•hn_acker•8m ago•1 comments

The AIVO Governance Artifacts

https://zenodo.org/records/17927110
1•businessmate•10m ago•1 comments

CLI for Proton Pass

https://proton.me/blog/proton-pass-cli
1•teekert•10m ago•0 comments

Show HN: 100 Million splats, a whole town, rendered in M2 MacBook Air

https://twitter.com/AKurian001/status/1986979144014701026
2•Arun_Kurian•13m ago•0 comments

How We Lost Communication to Entertainment

https://ploum.net/2025-12-15-communication-entertainment.html
4•zdw•14m ago•0 comments

What Anthropic MCP Donation to Linux Foundation Means?

https://glama.ai/blog/2025-12-15-mcp-moves-to-the-linux-foundation-neutral-stewardship-for-agenti...
1•OmShree0709•16m ago•0 comments

Tite

https://blog.cloudflare.com/es-es/welcome-to-connectivity-cloud/
1•EsaaBrooo•18m ago•0 comments

Need chip art / logo for free and open source silicon clone of Z80 CPU

https://github.com/rejunity/z80-open-silicon
2•__rej__•18m ago•2 comments

Break up bad companies; replace bad union bosses

https://pluralistic.net/2025/12/15/class-war-labor-peace/
2•hn_acker•19m ago•0 comments

US Tech Force

https://techforce.gov/
21•purple_ferret•21m ago•25 comments

How AI Is Transforming the Adoption of Secure-by-Default Mobile Frameworks

https://engineering.fb.com/2025/12/15/android/how-ai-transforming-secure-by-default-mobile-framew...
2•fleahunter•22m ago•0 comments

Inside the AI Factory: the humans that make tech seem human (2023)

https://nymag.com/intelligencer/article/ai-artificial-intelligence-humans-technology-business-fac...
1•wonger_•23m ago•0 comments

A simple graph database implementation running on Scryer Prolog

https://github.com/argahsuknesib/scryer-graph
1•triska•23m ago•0 comments

Is it an evil overlay? How can you tell?

https://www.joedolson.com/2025/12/is-it-an-evil-overlay-how-can-you-tell/
1•speckx•25m ago•0 comments

Multiplayer TypeScript Game (2025 update)

https://hoodball.vercel.app/?year=2025
1•PhilDunphy23•26m ago•1 comments

A Matrix of Saab Wheels

https://jpowell.tripod.com/saab-wheels/
1•Kaibeezy•27m ago•1 comments

WebKit Features for Safari 26.2

https://webkit.org/blog/17640/webkit-features-for-safari-26-2/
1•alwillis•27m ago•0 comments

Deciduous: Better programming with LLMs using a living memory and decision graph

https://notactuallytreyanastasio.github.io/deciduous/
2•rhgraysonii•28m ago•3 comments

Don MacKinnon: Why Simplicity Beats Cleverness in Software Design [audio]

https://maintainable.fm/episodes/don-mackinnon-why-simplicity-beats-cleverness-in-software-design
2•mooreds•30m ago•0 comments

Federal Wallet Inspectors

https://pluralistic.net/2025/12/13/uncle-sucker/
1•hn_acker•30m ago•0 comments

Geodesic Is a DevOps Linux Toolbox in Docker

https://github.com/cloudposse/geodesic
1•mooreds•30m ago•0 comments

OpenAI-Backed Chai Discovery Raises $130M for AI-Designed Molecules

https://www.bloomberg.com/news/articles/2025-12-15/openai-backed-chai-discovery-raises-130-millio...
1•doppp•31m ago•0 comments

Interview: Kim Stanley Robinson, Science Fiction Maestro and Utopian (2024)

https://sammatey.substack.com/p/interview-kim-stanley-robinson-science
1•mooreds•31m ago•0 comments