frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•9mo ago

Comments

tocs3•9mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Dental group offers to fix Olympic Jack Hughes' smile for free

https://fox56.com/news/local/nepa-dental-group-offers-to-fix-jack-hughes-smile-after-toothless-gr...
1•DivingForGold•53s ago•0 comments

AI's Math Tricks Don't Work for Scientific Computing

https://spectrum.ieee.org/number-formats-ai-scientific-computing
1•rjmunro•1m ago•0 comments

Show HN: TTSLab – Text-to-speech that runs in the browser via WebGPU

https://ttslab.dev
1•MbBrainz•1m ago•0 comments

AIProx: An open registry and manifest standard for autonomous agent discovery

1•LightProx•1m ago•0 comments

Anthropic Links AI Agent with Tools for Investment Banking, HR

https://www.bloomberg.com/news/articles/2026-02-24/anthropic-links-ai-agent-with-tools-for-invest...
1•swolpers•4m ago•0 comments

OpenAI safety reps called to Ottawa after Tumbler Ridge, B.C., mass shooting

https://www.cbc.ca/news/politics/open-ai-summoned-ottawa-tumbler-ridge-9.7103281
3•ChrisArchitect•6m ago•1 comments

Intrator files

1•zerosizedweasle•6m ago•0 comments

Show HN: A minimal coding agent in Elixir (Erlang/OTP)

https://github.com/matteing/opal
1•sergiomattei•6m ago•0 comments

Near-Instantly Aborting the Worst Pain Imaginable with Psychedelics

https://psychotechnology.substack.com/p/near-instantly-aborting-the-worst
1•surprisetalk•7m ago•0 comments

Change your default date format to the least ambiguous

https://practicalbetterments.com/change-your-default-date-format-to-the-least-ambiguous/
1•surprisetalk•7m ago•1 comments

Georgist land taxes balance community benefit and the efficiency of markets (2024)

https://devon.postach.io/post/georgist-land-taxes-balance-community-benefit-the-efficiency-of-mar...
1•surprisetalk•7m ago•0 comments

Pecking Order and Flight Leadership (2019)

https://srconstantin.wordpress.com/2019/04/29/pecking-order-and-flight-leadership/
1•surprisetalk•7m ago•0 comments

Apple's Multibillion-Dollar Push to Make Chips in the U.S. [video]

https://www.youtube.com/watch?v=ktFlaBhpMu8
1•tambourine_man•8m ago•0 comments

A catecholamine-independent pathway controlling adaptive adipocyte lipolysis

https://www.nature.com/articles/s42255-025-01424-5
1•PaulHoule•9m ago•0 comments

Show HN: Search through half a million works of art using natural language

https://artexplorer.ai/
1•stefanvdw1•9m ago•0 comments

Show HN: Rappelo – A small tool for solopreneurs to capture leads faster

https://rappelo.com
1•AlexandruEneDev•9m ago•1 comments

EWM: The Emacs Wayland Manager

https://codeberg.org/ezemtsov/ewm
2•dargscisyhp•10m ago•0 comments

MapReduce: Simplified Data Processing on Large Clusters (2004) [pdf]

https://static.googleusercontent.com/media/research.google.com/en//archive/mapreduce-osdi04.pdf
1•vinhnx•11m ago•0 comments

Show HN: OpenPDB – Generate AI agents with real personalities

https://github.com/gitsual/openpdb
1•gitsual•11m ago•0 comments

Paxos made simple (2001) [pdf]

https://lamport.azurewebsites.net/pubs/paxos-simple.pdf
1•vinhnx•13m ago•0 comments

OpenAI calls in the consultants for its enterprise push

https://techcrunch.com/2026/02/23/openai-calls-in-the-consultants-for-its-enterprise-push/
1•signa11•14m ago•0 comments

Distributed Systems for Fun and Profit

https://book.mixu.net/distsys/single-page.html
2•vinhnx•14m ago•0 comments

Lime's billing model is encouraging cyclists to run red lights

https://tk.gg/posts/lime-bikes-should-stop-charging-when-you-stop
1•rustyhancock•14m ago•0 comments

The Coming War on General Computation

https://en.wikisource.org/wiki/The_Coming_War_on_General_Computation
2•bondarchuk•14m ago•0 comments

AI Removed Every Bottleneck Except One: Cognitive Load

https://medium.com/@a.mandyev/ai-removed-every-bottleneck-except-one-3f25b509f26e
2•andrey_m•15m ago•1 comments

The $10T Fight: Modeling a US-China War over Taiwan

http://www.bloomberg.com/news/articles/2026-02-10/the-10-trillion-fight-modeling-a-us-china-war-o...
1•nkurz•16m ago•1 comments

Zones of Distrust – Open security architecture for agentic AI

https://github.com/bluvibytes/zone-of-distrust
1•sbabylon•16m ago•1 comments

Every bug report has four parts

https://dolphinmade.com/blog/four-parts-of-a-bug-report/
1•rprend•16m ago•0 comments

I Still <3 the Internet

https://www.deezlinks.com/p/i-still-3-the-internet
1•herbertl•17m ago•0 comments

Where can I buy AI-generated antibiotics?

1•john1203•17m ago•0 comments