The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•10mo ago

Comments

tocs3•10mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
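
To make the "trace versus final answer" distinction in the summary concrete, here is a toy sketch in Python. It is not the paper's code or data format: the grid, the `create`/`close` record layout, and the names `astar_with_trace` and `plan_is_valid` are all assumptions made up for illustration. It runs A* on a small grid, logs each search step as a trace, and then checks the final plan on its own, without looking at the trace.

```python
import heapq
import random


def astar_with_trace(grid, start, goal):
    """A* on a 4-connected grid (0 = free, 1 = wall).

    Returns (trace, plan). The trace is a list of hypothetical
    ("create" | "close", cell, g, h) records; the plan is the list of
    cells from start to goal, or None if the goal is unreachable.
    """
    def h(cell):
        # Manhattan distance: admissible and consistent on this grid.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    trace = [("create", start, 0, h(start))]
    open_heap = [(h(start), 0, start)]
    best_g = {start: 0}
    parent = {start: None}
    closed = set()

    while open_heap:
        _, g, cell = heapq.heappop(open_heap)
        if cell in closed:
            continue  # stale heap entry
        closed.add(cell)
        trace.append(("close", cell, g, h(cell)))
        if cell == goal:
            plan = []
            while cell is not None:
                plan.append(cell)
                cell = parent[cell]
            return trace, plan[::-1]
        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue
            if grid[nr][nc] == 1 or nxt in closed:
                continue
            ng = g + 1
            if ng < best_g.get(nxt, float("inf")):
                best_g[nxt] = ng
                parent[nxt] = cell
                heapq.heappush(open_heap, (ng + h(nxt), ng, nxt))
                trace.append(("create", nxt, ng, h(nxt)))
    return trace, None


def plan_is_valid(grid, start, goal, plan):
    """Check the final answer only; the trace is never consulted."""
    if not plan or plan[0] != start or plan[-1] != goal:
        return False
    for r, c in plan:
        if not (0 <= r < len(grid) and 0 <= c < len(grid[0])):
            return False
        if grid[r][c] == 1:
            return False
    # Consecutive cells must be exactly one step apart.
    return all(abs(r1 - r2) + abs(c1 - c2) == 1
               for (r1, c1), (r2, c2) in zip(plan, plan[1:]))


GRID = [
    [0, 0, 0],
    [1, 1, 0],
    [0, 0, 0],
]
START, GOAL = (0, 0), (2, 0)

trace, plan = astar_with_trace(GRID, START, GOAL)

# Scramble the trace so it no longer describes a real A* run.
scrambled = trace[:]
random.Random(0).shuffle(scrambled)

print(plan)
print(plan_is_valid(GRID, START, GOAL, plan))
```

The validity check passes no matter what happens to `scrambled`, because the answer is verified independently of the steps that supposedly led to it. That only shows the two can be evaluated separately; it says nothing about what a trained model does with such traces, which is the question the paper studies.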
