The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised by the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps, and surprisingly, they still perform about as well as, and sometimes even better than, those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
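
To make the distinction between a valid answer and a valid trace concrete, here is a minimal Python sketch. It is not the paper's code: the grid maze, the "create"/"close" trace format, and every function name are assumptions made for illustration. It runs A* while recording a trace, checks the final plan on its own, and then checks whether a trace is consistent with that plan.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """Run A* on a 4-connected grid; return (path, trace).

    grid is a list of strings where '#' marks a wall. The trace is a
    list of ("create" | "close", cell, g, h) tuples (hypothetical format).
    """
    def h(cell):
        # Manhattan distance: admissible on a 4-connected unit-cost grid.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    trace = [("create", start, 0, h(start))]
    frontier = [(h(start), 0, start)]
    parent = {start: None}
    best_g = {start: 0}
    closed = set()
    while frontier:
        _, g, cell = heapq.heappop(frontier)
        if cell in closed:
            continue  # stale heap entry
        closed.add(cell)
        trace.append(("close", cell, g, h(cell)))
        if cell == goal:
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1], trace
        r, c = cell
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue
            if grid[nr][nc] == "#" or (nr, nc) in closed:
                continue
            ng = g + 1
            if ng < best_g.get((nr, nc), float("inf")):
                best_g[(nr, nc)] = ng
                parent[(nr, nc)] = cell
                heapq.heappush(frontier, (ng + h((nr, nc)), ng, (nr, nc)))
                trace.append(("create", (nr, nc), ng, h((nr, nc))))
    return None, trace


def plan_is_valid(grid, start, goal, path):
    """Check the final answer alone, ignoring any trace."""
    if not path or path[0] != start or path[-1] != goal:
        return False
    for r, c in path:
        in_bounds = 0 <= r < len(grid) and 0 <= c < len(grid[0])
        if not in_bounds or grid[r][c] == "#":
            return False
    return all(abs(r1 - r2) + abs(c1 - c2) == 1
               for (r1, c1), (r2, c2) in zip(path, path[1:]))


def trace_supports_plan(trace, path):
    """Weak consistency check: every cell on the plan was closed in the trace."""
    closed = {cell for op, cell, _, _ in trace if op == "close"}
    return all(cell in closed for cell in path)


maze_a = ["..#", "..#", "..."]
maze_b = ["#..", "...", "..."]
path_a, trace_a = astar_with_trace(maze_a, (0, 0), (2, 2))
_, trace_b = astar_with_trace(maze_b, (0, 1), (2, 2))

print(plan_is_valid(maze_a, (0, 0), (2, 2), path_a))  # answer check
print(trace_supports_plan(trace_a, path_a))           # matching trace
print(trace_supports_plan(trace_b, path_a))           # unrelated trace
```

Pairing `path_a` with `trace_b` loosely mimics the swapped-trace setup the summary describes: the plan stays correct while the trace attached to it no longer has anything to do with the problem. Whether a model trained on such pairs still solves the task is the empirical question the paper investigates; this sketch only shows how the two checks can be separated.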
