The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•11mo ago

Comments

tocs3•11mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised by the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
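For anyone who wants to see what "reasoning steps from A* search" could look like in practice, here is a minimal sketch. It assumes a small 4-connected grid maze, a Manhattan-distance heuristic, and a made-up trace format of `("create" | "close", cell, g, h)` tuples. The function name `astar_with_trace`, the grid encoding, and the trace format are all illustrative assumptions and are not taken from the paper.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """Run A* on a 4-connected grid and record every search step.

    Hypothetical setup: grid is a list of strings, '#' marks a wall.
    Returns (trace, path); path is None if the goal is unreachable.
    """
    def h(cell):
        # Manhattan distance: admissible and consistent on a unit-cost grid.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    rows, cols = len(grid), len(grid[0])
    open_heap = [(h(start), 0, start)]  # entries are (f, g, cell)
    g_cost = {start: 0}
    parent = {start: None}
    closed = set()
    trace = []  # the "intermediate tokens": one tuple per search operation

    while open_heap:
        _, g, cell = heapq.heappop(open_heap)
        if cell in closed:
            continue  # stale heap entry, a cheaper one was already expanded
        closed.add(cell)
        trace.append(("close", cell, g, h(cell)))

        if cell == goal:
            # Walk parent links back to the start to get the final answer.
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return trace, path[::-1]

        r, c = cell
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if not (0 <= nr < rows and 0 <= nc < cols):
                continue
            if grid[nr][nc] == "#":
                continue
            new_g = g + 1
            if new_g < g_cost.get((nr, nc), float("inf")):
                g_cost[(nr, nc)] = new_g
                parent[(nr, nc)] = cell
                heapq.heappush(open_heap, (new_g + h((nr, nc)), new_g, (nr, nc)))
                trace.append(("create", (nr, nc), new_g, h((nr, nc))))

    return trace, None


# Illustrative maze: walls block the right column on the first two rows.
grid = ["..#",
        "..#",
        "..."]
trace, path = astar_with_trace(grid, (0, 0), (2, 2))
print("trace steps:", len(trace))
print("path:", path)
```

Because the trace comes from a known algorithm, it can be checked mechanically against the maze, separately from whether the final path is valid. That separation is what makes it possible to ask the question in the summary above: does the answer stay correct when the trace is wrong?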
