The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•11mo ago

Comments

tocs3•11mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
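For anyone who wants to see what "checking both the final answers and the reasoning steps" could look like, here is a minimal sketch in Python. It runs A* on a tiny grid maze, records a step-by-step trace, and validates the final plan without looking at the trace at all. The grid encoding, the `("create" | "close", cell, g, h)` trace records, and the names `astar_with_trace` and `plan_is_valid` are my own assumptions for illustration; this is not the paper's actual data format or code.

```python
import heapq

def astar_with_trace(grid, start, goal):
    """Run A* on a 4-connected grid ('#' = wall); return (plan, trace).

    The trace record layout is a hypothetical illustration only.
    """
    def h(cell):
        # Manhattan distance: admissible for unit-cost 4-connected moves.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    trace = [("create", start, 0, h(start))]
    frontier = [(h(start), 0, start)]  # entries are (f, g, cell)
    parent = {start: None}
    best_g = {start: 0}
    closed = set()
    while frontier:
        _, g, cell = heapq.heappop(frontier)
        if cell in closed:
            continue  # stale heap entry
        closed.add(cell)
        trace.append(("close", cell, g, h(cell)))
        if cell == goal:
            plan = []
            while cell is not None:  # walk parents back to the start
                plan.append(cell)
                cell = parent[cell]
            return plan[::-1], trace
        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue
            if grid[nr][nc] == "#":
                continue
            ng = g + 1
            if ng < best_g.get(nxt, float("inf")):
                best_g[nxt] = ng
                parent[nxt] = cell
                heapq.heappush(frontier, (ng + h(nxt), ng, nxt))
                trace.append(("create", nxt, ng, h(nxt)))
    return None, trace  # goal unreachable

def plan_is_valid(grid, start, goal, plan):
    """Check the final answer alone; the trace is deliberately not an input."""
    if not plan or plan[0] != start or plan[-1] != goal:
        return False
    for (r1, c1), (r2, c2) in zip(plan, plan[1:]):
        if abs(r1 - r2) + abs(c1 - c2) != 1:
            return False  # not a single orthogonal step
        if not (0 <= r2 < len(grid) and 0 <= c2 < len(grid[0])):
            return False
        if grid[r2][c2] == "#":
            return False  # walked into a wall
    return True

GRID = ["..#",
        "..#",
        "..."]
plan, trace = astar_with_trace(GRID, (0, 0), (2, 2))
print(plan_is_valid(GRID, (0, 0), (2, 2), plan), len(trace))
```

Because `plan_is_valid` never reads `trace`, you could shuffle or replace the trace and the answer check would still pass, which is the kind of gap between "correct answer" and "correct reasoning steps" the summary describes.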
