frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Tell HN: Perplexity is defaulting to religious sources for secular queries

1•fallinditch•2m ago•0 comments

How I Became a Quant [pdf]

https://engineering.nyu.edu/sites/default/files/2021-10/How_I_Became_a_Quant%20%281%29.pdf
1•sonabinu•4m ago•0 comments

Deep Monitor Truth

https://vis.social/@infobeautiful/115916911669220974
1•colinprince•5m ago•0 comments

Anyone still planning their holidays in Sheets? Try this instead

https://journeyjot.app
1•ahmedcader•10m ago•2 comments

The Multidisciplinary Approach to Thinking

https://fs.blog/great-talks/multidisciplinary-approach-thinking-peter-kaufman/
1•zdw•10m ago•0 comments

MacOS Local Network Privacy Revealed

https://eclecticlight.co/2026/01/18/last-week-on-my-mac-local-network-privacy-revealed/
1•chmaynard•10m ago•0 comments

My thoughts on Gas Town after 10k hours of Claude Code

https://simonhartcher.com/posts/2026-01-19-my-thoughts-on-gas-town-after-10000-hours-of-claude-code/
1•todsacerdoti•14m ago•0 comments

Why Walmart still doesn't support Apple Pay

https://9to5mac.com/2026/01/18/heres-why-walmart-still-doesnt-support-apple-pay/
3•colinprince•16m ago•0 comments

Claude voice mode is still a joke in 2026

https://simonhartcher.com/posts/2026-01-19-claude-voice-mode-is-still-a-joke-in-2026/
2•todsacerdoti•17m ago•1 comments

Bending Time: The Successful Time Travel Experiments Using Kozyrev Mirrors [pdf]

https://patentimages.storage.googleapis.com/4b/09/bb/a36f136bf184bb/RU2122446C1.pdf
1•vinyasi•19m ago•0 comments

Show HN: I treated dating like a Seed Round. Here is the Term Sheet

https://series-seed-relationship-term-sheet.tiiny.site/
1•love-doctor•23m ago•0 comments

Show HN: Skyscraper – A Native iPhone and iPad App for Bluesky

https://apps.apple.com/us/app/skyscraper-for-bluesky/id6754198379
1•CameronBanga•25m ago•0 comments

Victoria Leigh Soto

https://en.wikipedia.org/wiki/Victoria_Leigh_Soto
2•handfuloflight•34m ago•1 comments

Don't back down, Europe [video]

https://www.youtube.com/watch?v=SS5Ep3LTqnE
1•mooreds•39m ago•0 comments

The next big thing in heart disease prevention is targeting lipoprotein(a)

https://twitter.com/cremieuxrecueil/status/1961084474122252640
1•tekacs•39m ago•0 comments

Ukraine's kamikaze drones run AI vision/terminal guidance on Raspberry Pi

https://www.nytimes.com/2025/12/31/magazine/ukraine-ai-drones-war-russia.html
4•Lwrless•41m ago•0 comments

Ask HN: 1 year from today what will have been the worst behavior from AI corps?

2•keepamovin•44m ago•0 comments

Show HN: PixelRipple – AI ads agent for e-commerce

https://www.pixelripple.ai/
1•zxzxy1988•47m ago•3 comments

Poolsuite CLI – Ultra-summer internet radio from your terminal

https://github.com/jamespember/poolsuite-cli
2•jep888•52m ago•2 comments

WordWalker Spanish

https://wordwalker.ca/
1•petedrinnan•52m ago•0 comments

Show HN: A 6.9B Moe LLM in Rust, Go, and Python

https://github.com/fumi-engineer/machine_learning
3•fumi2026•56m ago•1 comments

The Stirling Engine: A Wave of the Future Ago [video]

https://www.youtube.com/watch?v=KbnGlcQiL1c
1•akshatjiwan•57m ago•0 comments

Show HN: AI Tryon Product to Video Generator

https://aitryon.art/ai-product-to-video/
1•AITryon•1h ago•0 comments

Watch Cursor build a 3M+ line browser in a week

https://twitter.com/mntruell/status/2012825801381580880
2•hentrep•1h ago•0 comments

The Convolutional Neural Network

https://cocakoala.substack.com/p/the-convolutional-neural-network
1•imranmk•1h ago•0 comments

Writing Your First Compiler

https://popovicu.com/posts/writing-your-first-compiler/
3•thunderbong•1h ago•0 comments

Meta has discontinued its metaverse for work, too

https://www.theverge.com/tech/863209/meta-has-discontinued-its-metaverse-for-work-too
3•prawn•1h ago•0 comments

The Computational Web and the Old AI Switcharoo

https://www.fromjason.xyz/p/notebook/the-computational-web-and-the-old-ai-switcharoo/
1•jayveeone•1h ago•0 comments

Greenland Crisis

https://en.wikipedia.org/wiki/Greenland_crisis
5•handfuloflight•1h ago•0 comments

MH370 operational search reports

https://www.atsb.gov.au/mh370-pages/updates/reports
2•teleforce•1h ago•0 comments