frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•11mo ago

Comments

tocs3•11mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Unconscious Brain May Process Sound, Learn, and Predict Words Under Anesthesia

https://www.discovermagazine.com/the-unconscious-brain-may-still-process-sound-learn-patterns-and...
1•gmays•2m ago•0 comments

China's Mandate of AI

https://afraw.substack.com/p/mandate-of-ai
1•chiwilliams•2m ago•0 comments

Mae

https://runmae.ai/
2•ccharleswang•2m ago•0 comments

Checkmate in Iran

https://www.theatlantic.com/international/2026/05/iran-war-trump-losing/687094/
3•znnajdla•3m ago•0 comments

Elsevier vs. Meta: first science publisher sues over scraped research papers

https://www.nature.com/articles/d41586-026-01481-0
1•sohkamyung•4m ago•0 comments

Notifications on Calendar Changes

https://www.grepular.com/Notifications_on_Calendar_Changes
1•Brajeshwar•4m ago•0 comments

The Moltbook Files: A Harmless Slopocalypse or Humanity's Last Experiment

https://arxiv.org/abs/2605.07462
1•evilscript•5m ago•0 comments

The great memory panic of 2026 – Asymco

https://asymco.com/2026/05/11/the-great-memory-panic-of-2026/
1•tambourine_man•7m ago•0 comments

Show HN: textdrop.sh – a simple tool for sharing text, Markdown, and code

https://textdrop.sh
1•tommyross•8m ago•0 comments

The faster you grow, the faster you go bankrupt

https://no01.substack.com/p/the-faster-you-grow-the-faster-you
2•speckx•8m ago•0 comments

Proof of Human as a research agenda, not a product feature

https://mayankagrawalphd.substack.com/p/proof-of-human-as-a-research-agenda
1•timshell•9m ago•0 comments

Speed Month – The Reader Meets the Fediverse

https://activitypub.blog/2026/05/05/radical-speed-month-the-reader-meets-the-fediverse/
1•benwerd•10m ago•0 comments

Kernel Kill Switch

https://lore.kernel.org/all/20260507070547.2268452-1-sashal@kernel.org/
1•unknownhad•10m ago•1 comments

CuqueClicker: CookieClicker-like idle-clicker TUI, but you don't click a Cookie

https://github.com/flipbit03/cuqueclicker
1•nateb2022•10m ago•0 comments

Looping AI for Science

https://github.com/hassard0/itb-engine
1•haz00•11m ago•1 comments

NASA's bid to save Swift from fiery death passes another hurdle

https://www.theregister.com/science/2026/05/11/nasas-bid-to-save-swift-from-fiery-death-passes-an...
1•sohkamyung•12m ago•0 comments

Effect of exercise for depression: systematic review and network meta-analysis

https://www.bmj.com/content/384/bmj-2023-075847
2•like_any_other•12m ago•0 comments

What skills need from an enterprise agent platform

https://h-tu.ch/blog/what-skills-need-from-an-enterprise-agent-platform/
1•htuch•13m ago•0 comments

Multi-Core by Default – By Ryan Fleury – Digital Grove

https://www.dgtlgrove.com/p/multi-core-by-default
1•fagnerbrack•14m ago•0 comments

Paola Antonelli: Design and the Elastic Mind [video]

https://www.ted.com/talks/paola_antonelli_design_and_the_elastic_mind
1•fagnerbrack•14m ago•0 comments

Maggy – Self-improving AI engineering platform with cross-session memory

https://github.com/alinaqi/claude-bootstrap
1•naxmax•14m ago•0 comments

The trap of solving what is easy first

https://www.intelligentproduct.solutions/blog/the-danger-of-low-hanging-fruit
1•freemh•17m ago•0 comments

Transactional Database Usage Survey

https://docs.google.com/forms/d/e/1FAIpQLSdzgAFzvEmSzqXPhm5sQs53m6-XT78xFcpYs2QJszjjpcFEEQ/viewform
1•mooreds•18m ago•0 comments

"I accidentally sent a shutdown loop to the company."

https://old.reddit.com/r/sysadmin/comments/1ta0h9u/i_am_going_to_get_fired_today_i_accidentally_s...
2•petecooper•20m ago•0 comments

AI replaced the management, not the engineers

https://dontdos.substack.com/p/what-if-the-robots-came-for-the-org
3•sirnicolaz•20m ago•0 comments

Principles almost always have exceptions

https://gabrielweinberg.com/p/principles-almost-always-have-exceptions
3•nathanh•23m ago•0 comments

1/3 of Czech infants <11mo spend 30 mins/day w phone/tablet,82% preschoolers 1+h

https://www.ceskenoviny.cz/zpravy/2823337
2•Markoff•23m ago•0 comments

Photiu Image Upscaler

https://www.photiu.ai/image-upscaler
1•bellamoon544•24m ago•0 comments

Online Is Fragile

https://binaryribbons01111011.bearblog.dev/online-is-fragile/
2•speckx•24m ago•0 comments

Show HN: FLOX C++ trading systems framework with MCP

https://github.com/FLOX-Foundation/flox
2•eeiaao•25m ago•0 comments