frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Apple acquires Israeli audio AI startup Q.ai

https://www.reuters.com/business/apple-acquires-audio-ai-startup-qai-2026-01-29/
1•porridgeraisin•2m ago•0 comments

Trump sues IRS and US treasury for $10B over leak of tax returns

https://www.theguardian.com/us-news/2026/jan/29/trump-sues-tax-return-leak
1•mellisacodes•2m ago•0 comments

Show HN: We analyzed AI tool launches – here's why GTM breaks

1•yasu_c•2m ago•0 comments

Taiwan's GDP in 2025 Grew at Fastest Pace Since 2010 (+8.63%)

https://www.wsj.com/economy/taiwans-economy-grew-at-fastest-pace-in-15-years-fcf4f0d2
1•giuliomagnifico•3m ago•0 comments

Show HN: TagCompanion – Point-and-Click Google Tag Manager Implementation

https://www.tagcompanion.com/
1•ybor•6m ago•1 comments

Beyond the click: How brands can influence visibility in AI-generated answers

https://thenextweb.com/news/beyond-the-click-influence-visibility
2•voiquh•6m ago•0 comments

Show HN: Clear to Spend – a simple YES/NO helper for spending decisions

https://clear-to-spend.vercel.app/
1•Nanoto•8m ago•1 comments

Show HN: A Protocol for Inducing Metacognition in LLMs and Falsifiable Model

https://zenodo.org/records/18346699
1•Keeper123•8m ago•0 comments

HN Highlights

https://news.ycombinator.com/highlights
1•sgt•11m ago•0 comments

Show HN: Our GitHub org profile in ASCII art

https://github.com/taskade
1•johnxie•12m ago•1 comments

A Market Saturated with Binance Scams: How Its Shaping Current Crypto Landscape

https://twitter.com/i/status/2017142739767263252
11•salkahfi•17m ago•0 comments

Grok Imagine API

https://x.ai/news/grok-imagine-api
2•vincent_s•18m ago•0 comments

devenv: Fast, Declarative, Reproducible, and Composable Developer Environments

https://devenv.sh/
6•tosh•19m ago•0 comments

Another user's pCloud setup is visible in my pcloud drive

https://old.reddit.com/r/pcloud/comments/1qqrcza/another_users_pcloud_setup_is_visible_in_my/
9•lukax•20m ago•1 comments

Show HN: Flywheel – The Zero-Flicker Terminal Compositor for Agentic CLIs

https://github.com/ccheshirecat/flywheel
1•ccheshirecat•21m ago•1 comments

Hours without lungs: artificial organ kept man alive until transplant

https://www.nature.com/articles/d41586-026-00239-y
6•qnleigh•22m ago•0 comments

Show HN: SoVideo – Free AI video generator using Sora 2

https://sovideo.ai
1•leegrayson•23m ago•0 comments

Show HN: Two AIs compete to build the best browser game from scratch

https://self-evolving.dev/
1•yugahashi•23m ago•0 comments

GOG: Linux "the next major frontier" for gaming as it works on a native client

https://www.xda-developers.com/gog-calls-linux-the-next-major-frontier-for-gaming-as-it-works-on-...
9•franczesko•26m ago•0 comments

The Rotten Science Behind the MSG Scare

https://www.sciencehistory.org/stories/magazine/the-rotten-science-behind-the-msg-scare/
2•thunderbong•30m ago•0 comments

Show HN: An AI tutor focused on reasoning, not just answer

https://dechecker.ai/ai-homework-helper
2•passioner•30m ago•0 comments

Show HN: Velovol – Self-hosted development environment distribution

https://www.velovol.com
3•tlyplane•35m ago•0 comments

Why I'm ignoring pretty much all new Python packaging tools

https://utcc.utoronto.ca/~cks/space/blog/python/PythonPackageToolsMyIgnoring
8•ingve•39m ago•0 comments

OpenPuya

https://py32.org/en/
2•tosh•39m ago•0 comments

Apple can't secure enough chips as iPhone demand surges, memory prices rise

https://www.cnbc.com/2026/01/29/apple-iphone-soc-memory-tsmc.html
3•1659447091•46m ago•0 comments

Apple Reports Record-Setting 1Q 2026 Results: $42.1B Profit on $143.8B Revenue

https://www.macrumors.com/2026/01/29/apple-1q-2026-earnings/
3•tosh•46m ago•0 comments

How linguistic framing in pitch decks influence investors' judgment – St. Gallen

https://www.pitchwise.se/blog/the-science-of-cold-outreach-a-research-on-why-your-pitch-deck-slid...
3•dabojula•51m ago•0 comments

AI creates asymmetric pressure on Open Source

https://dri.es/ai-creates-asymmetric-pressure-on-open-source
3•7777777phil•51m ago•0 comments

Show HN: Configlock, App Lock for Dotfiles

https://github.com/baggiiiie/configlock
1•baggiiiie•53m ago•0 comments

Cutting down 90% of database spending at Capacities by migrating to Postgres

https://capacities.io/blog/migration-to-postgres
3•steffenbleher•55m ago•1 comments