The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•11mo ago

Comments

tocs3•11mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
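
For anyone who wants to poke at the distinction the summary draws between a correct answer and a correct trace, here is a rough sketch, not the paper's code. It runs A* on a tiny grid and logs a "create"/"close" trace, then checks the final plan and the trace separately. The grid, the trace format, and the function names (`astar_with_trace`, `plan_is_valid`, `trace_is_valid`) are all made up for illustration.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """Run A* on a 4-connected grid and return (path, trace).

    grid is a list of strings where '#' marks a wall. The trace is a list of
    (op, cell, g, h) tuples with op in {"create", "close"}; this format is a
    hypothetical stand-in for the search traces described above.
    """
    def h(cell):
        # Manhattan distance, admissible and consistent for unit step costs.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    trace = [("create", start, 0, h(start))]
    frontier = [(h(start), 0, start)]
    best_g = {start: 0}
    parent = {start: None}
    closed = set()
    while frontier:
        _f, g, cell = heapq.heappop(frontier)
        if cell in closed:
            continue  # stale heap entry
        closed.add(cell)
        trace.append(("close", cell, g, h(cell)))
        if cell == goal:
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1], trace
        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue
            if grid[nr][nc] == "#":
                continue
            ng = g + 1
            if ng < best_g.get(nxt, float("inf")):
                best_g[nxt] = ng
                parent[nxt] = cell
                trace.append(("create", nxt, ng, h(nxt)))
                heapq.heappush(frontier, (ng + h(nxt), ng, nxt))
    return None, trace


def plan_is_valid(grid, start, goal, path):
    """Check the final answer only: a wall-free, step-by-step path."""
    if not path or path[0] != start or path[-1] != goal:
        return False
    for (r1, c1), (r2, c2) in zip(path, path[1:]):
        if abs(r1 - r2) + abs(c1 - c2) != 1 or grid[r2][c2] == "#":
            return False
    return True


def trace_is_valid(trace):
    """Check the trace only: every close must follow a matching create."""
    created = set()
    for op, cell, g, _h in trace:
        if op == "create":
            created.add((cell, g))
        elif (cell, g) not in created:
            return False
    return True


grid = ["..#", "...", "#.."]
path, trace = astar_with_trace(grid, (0, 0), (2, 2))
print(plan_is_valid(grid, (0, 0), (2, 2), path), trace_is_valid(trace))
# A scrambled trace fails the trace check while the plan stays valid.
print(plan_is_valid(grid, (0, 0), (2, 2), path), trace_is_valid(trace[::-1]))
```

The two checks are independent, which is the point: a plan can pass while the trace attached to it does not.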
