frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•1y ago

Comments

tocs3•1y ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

David Hockney, Who Restored the Human Form to Art, Dies at 88

https://www.nytimes.com/2026/06/12/arts/design/david-hockney-dead.html
1•SirLJ•54s ago•0 comments

AI Will Cheat to Win: Reward Hacking from 1994 to 2025

https://adversariallogic.com/when-ai-finds-the-shortcut-reward-hacking-from-1994-to-2025/
1•joshgracie•2m ago•0 comments

Tron Algorithm Competition

https://tron.erik.gdn/
1•zinngr•3m ago•0 comments

Horizon Trade revolutionises trading with AI

https://horizon.trade
1•LaurenzBauer•5m ago•1 comments

How and why we moved our knowledge base from Notion to Markdown

https://blog.fortrabbit.com/moving-knowledge-base-from-notion-to-markdown
1•esher•11m ago•0 comments

Flash Drive has Every File [video]

https://www.youtube.com/watch?v=w6rkhvdAqHU
1•Gys•13m ago•0 comments

Anthropic Mythos: Modelling Bank Strategies

https://neuralcore.brainterms.ai/share/rpt_mq9ayvj0
1•Jderenne•13m ago•1 comments

Celebrated British artist David Hockney dies aged 88

https://www.bbc.co.uk/news/live/c4gye2zk29zt
1•max-amb•14m ago•0 comments

Ryanair dark UX patterns summer 2026 refresher

https://blog.osull.com/2026/06/12/ryanair-dark-ux-patterns-summer-2026-refresher/
1•danosull•16m ago•0 comments

A Systemic View of U.S.-China AI Competition [pdf]

https://www.jpmorganchase.com/content/dam/jpmorganchase/documents/center-for-geopolitics/cfg-us-c...
1•throw0101a•16m ago•0 comments

California hit by another fraud bombshell claims for $4B are fake

https://www.dailymail.com/news/article-15894057/California-sex-abuse-fraud-billions.html
1•Bender•17m ago•0 comments

Show HN: Ttar 2.4 KB freestanding TAR archiver written in C, no Libc

https://github.com/Ferki-git-creator/ttar-tiny-tar-archivist
1•DenisDolya•17m ago•0 comments

The AI Price War Is Here, Piling Pressure on OpenAI and Anthropic

https://www.wsj.com/tech/ai/the-ai-price-war-is-here-piling-pressure-on-openai-and-anthropic-86e1...
2•cebert•19m ago•1 comments

Kimi Code K2.7

https://twitter.com/Kimi_Moonshot/status/2065377579130142937
4•theanonymousone•21m ago•0 comments

Stop tuning prompts by hand. Engineer the loop that tunes them

https://github.com/anastasiosyal/dspy-gepa-optimizer
1•tassosyal•21m ago•0 comments

(Some) Unanswered Swift Group Questions

https://www.massicotte.org/blog/wwdc26-unanswered-qa/
1•ianhxu•23m ago•0 comments

Why "reprogramming" is the buzziest approach to reversing aging

https://www.technologyreview.com/2026/06/12/1138829/reprogramming-buzziest-approach-reversing-agi...
1•joozio•25m ago•0 comments

Show HN: I made an app for initiating social activity

https://apps.apple.com/no/app/greenlight-squads/id6757295236
1•steinvakt2•30m ago•0 comments

AI Economics for Dummies

https://www.mcsweeneys.net/articles/ai-economics-for-dummies
4•_____k•31m ago•0 comments

Compile and Run iOS Apps on Linux

https://github.com/Lore-Hex/QuillUI
2•ljlolel•31m ago•0 comments

Production-Grade Claude/AI Skills for Ruby on Rails

https://github.com/sandeepmvl/rails-skills
1•thinkingemote•35m ago•1 comments

I made a zero cost browser-use tool – let AI click and type on webpages for you

https://github.com/pdufour/browser-use-wasm
1•pdufour•37m ago•1 comments

Why Secure AI Needs Compile-Time Sandboxing

https://jo-lang.org/blog/2026-06-11-why-compile-time-sandboxing.html
1•liu-fengyun•39m ago•0 comments

PostDesk is now open source

https://github.com/bymilon/post-desk
2•milonspace•40m ago•0 comments

GCC 15.3 Compiler Brings Nearly a Year Worth of Bug Fixes

https://www.phoronix.com/news/GCC-15.3-Released
1•Bender•41m ago•0 comments

Chinese agents caught rebuilding botnets and stirring pot AI datacenter debate

https://www.theregister.com/security/2026/06/11/china-linked-operators-revive-botnet-stir-ai-data...
1•Bender•42m ago•0 comments

Show HN: What two months of building a B2C app taught me

https://twitter.com/moritzbuildz/status/2065383556268896480
1•moritzschultz•44m ago•0 comments

Kimi K2.7-Code: open-source coding model with better token efficiency

https://huggingface.co/moonshotai/Kimi-K2.7-Code
16•nekofneko•44m ago•2 comments

Is the Future of Blogging More Interconnected?

https://www.ssp.sh/brain/future-of-blogging/
1•zazuke•45m ago•0 comments

Show HN: I built a lead list tool to find businesses you can sell to

https://sensecollect.com
2•chrislxy•46m ago•0 comments