frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•1y ago

Comments

tocs3•1y ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Neutron Holdings Inc [Lime] S-1

https://www.sec.gov/Archives/edgar/data/1699963/000162828026032523/neutronholdingsinc-sx1.htm
1•toomuchtodo•24s ago•0 comments

Seirawan Chess [video]

https://www.youtube.com/watch?v=Nht2TqabPr0
1•nomilk•25s ago•0 comments

Fused Agent Kernel: 4x Tuned SOTA on Benchmarks. MIT OSS

https://github.com/anthony-chaudhary/fak
1•anthonysarkis•1m ago•0 comments

Small island nation tries bold tech education strategy

https://www.theregister.com/offbeat/2026/06/22/small-island-nation-tries-bold-tech-education-stra...
1•rbanffy•1m ago•0 comments

How to build an AI agent in 2026: a practical step-by-step guide

https://www.execlave.com/blog/how-to-build-an-ai-agent
1•brl1313•2m ago•0 comments

Artificial

https://www.inkandswitch.com/tangents/artificial/
1•surprisetalk•3m ago•0 comments

Darktable 5.6.0 Release

https://github.com/darktable-org/darktable/releases/tag/release-5.6.0
1•psibi•3m ago•0 comments

Show HN: Ghtrack: track GitHub Actions workflow duration in GitHub Pages

https://github.com/hatsu38/ghtrack
1•hatsu•3m ago•0 comments

Mars Is Spending Millions to Give M&M's a MAHA Makeover

https://www.wsj.com/business/mars-is-spending-millions-to-give-m-ms-a-maha-makeover-2fa1bb88
1•ksec•5m ago•1 comments

Make GitHub Actions Do More for You

https://mikemcquaid.com/make-github-actions-do-more-for-you/
2•mikemcquaid•5m ago•1 comments

Stressed-Out Soil Bacteria Adapt to Environmental Conditions

https://www.caltech.edu/about/news/stressed-out-soil-bacteria-adapt-to-environmental-conditions
1•gmays•6m ago•0 comments

Two Heads Are Better Than One: Run Many AI Agents, Merge One Auditable Result

https://medium.com/@Koukyosyumei/two-heads-are-better-than-one-run-many-coding-agents-merge-one-a...
1•syumei•7m ago•0 comments

Stripe Directory

https://docs.stripe.com/directory
1•tosh•7m ago•0 comments

Self-hosting High Availability is just Backups

https://blog.greg.technology/2026/06/21/self-hosting-high-availability-is-just-backups.html
2•gregsadetsky•7m ago•0 comments

AI Agent Governance vs. Observability: What's the Difference?

https://www.execlave.com/blog/ai-agent-governance-vs-observability
1•brl1313•7m ago•0 comments

"Buy Canadian" won't fix defence procurement until Ottawa defines "Canadian"

https://policyoptions.irpp.org/2026/06/canadian-defence-procurement-software/
1•ClearwayLaw•8m ago•1 comments

DeepMind AI Control Roadmap [pdf]

https://storage.googleapis.com/deepmind-media/DeepMind.com/Blog/securing-the-future-of-ai-agents/...
1•leopoldj•8m ago•1 comments

Inventing the Future, One Lisp Machine at a Time

https://www.patrickdomanico.com/bpm/2026/06/16/inventing-the-future-one-lisp-machine-at-a-time/
1•pamoroso•8m ago•0 comments

Show HN: Sturnus – OpenAI-compatible LLM proxy routing to the fastest provider

https://github.com/sturnus-dev/sturnus
1•dannyboland•8m ago•0 comments

Developers react to AI-scented blog posts

https://writethatblog.substack.com/p/dev-reaction-to-ai-blog-posts
1•mooreds•8m ago•0 comments

'I got crushed': AI giants are funding ad wars in races across the country

https://www.latimes.com/politics/story/2026-06-21/ai-giants-are-funding-ad-wars-in-races-across-c...
2•1vuio0pswjnm7•9m ago•0 comments

Economists have long pushed for prediction markets

https://www.cnn.com/2026/06/21/business/prediction-markets-economists
1•mooreds•10m ago•0 comments

Nff_: Open-source Claude Code for hardware

https://github.com/GLechevalier/nff
2•GL26•10m ago•0 comments

OpenAI hit with multistate probe into possible user harm as its IPO looms

https://apnews.com/article/openai-chatgpt-subpoena-attorneys-general-probe-a95894407773307fae8ae3...
2•1vuio0pswjnm7•11m ago•0 comments

Oxide computer 3D rack guided tour

https://explorer.oxide.computer/
1•darthcloud•12m ago•0 comments

ZCode – Simple, Fast, Vibe‑Ready

https://zcode.z.ai/en
3•edg5000•13m ago•0 comments

The road to Unreal Engine 6

https://www.unrealengine.com/news/the-road-to-ue-6
1•ksec•13m ago•0 comments

Sakana Fugu: A full multi-agent orchestration system

https://twitter.com/SakanaAILabs/status/2068861630327443966
1•thedebuglife•15m ago•0 comments

Chirps – AI widget that knows your site, takes actions, and sees what users see

https://chirps.cc
1•Monzed•15m ago•0 comments

Bain tests software takeover targets by vibecoding AI replicas

https://www.ft.com/content/e5bac4d1-b1f8-43a4-bd54-b182d5357af0
1•macleginn•16m ago•1 comments