The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•1y ago

Comments

tocs3•1y ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised by the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
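For anyone who wants to picture the setup the summary describes, here is a minimal sketch of how an A* run can yield both a step-by-step trace and a final plan, and how a corrupted trace can be paired with the same correct plan. Everything in it is an assumption for illustration: the grid encoding, the `create`/`close` trace entries, and the names `astar_with_trace` and `training_example` are hypothetical, and shuffling is only a stand-in for whatever corruption the paper actually applies.

```python
import heapq
import random


def astar_with_trace(grid, start, goal):
    """Run A* on a 4-connected grid and return (trace, plan).

    grid is a list of strings where '#' marks a wall. The trace logs,
    in order, every node pushed ("create") or expanded ("close").
    """
    def h(cell):
        # Manhattan distance: admissible and consistent on this grid.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    frontier = [(h(start), 0, start)]
    parent = {start: None}
    cost = {start: 0}
    closed = set()
    trace = [("create", start, 0, h(start))]

    while frontier:
        _, g, cell = heapq.heappop(frontier)
        if cell in closed:
            continue  # stale queue entry
        closed.add(cell)
        trace.append(("close", cell, g, h(cell)))
        if cell == goal:
            plan = []
            while cell is not None:  # walk parents back to the start
                plan.append(cell)
                cell = parent[cell]
            return trace, plan[::-1]
        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue
            if grid[nr][nc] == "#":
                continue
            if nxt not in cost or g + 1 < cost[nxt]:
                cost[nxt] = g + 1
                parent[nxt] = cell
                heapq.heappush(frontier, (g + 1 + h(nxt), g + 1, nxt))
                trace.append(("create", nxt, g + 1, h(nxt)))
    return trace, None  # goal unreachable


def training_example(trace, plan, rng=None):
    """Pair a trace with the plan; if rng is given, shuffle the trace.

    Shuffling is an illustrative corruption only: the steps no longer
    describe a valid search, but the final plan is left untouched.
    """
    steps = list(trace)
    if rng is not None:
        rng.shuffle(steps)
    return {"trace": steps, "plan": plan}
```

Calling `training_example(trace, plan)` gives the "correct steps" variant, while `training_example(trace, plan, random.Random(0))` gives a variant whose intermediate steps are meaningless but whose answer is still right. The question the summary raises is whether a model trained on the second kind does any worse.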
