The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•9mo ago

Comments

tocs3•9mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps, and surprisingly, they still perform about the same as, and sometimes even better than, those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
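For anyone who wants to see what "check both the final answers and the reasoning steps" could look like in practice, here is a rough Python sketch. It is not the paper's code: the grid maze, the `("create" | "close", cell, cost)` trace format, and all function names are my own assumptions for illustration. It runs A* on a small grid, records a trace of the search, and then validates the final path and the trace separately, which is the kind of separation the summary describes.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """A* on a 4-connected grid with unit step cost.

    grid is a list of strings where '#' marks a wall.
    Returns (path, trace). The trace format is hypothetical:
    ("create", cell, g) when a cell is queued, ("close", cell, g) when expanded.
    """
    def h(cell):
        # Manhattan distance: admissible and consistent on this grid.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    open_heap = [(h(start), 0, start)]
    best_g = {start: 0}
    parent = {start: None}
    closed = set()
    trace = []

    while open_heap:
        _, g, cell = heapq.heappop(open_heap)
        if cell in closed:
            continue  # stale heap entry
        closed.add(cell)
        trace.append(("close", cell, g))
        if cell == goal:
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1], trace
        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue
            if grid[nr][nc] == "#" or nxt in closed:
                continue
            ng = g + 1
            if ng < best_g.get(nxt, float("inf")):
                best_g[nxt] = ng
                parent[nxt] = cell
                heapq.heappush(open_heap, (ng + h(nxt), ng, nxt))
                trace.append(("create", nxt, ng))
    return None, trace


def path_is_valid(grid, start, goal, path):
    """Answer check: the path runs from start to goal through open, adjacent cells."""
    if not path or path[0] != start or path[-1] != goal:
        return False
    for (r1, c1), (r2, c2) in zip(path, path[1:]):
        if abs(r1 - r2) + abs(c1 - c2) != 1 or grid[r2][c2] == "#":
            return False
    return True


def trace_is_consistent(trace, start):
    """Trace check: a cell may only be closed after it was created."""
    created = {start}
    for op, cell, _ in trace:
        if op == "create":
            created.add(cell)
        elif cell not in created:
            return False
    return True


if __name__ == "__main__":
    maze = ["..#", "..#", "..."]
    path, trace = astar_with_trace(maze, (0, 0), (2, 2))
    print(path_is_valid(maze, (0, 0), (2, 2), path))
    print(trace_is_consistent(trace, (0, 0)))
```

The two validators are independent on purpose: a correct path can be paired with a trace that fails `trace_is_consistent`, and that mismatch between a right answer and an invalid trace is what the summary says the authors observed.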
