frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•5mo ago

Comments

tocs3•5mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Small Web Utility Built on a JSON→Toon Open-Source Library

https://www.utilbolt.com/tools/json-to-toon
1•xohails•1m ago•0 comments

I built a cheaper alternative to Marker.io – not sure if there's a market

https://notedis.com
1•notedis•2m ago•1 comments

How long before a GPU depreciates?

https://www.cnbc.com/2025/11/14/ai-gpu-depreciation-coreweave-nvidia-michael-burry.html
1•pm2222•5m ago•0 comments

Furry Studies conference gathers worldwide wisdom at second annual event

https://dogpatch.press/2025/11/14/furry-studies-conference-second/
2•Kye•8m ago•0 comments

EU Commission breaches own AI guidelines by using ChatGPT in public documents

https://www.iccl.ie/news/european-commission-breaches-own-ai-guidelines-by-using-chatgpt-in-publi...
3•nickslaughter02•8m ago•0 comments

Intentional, Not Reflexive: A Manager's Thoughts on AI

https://scottkosman.com/post/blog/intentional-not-reflexive-a-managers-thoughts-on-ai/
1•scottkosman•8m ago•1 comments

UnPlotter: Extract numerical data from PDF figures

https://www.unplotter.com/
1•bouchard•9m ago•0 comments

Show HN: Vibe Capsule – Turn your music into shareable apps that work offline

https://github.com/hunterirving/vibe_capsule
1•hunterirving•10m ago•0 comments

NHRA legend John Force retires from driving after record 157 wins

https://www.espn.com/racing/story/_/id/46960318/nhra-great-john-force-retires-17-months-traumatic...
2•HardwareLust•11m ago•0 comments

AGI fantasy is a blocker to actual engineering

https://www.tomwphillips.co.uk/2025/11/agi-fantasy-is-a-blocker-to-actual-engineering/
2•tomwphillips•13m ago•0 comments

Luminar is cutting jobs, losing its CFO, and warning of a cash shortage

https://techcrunch.com/2025/10/31/luminar-is-cutting-jobs-losing-its-cfo-and-warning-of-a-cash-sh...
2•PaulHoule•14m ago•0 comments

Capital One Debit-Card Users Aren't All Happy After the Switch to Discover

https://www.wsj.com/finance/banking/capital-one-discover-debit-card-customers-6dc93571
2•sgerenser•14m ago•1 comments

Mathematical Maturity in Elementary School

https://kidswholovemath.substack.com/p/mathematical-maturity-in-elementary
2•sebg•14m ago•0 comments

Show HN: We built a subtitle generator that auto-detects real speaker names

https://harku.com/tools/subtitle-generator
1•howardV•15m ago•0 comments

Nvidia is gearing up to sell servers instead of just GPUs and components

https://www.tomshardware.com/tech-industry/artificial-intelligence/jp-morgan-says-nvidia-is-geari...
2•giuliomagnifico•16m ago•1 comments

A structural regular expression engine for Rust

https://www.sminez.dev/match-it-again-sam/
1•todsacerdoti•16m ago•0 comments

FineWeb2

https://huggingface.co/datasets/HuggingFaceFW/fineweb-2
1•tamnd•17m ago•0 comments

Don't turn your brain off

https://computingeducationthings.substack.com/p/22-dont-turn-your-brain-off
1•azhenley•22m ago•0 comments

Backblaze Drive Stats for Q3 2025

https://www.backblaze.com/blog/backblaze-drive-stats-for-q3-2025/
2•woliveirajr•24m ago•0 comments

Show HN: CBK Agent SDK

https://github.com/chatbotkit/node-sdk/tree/main/packages/agent
1•_pdp_•25m ago•0 comments

AMD GPUs Go Brrr

https://hazyresearch.stanford.edu/blog/2025-11-09-amd-brr
1•todsacerdoti•25m ago•0 comments

Weight-sparse transformers have interpretable circuits [pdf]

https://cdn.openai.com/pdf/41df8f28-d4ef-43e9-aed2-823f9393e470/circuit-sparsity-paper.pdf
2•0x79de•26m ago•0 comments

Prompt generation vs. Context generation [video]

https://www.youtube.com/watch?v=IS_y40zY-hc
1•acro-v•27m ago•1 comments

GPT-5.1 Prompting Guide

https://cookbook.openai.com/examples/gpt-5/gpt-5-1_prompting_guide
1•0x79de•28m ago•0 comments

Rockets vs. Drones: Rethinking Over-Engineered IT

https://tidesofsea.com/the-rocket-building-delusion
2•_phnd_•28m ago•1 comments

Google Releases CodeWiki

https://codewiki.google/
2•0x79de•28m ago•0 comments

Research is cheaper than search

https://slate.greyb.com/blog/research-is-cheaper-than-search/
2•I_Nidhi•30m ago•0 comments

When I stopped trying to be a great engineer

https://supremecodr.medium.com/-56094c225549
1•verax5•32m ago•0 comments

Unikraft Support for MirageOS Unikernels

https://tarides.com/blog/2025-11-13-announcing-unikraft-support-for-mirageos-unikernels/
1•todsacerdoti•35m ago•0 comments

SimRacing Expo Dortmund 2025: Europe Embraces the 1.4ms Revolution

https://www.ark-dynamics.com/post/simracing-expo-dortmund-2025-europe-embraces-the-1-4ms-revolution
2•Kinemaniacs•35m ago•2 comments