frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•7mo ago

Comments

tocs3•7mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Apps Let You Bet on Deportations and Famine. Mainstream Media Is Eating It Up

https://theintercept.com/2025/12/29/polymarket-kalshi-betting-prediction-cnn-news-media/
1•thm•3m ago•0 comments

Show HN: S3Broker – CF Worker library to protect your S3 storage from ransomware

https://github.com/tsunrise/s3broker
1•tsunrise•4m ago•0 comments

Show HN: Tool to pass perfetto traces to an LLM

https://perfetto-to-llm.vercel.app/
1•ak2242•5m ago•0 comments

Nexels

https://lessvrong.com/cs/nexels/
1•ibobev•6m ago•0 comments

Show HN: Supertictactoe.gg – A real-time PvP implementation of Ultimate TTT

https://supertictactoe.gg
1•dheesh•7m ago•0 comments

Direct3D 12: The Behavior of ClearUnorderedAccessViewUint/Float

https://asawicki.info/news_1795_secrets_of_direct3d_12_the_behavior_of_clearunorderedaccessviewui...
1•ibobev•7m ago•0 comments

Microsoft's Nadella overhauls leadership as he plots AI strategy beyond OpenAI

https://www.ft.com/content/255dbecc-5c57-4928-824f-b3f2d764f635
2•JamesAdir•8m ago•1 comments

OpenUSD Core Spec 1.0 is Here

https://aousd.org/blog/foundations-of-open-3d-development-introducing-aousd-core-specification-1-0/
1•ibobev•9m ago•0 comments

RunST does not prevent resources from escaping

https://welltypedwit.ch/posts/runst-does-not-prevent-resources-from-escaping.html
1•todsacerdoti•11m ago•0 comments

ByteDance to pour US$14B into Nvidia chips in 2026

https://www.scmp.com/tech/big-tech/article/3338191/bytedance-pour-us14-billion-nvidia-chips-2026-...
2•mfiguiere•12m ago•0 comments

New Yorker Dr. Berkan's New Channel RogoTRON

https://www.youtube.com/channel/UCGlaL2xCv4X1hDb1fQhU74w
1•northlondoner•15m ago•0 comments

Questions to ask yourself every year

https://gourav.io/blog/yearly-review
2•jerrygoyal•23m ago•1 comments

I Won a Teknofest 2025: A Step-by-Step Guide

https://www.notion.so/yapsgg/How-I-Won-a-TEKNOFEST-2025-A-Step-by-Step-Guide-2d2465f04ab58023bed5...
1•abdibrokhim•24m ago•1 comments

Study links America's favorite cooking oil to obesity

https://medicalxpress.com/news/2025-11-links-america-favorite-cooking-oil.html
2•PaulHoule•32m ago•0 comments

Show HN: Weekly newsletter with tactical frameworks from 50 $1M+ founders

https://www.doanything.com/preview/uXalImXcFZk
1•AlexMorganFndr•33m ago•0 comments

How musicals use motifs to tell stories

https://pudding.cool/2025/12/motifs/
1•gmays•39m ago•0 comments

Ask HN: What to do when Claude Code is writing code?

1•brihati•40m ago•1 comments

Show HN: Schengen Calculator – Avoid €5K Fines for Overstaying EU"

https://owlfacts.com
1•sunrays•40m ago•1 comments

A personal recap of 2025: on running, LLMs, family, coffee, work

https://dimitarmisev.com/blog/2025-recap
1•misev•45m ago•0 comments

I Built a Module System for a Language That Doesn't Have One

https://www.claudianadalin.com/blog/building-pinecone
1•xbmcuser•46m ago•0 comments

Show HN: Magic CSV – Transform CSVs with plain English, no formulas

https://magiccsv.app/
1•bored-developer•48m ago•0 comments

The Lore of the World: Field Notes for a Child's Codex

https://www.theintrinsicperspective.com/p/the-lore-of-the-world
3•Jun8•54m ago•0 comments

Show HN: Agape – human-centered CLI task manager

https://github.com/josequiceno2000/agape
2•josequiceno2000•54m ago•0 comments

Show HN: PDU – Open-source PostgreSQL data rescue tool

https://github.com/wublabdubdub/PDU-PostgreSQLDataUnloader
2•zhangchenPDU•54m ago•1 comments

Build Your Own ML Framework

https://mlsysbook.ai/tinytorch/intro.html
2•auraham•54m ago•0 comments

Observations on safety friction and misclassification in conversational AI

2•ayumi-observer•55m ago•0 comments

A Woman on a NY Subway Just Set the Tone for Next Year

https://www.honest-broker.com/p/a-woman-on-a-ny-subway-just-set-the
4•thomassmith65•55m ago•1 comments

A Woman on a NY Subway Just Set the Tone for Next Year

https://honest-broker.com/p/a-woman-on-a-ny-subway-just-set-the
1•thomassmith65•57m ago•3 comments

Advice for generalists who want to join startups

https://twitter.com/benln/status/2006057848430604705
2•gmays•1h ago•0 comments

Languish – Programming Language Trends

https://tjpalmer.github.io/languish/
3•nickswalker•1h ago•0 comments