frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

The Year Everything Changed

https://networkgames.fyi/the-year-everything-changed
1•daniloc•1m ago•0 comments

A minimal hackable implementation of policy gradients (GRPO, PPO, REINFORCE)

https://github.com/zafstojano/policy-gradients
1•starzmustdie•1m ago•0 comments

An Elizabethan mansion's secrets for staying warm

https://www.bbc.com/future/article/20260116-an-elizabethan-mansions-secrets-for-staying-warm
1•Tachyooon•1m ago•0 comments

AI is speeding into healthcare. Who should regulate it?

https://news.harvard.edu/gazette/story/2026/01/ai-is-speeding-into-healthcare-who-should-regulate...
1•gnabgib•5m ago•0 comments

Signal feature request: mesh networking

https://community.signalusers.org/t/mesh-network-messaging-p2p-networking/2188?page=3
2•arthuredelstein•5m ago•0 comments

The Road to Serfdom (1944) [pdf]

https://dn790006.ca.archive.org/0/items/in.ernet.dli.2015.218162/2015.218162.The-Road_text.pdf
1•RyanShook•5m ago•0 comments

Will we forget what pre-LLM era looked like?

https://invertedpassion.com/will-we-forget-what-pre-llm-era-looked-like/
1•speckx•7m ago•0 comments

Why US cities are reverting 1-way streets back to their original 2-way design

https://apnews.com/article/one-way-streets-safety-indianapolis-louisville-lynchburg-6ca9c3610c447...
1•bikenaga•9m ago•0 comments

Apples, Trees, and Quasimodes

https://systemstack.dev/2025/09/humane-computing/
6•entaloneralie•10m ago•1 comments

The Resonant Computing Manifesto

https://resonantcomputing.org/
5•sinak•11m ago•0 comments

Ask HN: Why are restaurants allowed to question your order based on race?

3•amichail•12m ago•4 comments

My new minimal static site generator

https://maurycyz.com/misc/new_ssg/
2•todsacerdoti•12m ago•0 comments

From Desired State to Negotiated State

https://jlmr.dev/posts/from-desired-state-to-negotiated-state/
2•robteix•14m ago•0 comments

Trump announces plan to hit European countries with tariffs over Greenland

https://www.bbc.co.uk/news/live/c1j8kw866p3t
7•c-oreills•14m ago•0 comments

Pragmatic Bitmap Filters in Microsoft SQL Server

https://www.vldb.org/cidrdb/2026/i-cant-believe-its-not-yannakakis-pragmatic-bitmap-filters-in-mi...
2•tanelpoder•16m ago•0 comments

Top Gadgets That Stole the Spotlight at CES 2026

https://techfusiondaily.com/top-10-gadgets-ces-2026/
2•nelkazzu•17m ago•0 comments

Show HN: Cppsp v1.4 –– multi-var support: var a,b,C = 1,2,3 int

https://github.com/user19870/cppsp
2•user19870•18m ago•0 comments

Nothin

2•BadOmen•20m ago•1 comments

Chinese Fishing Boats Quietly Form Vast Sea Barriers

https://www.nytimes.com/interactive/2026/01/16/world/asia/china-ships-fishing-militia-blockade.html
3•donohoe•21m ago•0 comments

Claude Code with Anthropic API Compatibility [ollama blog]

https://ollama.com/blog/claude
3•laacz•23m ago•0 comments

Mandeville's Travels

https://en.wikipedia.org/wiki/Mandeville%27s_Travels
2•petethomas•24m ago•0 comments

Syscallargs: List all Linux system calls with their arguments from tracefs

https://tanelpoder.com/posts/list-linux-system-call-arguments-with-syscallargs/
2•tanelpoder•27m ago•0 comments

World models could unlock the next revolution in artificial intelligence

https://www.scientificamerican.com/article/world-models-could-unlock-the-next-revolution-in-artif...
2•beardyw•27m ago•0 comments

Have an Arrest Plan

https://old.reddit.com/r/selfhosted/comments/1qfffz5/have_an_arrest_plan/
2•speckx•27m ago•1 comments

How to use the hn4 file system?

https://github.com/hn4-dev/hn4
1•kentsummer•28m ago•0 comments

Which countries are adopting AI the fastest?

https://www.economist.com/graphic-detail/2026/01/12/which-countries-are-adopting-ai-the-fastest
1•andsoitis•28m ago•0 comments

Haptic Pad – 6 Button Macropad with haptic wheel

https://github.com/dmcke5/Hapticpad
1•saltmate•28m ago•0 comments

Tired of recruiters who ask for availability via email?

https://github.com/smccaffrey/wya
2•smccaffrey•29m ago•0 comments

Show HN: How to build decentralized apps on the new Freenet

https://freenet.org/resources/manual/tutorial/
1•sanity•29m ago•0 comments

Same-sex sexual behaviour in primates is a survival strategy

https://www.economist.com/science-and-technology/2026/01/14/same-sex-sexual-behaviour-in-primates...
2•andsoitis•30m ago•0 comments