frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•6mo ago

Comments

tocs3•6mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

The Hitchhiker's Guide to Coherent Fabrics: 5 Programming Rules

https://www.sigarch.org/the-hitchhikers-guide-to-coherent-fabrics-5-programming-rules-for-cxl-nvl...
1•matt_d•1m ago•0 comments

First ChatGPT Ad (Link to Image)

https://drive.google.com/file/d/1FA4e-2mGuWPAxrmbj-gRySDbZZfWiKO4/view?usp=sharing
1•Dim_A•1m ago•0 comments

Show HN: Debrief, an AI tracker for every work thread

https://www.trydebrief.com/
1•baetylus•2m ago•0 comments

Preloading File Explorer in Windows 11 Doubles RAM, Offers Minimal Speed Boost

https://www.techpowerup.com/343459/preloading-file-explorer-in-windows-11-doubles-ram-usage-offer...
1•speckx•3m ago•0 comments

The 'Race Against Time' to Save Music Legends' Decaying Tapes

https://www.nytimes.com/2025/12/01/arts/music/iron-mountain-audio-tape-preservation.html
2•JamesAdir•5m ago•2 comments

Tell HN: Nascent idea: "super intelligence" is not about superior intelligence

2•keepamovin•5m ago•1 comments

Show HN: MCP Server for Real-Time NSE/BSE Data

https://github.com/bshada/nse-bse-mcp
2•_bshada•7m ago•1 comments

Synopsys and Nvidia Double Down on Acceleration

https://morethanmoore.substack.com/p/synopsys-and-nvidia-double-down-on
1•blakepelton•11m ago•0 comments

TikTok's Enshittification (2023)

https://pluralistic.net/2023/01/21/potemkin-ai/#hey-guys
1•redbell•13m ago•0 comments

AI is coming for the world of competitive Excel

https://thehustle.co/originals/ai-is-coming-for-the-world-of-competitive-excel
1•shsachdev•13m ago•0 comments

An Introduction to the Empirics of Auctions

https://nicholasdecker.substack.com/p/an-introduction-to-auctions
1•paulpauper•15m ago•0 comments

Coregex: Go regex lib 3-3000x+ as fast as stdlib via multi-engine arch and SIMD

https://github.com/coregx/coregex
1•benhoyt•15m ago•0 comments

Your Intelligence Isn't Making You Lonely

https://cognitivewonderland.substack.com/p/your-intelligence-isnt-making-you
1•paulpauper•16m ago•0 comments

A single-fibre computer enables textile networks and distributed inference'

https://www.rle.mit.edu/a-single-fibre-computer-enables-textile-networks-and-distributed-inference/
2•colinprince•17m ago•1 comments

eXoWin9x

https://www.retro-exo.com/win9x_M.html
1•redbell•17m ago•0 comments

Show HN: CodeViz – A diagram editor that understands your code (YC S24)

2•LiamPrevelige•18m ago•0 comments

Assumption in Apps

https://rumination.computer/app-assumptions
1•hazn•19m ago•0 comments

Validate ideas fast using Reddit

https://microsaasresearch.com
1•hmontazeri•19m ago•2 comments

Seeking Work

https://docs.google.com/document/d/11cD8-bSwKRINIWQ22JpzUNZkfVln7D8s/edit?usp=sharing&ouid=107263...
1•ikiselev•20m ago•1 comments

InstaPoT: Using InstaVM with DSPy's Program of Thought

https://instavm.io/blog/instavm-dspy-program-of-thought
1•mkagenius•20m ago•0 comments

The Coming War on General Computation [Cory Doctorow, 2011]

http://opentranscripts.org/transcript/coming-war-general-computation/
2•sundarurfriend•20m ago•0 comments

Asteroid 2024 YR4 was Earth's first real-life planetary defense test

https://www.universetoday.com/articles/asteroid-2024-yr4-was-earths-first-real-life-defense-test
2•speckx•21m ago•1 comments

LLMs perf on Path-X or Path-256?

1•timdel•24m ago•0 comments

Show HN: ReferralLoop – Waitlist platform with viral referral mechanics

https://www.referralloop.dev/
1•soyzamudio•24m ago•1 comments

Show HN: Garmin Watch Face

https://github.com/Lallassu/garminwatchface
2•nergal•28m ago•0 comments

Show HN: PayPerBill, pay per invoice sent without monthly subscriptions

https://payperbillapp.com/
1•blampack•29m ago•0 comments

A big list of things I disable in WordPress

https://shkspr.mobi/blog/2025/11/a-big-list-of-things-i-disable-in-wordpress/
3•speckx•31m ago•0 comments

NFS Server on the Android Smartphone: Termux, proot-distro, Alpine Linux, unfs3

https://gist.github.com/NoteAfterNote/e1719f4029b91918d996216939d5bff2
1•sipofwater•31m ago•0 comments

IDF tightens cellphone regulations, bars Android phones

https://www.israelnationalnews.com/news/418418
4•walterbell•32m ago•1 comments

Map of the developing brain provides insight into origin of mental disorders

https://english.elpais.com/science-tech/2025-11-07/first-map-of-the-developing-brain-provides-ins...
2•PaulHoule•35m ago•0 comments