frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Exploring the Myers Diff Algorithm in ColdFusion

https://www.bennadel.com/blog/4867-exploring-the-myers-diff-algorithm-in-coldfusion.htm
1•speckx•52s ago•0 comments

Git Protects You

https://RuntimeArguments.fm/2469780/episodes/18555806-20-git-protects-you
1•jammcq•55s ago•1 comments

Show HN: Donkey Support –> Reply to Support Chats from Slack/Discord/Telegram

https://www.donkey.support/
1•sjorsfest•2m ago•0 comments

A Linux app that darkens your screen when you slouch

https://github.com/vadi2/postured
1•VadimPR•4m ago•1 comments

Silencing the Kinesis Advantage 2 (2022)

https://yboulkaid.com/2022/03/15/kinesis
1•yboulkaid•5m ago•0 comments

A Complete Guide to Neural Network Optimizers

https://chizkidd.github.io//2026/01/22/neural-net-optimizers/
1•ibobev•6m ago•0 comments

Categories of Inference-Time Scaling for Improved LLM Reasoning

https://magazine.sebastianraschka.com/p/categories-of-inference-time-scaling
1•ibobev•7m ago•0 comments

What Your VPN Knows About You (and Why It Matters)

1•CulperLink•7m ago•0 comments

The Devastating Decline of a Brilliant Young Coder (2020)

https://www.wired.com/story/lee-holloway-devastating-decline-brilliant-young-coder/
1•abelanger•7m ago•1 comments

IIFE for Complex Initialization

https://www.cppstories.com/2016/11/iife-for-complex-initialization/
1•ibobev•7m ago•0 comments

The Rise and Fall of the American Monoculture

https://www.wsj.com/business/media/american-pop-culture-history-ce8672f1
1•mikhael•8m ago•0 comments

Reetcode: Extension to add LeetCode style features to Rosalind

https://github.com/zkirby/reetcode
1•zkirby•8m ago•0 comments

What Were the Crusades and Why Do They Still Matter?

https://www.thecollector.com/what-were-crusades/
1•Tomte•8m ago•0 comments

Show HN: Sign and Attest Kubernetes Manifests

https://github.com/meigma/blob-argo-cmp
2•aliasxneo•9m ago•0 comments

Sonic R: The R&R mod – Hacks the Saturn Racing Game into a Platformer

https://32bits.substack.com/p/sonic-r-the-r-and-r-mod
3•regus•10m ago•0 comments

Meta drops appeal against ruling for non-algorithmic timelines in Nederlands

https://nltimes.nl/2026/01/26/meta-drops-appeal-court-ruling-requiring-non-algorithmic-social-med...
2•giuliomagnifico•10m ago•0 comments

FOSS maintenance as emotional labor: software stewardship mimics librarianship

https://www.hughrundle.net/i-accidentally-became-a-foss-maintainer-and-all-i-got-was-this-lousy-n...
1•speckx•10m ago•0 comments

Show HN: A simple way to send secrets between teammates

https://www.30s.sh/
2•dannytatom•11m ago•1 comments

Rare Data Hunters [video]

https://www.youtube.com/watch?v=IU4ByUbDKNc
1•olivierestsage•11m ago•0 comments

The all new Mecha Comet, live on Kickstarter

https://www.youtube.com/watch?v=utZajNmPe1Y
2•krthr•11m ago•0 comments

Kevin Kelly: The March of Nines

https://kk.org/thetechnium/the-march-of-nines/
1•swolpers•11m ago•0 comments

Animate – iPad app for raster and vector animation by Canvas Software

https://www.canvassoftware.org
1•authman2•14m ago•1 comments

Payment processors were against CSAM until Grok started making it

https://www.theverge.com/ai-artificial-intelligence/867874/stripe-visa-mastercard-amex-csam-grok
3•cdrnsf•14m ago•0 comments

Show HN: A Local OS for LLMs. MIT License. Zero Hallucinations. Local Memory

https://github.com/merchantmoh-debug/Remember-Me-AI
1•MohskiBroskiAI•14m ago•2 comments

RCS, SMS via the internet, is good, but that doesn't matter

https://manualdousuario.net/en/using-rcs/
1•rpgbr•15m ago•0 comments

Tech workers urge CEOs to condemn ICE

https://www.axios.com/2026/01/26/tech-workers-ceos-ice
4•gdilla•16m ago•3 comments

Sabotage or 'systems failure': What caused the Air India crash?

https://www.telegraph.co.uk/news/2026/01/24/air-india-crash-evidence/
1•gmac•16m ago•1 comments

Show HN: Malan Chat, a full immersion language learning app for 62 languages

https://www.malan.chat
1•sam_osterfeld•16m ago•0 comments

US Army 11th Airborne completes annual cold-weather training exercise (2025)

https://www.webcenterfairbanks.com/2025/02/06/itg-11th-airborne-completes-annual-cold-weather-tra...
1•throw0101a•18m ago•1 comments

Global Investment in Clean Tech Hit a New High Last Year

https://e360.yale.edu/digest/2025-clean-energy-investment
2•speckx•18m ago•0 comments