frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Claude Code sessions are now link-shareable

https://github.com/OmkarKovvali/claude-session-share
1•reflectivetrap•1m ago•1 comments

Nearly 5M Accounts Removed Under Australia's New Social Media Ban

https://www.nytimes.com/2026/01/15/world/australia/social-media-ban-australia.html
1•bookofjoe•3m ago•1 comments

Anything Will Work (In AI)

https://publish.obsidian.md/ueaj/Machine+Learning/Theory/Anything+WILL+work
1•qouteall•6m ago•0 comments

Matthew McConaughey trademarks catchphrase in bid to beat AI fakes

https://www.theguardian.com/film/2026/jan/15/matthew-mcconaughey-trademarks-all-right-all-right-a...
2•puttycat•8m ago•0 comments

ClickHouse Handles Strings

https://rushter.com/blog/clickhouse-strings/
2•gm678•10m ago•0 comments

Drone Hacking Part 1: Dumping Firmware and Bruteforcing ECC

https://neodyme.io/en/blog/drone_hacking_part_1/
2•tripdout•12m ago•0 comments

Shamash an IntelliJ plugin and CLI to enforce JVM architecture boundaries

https://github.com/aalsanie/shamash
1•aalsanie•13m ago•1 comments

Plunging US Birth Rate Leaves Too Many Colleges with Too Few Kids

https://www.bloomberg.com/graphics/2026-college-enrollment-cliff/
2•toomuchtodo•17m ago•1 comments

Is it still worth pursuing a software startup?

1•newbebee•18m ago•0 comments

MySQL GitHub repository did not have commits for three months

https://github.com/mysql/mysql-server/graphs/commit-activity
1•chemodax•19m ago•0 comments

Justice Dept. launches criminal investigation of Minnesota governor

https://www.washingtonpost.com/national-security/2026/01/16/trump-minnesota-walz-frey-criminal-in...
5•perihelions•19m ago•0 comments

Bell Boy BB2: backup-first Windows ODE for safe file ops (PowerShell 5.1)

https://github.com/TrishulaSoftware/BellBoy-BB2
1•trishulasoftwre•22m ago•1 comments

Show HN: Explain Yourself – An AI party game app built with SwiftUI

1•sntedo•23m ago•0 comments

daff: data diff

https://paulfitz.github.io/daff/
2•indigodaddy•27m ago•0 comments

Ask HN: What will happen to dev work if companies start using LLM coding agents

2•tbharath•28m ago•0 comments

Ancient designs may be the first evidence of humans doing math

https://www.cnn.com/2026/01/16/science/halafian-pottery-first-math-intl-scli
1•smoyer•30m ago•0 comments

Golb's Law of Laws

https://abidsikder.com/blog/2026-01-16-golb/
1•caaaadr•33m ago•1 comments

Donald Trump Wants to Cancel the Midterm Elections (Jamelle Bouie) [video]

https://www.youtube.com/watch?v=YRV-9vO4Grs
2•consumer451•34m ago•1 comments

Officials showed off a robo-bus in DC. It got hit by a Tesla driver

https://www.msn.com/en-us/news/us/officials-showed-off-a-robo-bus-in-dc-it-got-hit-by-a-tesla-dri...
2•MilnerRoute•34m ago•0 comments

Compressing Cellular Automata Images (2017)

https://cloudinary.com/blog/compressing_cellular_automata
2•matthberg•37m ago•0 comments

CSS Houdini

https://developer.mozilla.org/en-US/docs/Web/CSS/Guides/Properties_and_values_API/Houdini
1•embedding-shape•42m ago•0 comments

Building Amiga 4000T (Part 1, Daughterboards)

https://wordpress.hertell.nu/?p=1942
1•doener•44m ago•0 comments

The Bitter Lesson of Agent Frameworks

https://browser-use.com/posts/bitter-lesson-agent-frameworks
2•gregpr07•45m ago•0 comments

Pituffik Space Base

https://en.wikipedia.org/wiki/Pituffik_Space_Base
1•doener•47m ago•0 comments

Computation Sovereignty and the Future of Tastes – An RGA Account

https://jimiwen.substack.com/p/death-of-the-architect
1•jimiwen•51m ago•0 comments

Myths we tell ourselves about software engineering

https://medium.com/feenk/rewilding-software-engineering-ca3ad1e612d8
1•todsacerdoti•54m ago•0 comments

I'm Being Prosecuted for the Opposite of Insider Trading

https://www.wsj.com/opinion/im-being-prosecuted-for-the-opposite-of-insider-trading-3a7b5f85
9•mudil•1h ago•5 comments

Visual Lexicon of Aztec Hieroglyphs

https://aztecglyphs.wired-humanities.org/content/visual-lexicon-aztec-hieroglyphs
1•thunderbong•1h ago•0 comments

Why Flutter Isn't Dead

https://shorebird.dev/blog/flutter-not-dead/
2•satvikpendem•1h ago•0 comments

RFK, Jr., shifts focus to questioning whether cell phones are safe

https://www.scientificamerican.com/article/rfk-jr-shifts-focus-to-questioning-whether-cell-phones...
5•voxadam•1h ago•2 comments