frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•10mo ago

Comments

tocs3•10mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

This Linux FS was supposed to change everything–here's the dark reason it failed

https://www.howtogeek.com/this-linux-filesystem-was-supposed-to-change-everything-heres-the-dark-...
1•giis•1m ago•0 comments

AI Proteomics Competition 2026 – $13K Prize, Internships and Compute Support

https://www.bohrium.com/competitions/9813928053?tab=introduce
2•choubao•4m ago•1 comments

Kingsight – 6 AI agents that teach before letting you code

https://github.com/kings-nexus/kingsight
1•king-nexus•5m ago•0 comments

Archaeologists Uncover Oldest Gold Relics in a 5th Millennium BC Tomb

https://dailygalaxy.com/2026/03/bulgaria-varna-prehistoric-grave-with-oldest-gold-artifacts/
1•ipeev•6m ago•0 comments

Acquisition Plans of the Radarsat Constellation Mission

https://open.canada.ca/data/en/dataset/d2a5bf2b-064c-4baf-b69e-0986ae6922cf
1•marklit•8m ago•0 comments

How to earn credibility with engineers (lessons from a college radio station)

https://dev.jimgrey.net/2026/03/11/how-to-earn-credibility-with-engineers-lessons-from-a-college-...
1•kiyanwang•9m ago•0 comments

Exploit vs. Explore – By Mike Fisher – Fish Food for Thought

https://mikefisher.substack.com/p/exploit-vs-explore
1•kiyanwang•10m ago•0 comments

Migrating Etsy's database sharding to Vitess

https://www.etsy.com/codeascraft/migrating-etsyas-database-sharding-to-vitess
1•dubesar55•16m ago•0 comments

Apple's John Ternus Profile: The Likely Successor to Tim Cook as CEO

https://www.bloomberg.com/features/2026-apple-next-ceo/
1•Tomte•17m ago•0 comments

Trulycodes Provides Verified Coupon Codes for Confident Savings

https://trulycodes.com/
1•trulycodes•19m ago•0 comments

Snowflake makes cuts as part of 'targeted adjustments' to the company's strategy

https://www.businessinsider.com/snowflake-layoffs-strategy-ai-growth-2026-3
1•taubek•21m ago•0 comments

Open-source server panel in Rust (57MB RAM, 371 endpoints, zero cost)

https://dockpanel.dev/
1•ovexro•21m ago•0 comments

Kimchi-derived probiotic found to promote excretion of intestinal nanoplastics

https://phys.org/news/2026-03-kimchi-derived-probiotic-excretion-intestinal.html
3•teleforce•28m ago•0 comments

Why repositories need semantic memory, not bigger context windows

https://krzysztofdudek.substack.com/p/code-has-logic-it-does-not-have-meaning
1•chrisdudek•30m ago•0 comments

Human Resources

https://suvamsh.com/blog/human-resources/
1•suvamsh•37m ago•2 comments

Show HN: Fixy – Real-time group chat with humans and AI agents (GPT, Claude,..)

https://fixy.ai/
1•frdfrd•37m ago•0 comments

Agents kept getting stuck searching for Solar icons; I built a CLI to solve this

https://www.npmjs.com/package/search-solar
1•_bittere•38m ago•1 comments

Ski lift demolished as glacier on Germany's highest mountain melts away

https://www.rte.ie/news/newslens/2026/0321/1564451-ski-lift-demolished-as-german-glacier-melts-away/
3•austinallegro•39m ago•0 comments

Introducing AI chunking to semchunk

https://isaacus.com/blog/introducing-ai-chunking-to-semchunk
2•ubutler•44m ago•0 comments

Show HN: I built a census for AI agents – first record is a robot from 1890

https://ghostshell.host
1•GhostShellJoule•51m ago•1 comments

Tell HN: YouTube - the "Jewel of the Internet" has faded

3•wewewedxfgdf•52m ago•1 comments

Changing the World

https://geohot.github.io//blog/jekyll/update/2026/03/23/changing-the-world.html
1•curtsmith•52m ago•0 comments

A web of sensors: How the US spots missiles and drones from Iran

https://theconversation.com/a-web-of-sensors-how-the-us-spots-missiles-and-drones-from-iran-278865
2•1659447091•52m ago•0 comments

Air Canada CRJ collides with fire fighting truck on landing in New York

https://www.flightradar24.com/blog/flight-tracking-news/major-incident/air-canada-crj-collides-wi...
4•mcbain•52m ago•0 comments

Show HN: Claudebox – Your Claude Subscription as Personal API

https://github.com/ArmanJR/claudebox
2•armanj•55m ago•0 comments

Show HN: Real-Time Robot Motion Planning in the Browser with WASM

https://zkingston.com/vamp-web/
1•zak_kingston•58m ago•0 comments

Art is not being replaced by AI

https://www.nicolasdeory.com/thoughts/ai-wont-replace-art
1•nicodeory•1h ago•0 comments

"We had to kill the gecko in order to save it"

https://www.theguardian.com/environment/2026/mar/22/i-discovered-three-new-geckos-in-cambodia-lim...
2•zabzonk•1h ago•1 comments

Joy – Trust Network for AI Agents (7k agents registered)

https://joy-connect.fly.dev
1•savvyllm•1h ago•0 comments

Show HN: Mamba SSM in Rust – training and inference with custom CUDA kernels

https://github.com/silvermpx/mamba-rs
1•silvermpx•1h ago•0 comments