frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Lessons from running Meetups for 10 years

https://www.jakeworth.com/posts/how-i-organize-a-meetup/
1•jwworth•1m ago•0 comments

An experiment: an agent-native blockchain run by agents

https://github.com/nhestrompia/seloria
1•nhestrompia•1m ago•1 comments

Theseus - Train like a foundation lab

https://github.com/Jemoka/theseus
1•shetaye•1m ago•0 comments

EzAuth – Simple and plugnplay auth library for Golang

https://github.com/josuebrunel/ezauth
1•josuebrunel•2m ago•1 comments

DX12 Frame Interception Layer for FG Research

1•OrganicCoconut•2m ago•0 comments

"The AI Con" Con

https://benthams.substack.com/p/the-ai-con-con
1•ai_critic•3m ago•0 comments

AI After Drug Development

https://asteriskmag.com/issues/13/ai-after-drug-development
1•abhishaike•3m ago•0 comments

BBC joins Colombian commandos fighting 'never-ending battle' against drug gangs

https://www.bbc.co.uk/news/articles/c04105ywkkqo
1•mmarian•3m ago•0 comments

The Coasean Singularity? Demand, Supply, and Market Design with AI Agents

https://www.nber.org/books-and-chapters/economics-transformative-ai/coasean-singularity-demand-su...
1•surprisetalk•6m ago•0 comments

Infinite Flowers Zoomquilt

https://infiniteflowers.net/
1•surprisetalk•6m ago•0 comments

Quantifying Multi-Track Novels

https://kaleidoscopemind.substack.com/p/quantifying-multi-track-novels
1•surprisetalk•6m ago•0 comments

Disneyland History and Other Disney Park History

https://yesterland.com/
1•surprisetalk•6m ago•0 comments

Few things are worth building

https://twitter.com/jobergum/status/2018706126842294315
1•tosh•6m ago•0 comments

Show HN: OpsBrief – Stop wasting 30 minutes per incident gathering context

https://opsbrief.io
1•darlontrofy•7m ago•0 comments

Intel Announces Xeon 600 Series This Is Granite Rapids for Workstations

https://www.servethehome.com/intel-announces-xeon-600-series-granite-rapids-for-workstations/
1•rbanffy•8m ago•0 comments

How the OpenSSL community was built on Heartbleed [video]

https://fosdem.org/2026/schedule/event/CLBXJC-openssl-community-heartbleed/
1•jlericson•9m ago•0 comments

Data centers in space makes no sense

https://civai.org/blog/space-data-centers
5•ajyoon•9m ago•0 comments

Is Lotterygamedevelopers.com the Right Lottery Development Partner?

https://www.slavnastudio.com/lottery-and-bingo-game-development-services
1•Andrew0416•9m ago•1 comments

Show HN: Openground – open-source, on-device documentation indexing for agents

https://github.com/poweroutlet2/openground
1•poweroutlet2•11m ago•0 comments

Oracle's Financing Primes the OpenAI Pump

https://www.nextplatform.com/2026/02/02/oracles-financing-primes-the-openai-pump/
1•rbanffy•11m ago•0 comments

Life is the Sum Total of 2k Mondays

https://www.joanwestenberg.com/your-life-is-the-sum-total-of-2-000-mondays/
1•speckx•11m ago•0 comments

Using a CSV File in S3 as a "Database"

https://tim.bai.uno/using-a-csv-file-in-s3-as-a-database-a-surprisingly-practical-pattern/
2•timmit•12m ago•0 comments

China Moon Mission: Aiming for 2030 Lunar Landing

https://spectrum.ieee.org/china-moon-mission-mengzhou-artemis
6•rbanffy•15m ago•0 comments

No Source Code == No Patent

https://albertcory50.substack.com/p/no-source-code-no-patent
2•SnobolForever•15m ago•0 comments

macOS Hardening: A New Series

https://bytearchitect.io/macos-security/macOS-Hardening-a-new-series/
2•rantingdemon•15m ago•0 comments

How you're going to keep your job when Opus 5 will kill it

https://twitter.com/realmcore_/status/2018762897971990830
1•akira_067•16m ago•0 comments

Anthropic 2026 Agentic Coding Trends Report [pdf]

https://resources.anthropic.com/hubfs/2026%20Agentic%20Coding%20Trends%20Report.pdf
2•armcat•16m ago•0 comments

Show HN: Civie. Anonymous daily civic questions.

https://www.civie.org/
1•gucduck•16m ago•0 comments

Liberty Ad Resistance

https://mattgemmell.scot/liberty-as-resistance/
1•theraven•17m ago•0 comments

Shelley Is a Coding Agent

https://github.com/boldsoftware/shelley
1•tosh•17m ago•0 comments