frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•1y ago

Comments

tocs3•1y ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Embedded acoustic AI with <16ms latency running on 8MB RAM

https://www.voisace.com/blog
1•shermanliu•8m ago•0 comments

BambuStudio has been violating PrusaSlicer AGPL license since their fork

https://xcancel.com/josefprusa/status/2054602354851254330
1•Tomte•9m ago•0 comments

Concerning Emacs (and Jazz)

https://omidmash.de/blog#concerning-emacs
1•omidmash•13m ago•0 comments

Show HN: Chord Commander – A webapp to organize guitar chords

https://codeberg.org/joexo/chord-commander
1•joexo•16m ago•0 comments

SpaceX IPO: Nice Try Though [video]

https://www.youtube.com/watch?v=IHD8BDFYyGI
1•u1hcw9nx•19m ago•0 comments

A New Supercarrier Emerges Tracking China's Fourth Aircraft Carrier

https://features.csis.org/hiddenreach/china-fourth-carrier/
1•_____k•20m ago•0 comments

Legends of the Ancient Web (2017)

https://idlewords.com/talks/ancient_web.htm
1•downbad_•22m ago•0 comments

Building an AWS Lambda-Like Runtime with Firecracker MicroVMs

https://medium.com/@vivek1502/building-an-aws-lambda-like-runtime-with-firecracker-microvms-42a41...
1•nreece•26m ago•0 comments

Does anyone in your organisation own "correctness" in your AI products?

https://alokit.substack.com/p/nobody-in-your-organization-owns
2•avikalp•30m ago•0 comments

ChatGPT as the AOL of AI

https://rebecca-powell.com/posts/return-on-intelligence-02-moats/
1•maille•33m ago•1 comments

Ask HN: How do small teams securely share env files?

1•tmr_praveen•34m ago•0 comments

Pausing New Challenges – Codecrafters

https://codecrafters.io/blog/pausing-new-challenges
13•prakashqwerty•37m ago•2 comments

I reproduced a Claude Code RCE. The bug pattern is everywhere

https://vechron.com/2026/05/i-reproduced-a-claude-code-rce-the-bug-pattern-is-everywhere/
4•GeorgeWoff25•37m ago•1 comments

Show HN: GobanFTP – the board game Go played through FTP listings

https://github.com/molang163/GobanFTP
1•molang163•39m ago•0 comments

The three futures nobody is building for

https://andrebyrd.substack.com/p/the-three-futures-nobody-is-building-for
1•manofstyle04•40m ago•0 comments

You're Being Judged

https://zenodo.org/records/20352897
1•anasteciadunu•42m ago•0 comments

Nobody Understands Kafka Costs

https://getkafkanated.substack.com/p/nobody-understands-kafka-costs-stanislav
1•enether•43m ago•0 comments

Show HN: Klimkit: my Codex setup for multiple machines

https://github.com/klimentij/klimkit
1•klimentij•43m ago•0 comments

Twelve Ways to Be Wrong About AI-Assisted Coding

https://third-bit.com/2026/05/20/twelve-ways-to-be-wrong/
2•signa11•45m ago•2 comments

AI Ops SOP Pack: SOPs for reviewing AI-assisted engineering work

https://github.com/monkidy/ai-ops-sop-pack
1•monkidy•45m ago•0 comments

Show HN: Source-check politician stock-trade claims against public filings

https://tinyopsstudio.com/congress-disclosure-watchlist-digest
2•tinyopsstudio•47m ago•0 comments

Don't Read the Comments

https://kennethreitz.org/essays/2026-04-10-dont_read_the_comments
3•NicoHartmann•54m ago•0 comments

An interactive linear algebra primer aimed at LLM readers

https://algo-rhythm.dev/en/
6•bytegogogo•54m ago•0 comments

The most RAM efficient modern Linux. Noctalia v5 and LabWC and Artix [video]

https://www.youtube.com/watch?v=CnG32ZOi11s
2•grigio•54m ago•0 comments

AI Reconstructed Dead Pilots' Voices from Public NTSB Records

https://firethering.com/ai-recreated-dead-pilots-voices-ntsb-database/
1•steveharing1•55m ago•0 comments

Show HN: Larksson-lang: everything is a map or an atom

https://github.com/arthurzhu29/larksson
2•arthurzhu29•59m ago•0 comments

Experience: We found a baby on the subway – now he's our 26-year-old son

https://www.theguardian.com/lifeandstyle/2026/may/22/experience-found-baby-subway-now-26-year-old...
44•Michelangelo11•1h ago•5 comments

Don't Roll Your Own

https://susam.net/do-not-roll-your-own.html
3•Tomte•1h ago•0 comments

Sci-bot – AI-powered research assistant, powered by Sci-Hub

https://sci-bot.ru/
2•gasull•1h ago•0 comments

The New Luddite Movement

https://www.ft.com/content/f5c96fa6-5b9b-4951-b71d-e32b3b57d8df
2•quick_brown_fox•1h ago•0 comments