frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•10mo ago

Comments

tocs3•10mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Show HN: UX Agent Mac app running continuously locally using Gemma 4

https://github.com/tommyjepsen/ux-agent-app
1•tommyjepsen•3m ago•0 comments

Tell HN: I know I dislike a developer

1•ishener•4m ago•0 comments

The First Bike Bell Designed to Penetrate Noise-Cancelling Headphones

https://www.youtube.com/watch?v=zDaVPfpQvPI
1•Ahmedb•5m ago•0 comments

Ask HN: What are you building that's not AI related?

1•meander_water•6m ago•0 comments

AI company's breached biometrics, ID document images make deepfake fraud easier

https://www.biometricupdate.com/202604/ai-companys-breached-biometrics-id-document-images-make-de...
1•tagyro•8m ago•0 comments

Greece announces social media ban for under-15s over anxiety and sleep problems

https://www.theguardian.com/world/2026/apr/08/greece-proposes-social-media-ban-under-15s-anxiety-...
1•Growtika•8m ago•0 comments

I Built an Offline File Sync System

https://github.com/anurag-as/Caravault
1•sampathanurag3•9m ago•0 comments

Tech industry lays off nearly 80k employees in the first quarter of 2026

https://www.tomshardware.com/tech-industry/tech-industry-lays-off-nearly-80-000-employees-in-the-...
1•Growtika•13m ago•0 comments

Fears net zero is 'next Brexit' as oil crisis fuels political climate divide

https://www.theguardian.com/environment/2026/mar/26/fears-net-zero-is-next-brexit-as-oil-crisis-f...
1•PaulHoule•14m ago•0 comments

Freestyle Ax Audit

https://techstackups.com/articles/freestyle-ax-audit/
1•ritzaco•14m ago•0 comments

New problem: AI finds too many bugs

https://etn.se/73048
2•etn_se•16m ago•3 comments

Show HN: D3 4 Layer Magma Graph – Cloak Stealth Browser

https://vektormemory.com/docs/
1•vektormemory•18m ago•0 comments

Help Keep Thunderbird Alive

https://updates.thunderbird.net/en-US/thunderbird/140.0/apr26-1e/donate/
1•playfultones•18m ago•0 comments

Lipovive – Optimize Mitochondrial Health and Burn Fat

https://www.morningstar.com/news/accesswire/1138075msn/lipovive-reviews-shocking-2026-report-what...
1•tayghalu•19m ago•0 comments

Show HN: Chrome extension to filter HN comments by user karma/account age

https://chromewebstore.google.com/detail/hn-users-filter/ineikanbokfebefjmhinpbmgablaebom
1•csomar•20m ago•0 comments

The President Speaks Genocide

https://snyder.substack.com/p/the-president-speaks-genocide
1•hkhn•21m ago•0 comments

Functional Programming with Bananas Lenses Envelopes and Barbed Wire (1991) [pdf]

https://maartenfokkinga.github.io/utwente/mmf91m.pdf
1•tosh•21m ago•0 comments

Show HN: A dev tool for routing local traffic, built with Pingora

https://github.com/antonguzun/hijakora
1•anophelon•23m ago•1 comments

One Million Prompt – $1 per block, AI generates collective art daily

1•danielhidalgo•24m ago•0 comments

Investigating Split Locks on x86-64 – By Chester Lam

https://chipsandcheese.com/p/investigating-split-locks-on-x86
1•rbanffy•25m ago•0 comments

The demise of software engineering jobs has been greatly exaggerated

https://www.cnn.com/2026/04/08/tech/ai-software-developer-jobs
3•perelin•25m ago•0 comments

Algorithms for Modern Hardware

https://en.algorithmica.org/hpc/
1•tosh•26m ago•0 comments

NanoCorp – autonomous companies run by AI

https://www.nanocorp.so
1•kandu•29m ago•0 comments

The Learning Firm Under Poverty of Stimulus

https://jimiwen.substack.com/p/the-learning-firm-under-poverty-of
1•jimiwen•35m ago•0 comments

Ora2Pg Just Saved Indian Taxpayers More Than a Million Dollars per Year

https://hexacluster.ai/blog/ora2pg-just-saved-indian-taxpayers-more-than-a-million-dollars-per-year
2•avivallssa•37m ago•0 comments

How HN: DocoAPI – API docs that auto-sync from your OpenAPI spec

https://docoapi.com/
3•onmyway133•38m ago•0 comments

Clarke and Dawe – The Front Fell Off [video] (1991)

https://www.youtube.com/watch?v=3m5qxZm_JqM
2•walterbell•40m ago•0 comments

Autonomous Rocket Landing with Reinforcement Learning (YouTube)

https://www.youtube.com/watch?v=1oI-Gh8R_HE
1•rafacm•41m ago•1 comments

Iran-linked hackers disrupt operations at US critical infrastructure sites

https://arstechnica.com/security/2026/04/iran-linked-hackers-disrupt-operations-at-us-critical-in...
2•joozio•42m ago•0 comments

Show HN: PostgreSQL running in the browser, persisting to S3

https://www.zerofs.net/postgresql-in-the-browser
1•Eikon•43m ago•0 comments