frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•1y ago

Comments

tocs3•1y ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

MIT EECS/CSAIL Agentic Coding in Practice Seminar Series

https://people.csail.mit.edu/saman/acpss/
1•matt_d•51s ago•0 comments

The Load-Balance Problem Behind Hybrid Parallelism

https://hecate0821.github.io/blogs/hybrid-parallel-post-training/
1•matt_d•1m ago•0 comments

Deadly fungal storms are sweeping US and spreading disease few doctors recognize

https://www.sciencefocus.com/planet-earth/dust-storms-us-blood-rain
1•ck2•4m ago•1 comments

"I somehow managed to import 1.8M books to calibre"

https://old.reddit.com/r/DataHoarder/comments/1tr37eb/i_somehow_managed_to_import_18m_books_to_ca...
1•r721•5m ago•0 comments

Show HN: I launched a micro-gig marketplace and used it to buy my own GTM plan

1•alanhalley•8m ago•0 comments

Auto-review mode is now available in Cursor

https://cursor.com/changelog/auto-review
1•davidgomes•9m ago•0 comments

Delayed Tensor Parallelism for Faster Transformer Inference

https://blog.kog.ai/delayed-tensor-parallelism-for-faster-transformer-inference/
1•matt_d•9m ago•0 comments

Aliens.gov

https://www.whitehouse.gov/aliens/
2•saikatsg•10m ago•1 comments

The Compose Key and –/.XCompose

https://blog.gavide.dev/blog/compose-key-linux
2•gavide•10m ago•0 comments

The Machine God Wants to Talk to You

https://twitter.com/olvrgln/status/2060419489351754049
1•OliverGilan•12m ago•0 comments

SQLite is all you need for durable workflows

https://obeli.sk/blog/sqlite-is-all-you-need-for-durable-workflows/
12•tomasol•13m ago•3 comments

A call for secure coding standards across the Canada government

https://bsky.app/profile/shehackspurple.bsky.social/post/3mmz25aplk52a
1•mooreds•13m ago•0 comments

Nvidia Twitter Post Teasing: A New Era of PC

https://twitter.com/nvidia/status/2060390710797328574
1•HeyMeco•15m ago•2 comments

Multiplayer Harness for Agents and Humans

https://thruwire.ai
1•noashavit•15m ago•0 comments

The Greatest Show on Earth: The Evidence for Evolution

https://en.wikipedia.org/wiki/The_Greatest_Show_on_Earth:_The_Evidence_for_Evolution
1•chistev•16m ago•0 comments

Tesla's AI trainers don't trust its self-driving tech – or its safety stats

https://www.reuters.com/investigations/why-teslas-ai-trainers-dont-trust-its-self-driving-tech-or...
3•grassfedgeek•17m ago•2 comments

What if remote working, not AI, is to blame for weak junior hiring?

https://www.ft.com/content/2205e2d0-50dc-4e80-9bf7-78d0272276c0
3•uxhacker•18m ago•1 comments

Google DeepMind's AlphaProof Nexus solves decades-old math problems

https://the-decoder.com/google-deepminds-alphaproof-nexus-solves-decades-old-math-problems-for-a-...
1•gmays•21m ago•0 comments

Robinhood now lets your AI agents trade stocks

https://techcrunch.com/2026/05/27/robinhood-now-lets-your-ai-agents-trade-stocks/
23•wapasta•21m ago•19 comments

Sample Music with Chrome

https://www.tabsampler.com/
1•asolis0105•22m ago•2 comments

Mosquitoes seem to be getting over insect repellent

https://www.economist.com/science-and-technology/2026/05/28/mosquitoes-seem-to-be-getting-over-in...
1•Brajeshwar•22m ago•1 comments

Windows PC Industry Reacts to MacBook Neo

https://www.macrumors.com/2026/05/29/windows-pc-industry-reacts-to-macbook-neo/
1•tosh•22m ago•1 comments

National Design Service Websites Registry

https://thedreydossier.github.io/NDS_servers_map/
1•ravenical•23m ago•0 comments

Normalized Compression Distance

https://en.wikipedia.org/wiki/Normalized_compression_distance
1•woliveirajr•23m ago•0 comments

Another tech company says it will cut jobs amid pivot to AI

https://www.latimes.com/business/story/2026-05-29/another-tech-company-says-it-will-cut-hundreds-...
1•1vuio0pswjnm7•23m ago•0 comments

Zero Evidence of AI-Related Job Losses

https://www.apollo.com/wealth/the-daily-spark/zero-evidence-of-ai-related-job-losses
2•akyuu•24m ago•0 comments

Generative Unix CTF for RL

https://vmax.ai/team/unix-ctf-procedural-environments-for-unix-competence-reinforcement-learning
1•ronald_raygun•25m ago•0 comments

Open-source security mess: IBM and Red Hat bet $5B and 20k engineers can fix it

https://www.zdnet.com/article/open-source-security-is-a-mess-ibm-and-red-hat-bet-5-billion-to-fix...
1•CrankyBear•25m ago•0 comments

AionOS – self-healing microkernel in Zig (boots on real hardware)

https://github.com/rodancz/aion
1•rodancz•25m ago•0 comments

The origin of quorum systems in distributed computing [pdf]

https://vukolic.com/QuorumsOrigin.pdf
2•fanf2•26m ago•0 comments