The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•11mo ago

Comments

tocs3•11mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
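For anyone who wants something concrete, here is a minimal sketch of what "a trace plus a final answer from A*" could look like, and why the two can be checked separately. Everything in it is an assumption made for illustration: the grid encoding, the function names, and the loose `trace_is_plausible` check are hypothetical and are not taken from the paper, whose actual problem and trace formats are not described in this thread.

```python
import heapq


def open_cell(grid, r, c):
    """True if (r, c) is inside the grid and not a wall ('#')."""
    return 0 <= r < len(grid) and 0 <= c < len(grid[0]) and grid[r][c] != "#"


def neighbors(r, c):
    return ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1))


def astar_with_trace(grid, start, goal):
    """A* on a 4-connected grid. Returns (path, trace).

    The trace is the order in which cells were expanded; it stands in
    for the "intermediate tokens" (hypothetical format).
    """
    def h(cell):
        # Manhattan distance, admissible for unit-cost 4-connected moves.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    frontier = [(h(start), 0, start)]
    parent = {start: None}
    best_g = {start: 0}
    trace = []
    while frontier:
        _, g, cell = heapq.heappop(frontier)
        if g > best_g[cell]:
            continue  # stale queue entry
        trace.append(cell)
        if cell == goal:
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1], trace
        for nxt in neighbors(*cell):
            if open_cell(grid, *nxt) and g + 1 < best_g.get(nxt, float("inf")):
                best_g[nxt] = g + 1
                parent[nxt] = cell
                heapq.heappush(frontier, (g + 1 + h(nxt), g + 1, nxt))
    return None, trace


def path_is_valid(grid, start, goal, path):
    """Check the final answer on its own, ignoring any trace."""
    if not path or path[0] != start or path[-1] != goal:
        return False
    if not all(open_cell(grid, r, c) for r, c in path):
        return False
    return all(abs(r1 - r2) + abs(c1 - c2) == 1
               for (r1, c1), (r2, c2) in zip(path, path[1:]))


def trace_is_plausible(grid, start, trace):
    """Loose check: the trace starts at `start`, and every later cell is
    open and adjacent to some cell expanded earlier."""
    if not trace or trace[0] != start:
        return False
    seen = {start}
    for r, c in trace[1:]:
        if not open_cell(grid, r, c):
            return False
        if not any(n in seen for n in neighbors(r, c)):
            return False
        seen.add((r, c))
    return True


# Two small made-up problems.
grid_a = ["..#", "...", "#.."]
grid_b = ["...", ".#.", "..."]
path_a, trace_a = astar_with_trace(grid_a, (0, 0), (2, 2))
path_b, trace_b = astar_with_trace(grid_b, (2, 0), (0, 2))

# Pair problem A's correct answer with problem B's trace.
swapped_example = {"trace": trace_b, "answer": path_a}
```

In `swapped_example` the answer still passes `path_is_valid` for problem A, while the trace fails `trace_is_plausible` for it. That is roughly the kind of mismatched pairing the summary describes, under the assumptions above.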
