The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•1y ago

Comments

tocs3•1y ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps, and surprisingly, they still perform about the same as, and sometimes even better than, those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
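For readers who want something concrete, here is a toy sketch of what "checking both the final answers and the reasoning steps" against an A* solver could look like. It is not the paper's setup: the grid world, the trace format (`create`/`close` records), and the function names `astar_with_trace`, `plan_is_valid`, and `trace_is_valid` are all assumptions made up for illustration. The point is only that plan validity and trace validity are two separate checks, so one can pass while the other fails.

```python
import heapq

def astar_with_trace(grid, start, goal):
    """A* on a 4-connected grid ('#' = wall). Returns (path, trace).

    The trace is a list of (op, node, g, h) records, a hypothetical
    stand-in for the intermediate tokens a model would be trained on.
    """
    def h(p):
        # Manhattan distance: admissible and consistent for unit-cost moves.
        return abs(p[0] - goal[0]) + abs(p[1] - goal[1])

    rows, cols = len(grid), len(grid[0])
    frontier = [(h(start), 0, start)]
    parent = {start: None}
    best_g = {start: 0}
    closed = set()
    trace = [("create", start, 0, h(start))]

    while frontier:
        _, g, node = heapq.heappop(frontier)
        if node in closed:
            continue  # stale queue entry
        closed.add(node)
        trace.append(("close", node, g, h(node)))
        if node == goal:
            path = []
            while node is not None:
                path.append(node)
                node = parent[node]
            return path[::-1], trace
        r, c = node
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if 0 <= nr < rows and 0 <= nc < cols and grid[nr][nc] != "#":
                ng = g + 1
                if ng < best_g.get(nxt, float("inf")):
                    best_g[nxt] = ng
                    parent[nxt] = node
                    heapq.heappush(frontier, (ng + h(nxt), ng, nxt))
                    trace.append(("create", nxt, ng, h(nxt)))
    return None, trace

def plan_is_valid(grid, path, start, goal):
    """Final-answer check: the path goes start -> goal in legal unit steps."""
    if not path or path[0] != start or path[-1] != goal:
        return False
    if any(grid[r][c] == "#" for r, c in path):
        return False
    return all(abs(r1 - r2) + abs(c1 - c2) == 1
               for (r1, c1), (r2, c2) in zip(path, path[1:]))

def trace_is_valid(trace):
    """Reasoning-step check: a node may only be closed after it was created
    with the same cost. Deliberately simplistic."""
    created = set()
    for op, node, g, _ in trace:
        if op == "create":
            created.add((node, g))
        elif (node, g) not in created:
            return False
    return True

grid = ["..#",
        "...",
        "#.."]
path, trace = astar_with_trace(grid, (0, 0), (2, 2))
print(plan_is_valid(grid, path, (0, 0), (2, 2)), trace_is_valid(trace))
# A scrambled trace can sit next to a perfectly good plan:
print(plan_is_valid(grid, path, (0, 0), (2, 2)), trace_is_valid(trace[::-1]))
```

In this toy, reversing the trace breaks the trace check while the plan itself is unchanged, which is the kind of mismatch between "correct answer" and "correct steps" the summary describes.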
