frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•1y ago

Comments

tocs3•1y ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

AI Companies' Shared Destiny Recalls Dot-Com Bubble Memories

https://www.bullbear.ninja/board/10
1•oswarld•45s ago•0 comments

Floors, 1,240 rooms: largest Baltic Sea hotel opens – but there is a catch

https://www.euronews.com/travel/2026/06/07/13-floors-1240-rooms-largest-baltic-sea-hotel-opens-bu...
1•taubek•1m ago•0 comments

The EU tech sovereignty plan

https://hamishcampbell.com/the-eus-tech-sovereignty-plan/
3•birdculture•2m ago•0 comments

gonum

https://github.com/gonum
1•tosh•2m ago•0 comments

I built a front-end web app to replace Obsidian/Roam Research at work

https://cognir.netlify.app/user_guide.html
1•sailpvp998•7m ago•0 comments

To be smarter, try being crazier?

https://lemire.me/blog/2014/01/30/to-be-smarter-try-being-crazier/
1•jruohonen•9m ago•0 comments

15 Years of StarCraft II Balance Changes Visualized

https://github.com/stared/sc2-balance-timeline
1•stared•9m ago•0 comments

Coreutils for Windows

https://learn.microsoft.com/en-us/windows/core-utils/overview
1•nickysielicki•9m ago•0 comments

Christophe Pettus: Async I/O in PostgreSQL 19: The Year After

https://thebuild.com/blog/async-io-in-postgresql-19-the-year-after/
1•PaulHoule•10m ago•0 comments

Why Companies Are Replacing Traditional Interviews with AI Assessments

https://pressearn.it.com/blog/2026/06/02/how-ai-is-changing-job-interviews-in-2026/
1•Kaysource•10m ago•0 comments

Show HN: keep it simple. – a free focus timer for iPhone & iPad

https://apps.apple.com/us/app/keep-it-simple-focus-timer/id6766077920
1•dsemianovich•11m ago•1 comments

BoredOS: Three years of building an OS from scratch (And loving every minute)

https://blog.boreddev.nl/posts/boredos/
2•boreddevnl•11m ago•0 comments

Scientists Discover Hidden Symmetry on Earth That Nobody Can Explain

https://www.404media.co/scientists-discover-hidden-symmetry-on-earth-that-nobody-can-explain/
1•Brajeshwar•12m ago•0 comments

Robots Create more jobs than they Kill

https://julienreszka.com/blog/robots-create-more-jobs-than-they-kill/
2•julienreszka•13m ago•0 comments

Protect the Shire

https://wordpress.org/news/2026/06/pts/
1•taubek•13m ago•0 comments

A Request Becomes Memory

https://rayredington.substack.com/p/how-a-request-becomes-memory
1•Visheshrwl•14m ago•0 comments

The dark side of Japanese convenience stores

https://spectator.com/article/the-dark-side-of-japanese-convenience-stores/
1•Michelangelo11•15m ago•0 comments

"Terrorists?": The Suffragette Arson and Bombing Campaign – Egham Museum

https://eghammuseum.org/terrorists-the-suffragette-arson-and-bombing-campaign/
1•lifeisstillgood•19m ago•0 comments

Show HN: LimitPing – Keep Claude Code and Codex rate-limit windows continuous

https://github.com/wavever/CCLimitPing
1•wavever•21m ago•0 comments

What is the value of releasing software that leaves people unemployed?

2•rondaerth92•22m ago•0 comments

netcat

https://en.wikipedia.org/wiki/Netcat
1•tosh•22m ago•0 comments

Cave of Forgotten Dreams

https://charlesleifer.com/blog/cave-of-forgotten-dreams/
1•cleifer•25m ago•0 comments

Generation is cheap, the decisions are the artifact

https://noemica.io/blog/generation-is-cheap
1•SebastianSosa•28m ago•0 comments

The Wearable Showdown: OURA Ring 5 vs. Fitbit Air vs. Whoop MG vs. Apple Watch

https://www.wsj.com/tech/personal-tech/oura-ring-fitbit-air-whoop-apple-sleep-wearables-99783661
1•odig•31m ago•0 comments

Google's Unique Approach to Getting Data Centers Built

https://www.wsj.com/tech/ai/googles-unique-approach-to-getting-data-centers-built-2cfae652
2•odig•31m ago•0 comments

Taxation with Representation- How Communities/Coops Turn Spending into Ownership

https://cahootzcoops.com/blog/taxation-with-representation-how-communities-and-co-ops-turn-spendi...
2•DeonRob•36m ago•0 comments

The Origin of Lorem Ipsum

https://www.youtube.com/watch?v=kL1PDqzqhM4
1•jofzar•36m ago•1 comments

Anthropic, please ship an official Claude Desktop for Linux

https://github.com/anthropics/claude-code/issues/65697
37•predkambrij•41m ago•21 comments

Show HN: I made a better zsh autosuggestion tool that predicts your next command

https://github.com/Giammarco-Ferranti/deja
3•giammiferr•42m ago•1 comments

Polymarket Annotation Injection

https://sam.elborai.me/articles/polymarket-prompt-injection/
1•dgellow•49m ago•0 comments