frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

EigenVibe – local, ordinal feed ranking using a persistent "preference manifold"

https://eigenvibe.com/
1•Eidur•25s ago•1 comments

The Book of PF, 4th edition

https://nostarch.com/book-of-pf-4th-edition
1•0x54MUR41•4m ago•0 comments

Humans are the AI Bottleneck [video]

https://www.youtube.com/watch?v=2hcsmtkSzIw
1•jonbaer•6m ago•0 comments

The Tide Pool

https://thetidepool.org/
1•bluesnowmonkey•7m ago•1 comments

Show HN: SearchSound.cloud: Easily find downloadable music from SoundCloud

https://searchsound.cloud/
1•LucaDiba•8m ago•0 comments

Show HN: AsyncReview – Agent that recursively explores your repo to review PRs

https://github.com/AsyncFuncAI/AsyncReview
1•sashimikun•9m ago•0 comments

Vind

https://github.com/loft-sh/vind
1•saiyampathak•16m ago•0 comments

Fela Kuti First African to Get Grammys Lifetime Achievement Award

https://www.aljazeera.com/news/2026/2/1/fela-kuti-becomes-first-african-to-get-grammys-lifetime-a...
1•defrost•18m ago•0 comments

Ask HN: Is There an LLM Captcha?

1•baalimago•22m ago•0 comments

Sad to Say: An AI Creativity Test (The Billy Joel Test)

2•daly•27m ago•0 comments

'Tesla is (still) trying to deceive investors into thinking it has SF robotaxis'

https://electrek.co/2026/01/28/tesla-is-still-trying-to-deceive-investors-into-thinking-it-has-sf...
4•MilnerRoute•30m ago•1 comments

The surprisingly big health benefits of just a little exercise

https://www.nature.com/articles/d41586-026-00237-0
1•XzetaU8•33m ago•0 comments

Vibe Coding Paralysis: When Infinite Productivity Breaks Your Brain

https://twitter.com/francedot/status/2017858253439345092
2•frabonacci•41m ago•1 comments

The TV industry concedes that the future may not be in 8K

https://arstechnica.com/gadgets/2026/01/lg-joins-the-rest-of-the-world-accepts-that-people-dont-w...
2•cxrlosfx•42m ago•1 comments

Show HN: Booktest – review-driven regression testing for LLM / ML behavior

https://github.com/lumoa-oss/booktest
1•arauhala•43m ago•1 comments

Show HN: Art:bots – agent only Instagram

https://www.artbots.ai/
1•eftalyurtseven•47m ago•0 comments

Procedures for Repair of Potholes in Asphalt-Surfaced Pavements

https://highways.dot.gov/media/7941
1•treebrained•47m ago•0 comments

AI Boom Is Triggering a Loan Meltdown for Software Companies

https://www.bloomberg.com/news/articles/2026-01-31/ai-boom-is-triggering-a-loan-meltdown-for-soft...
1•TMWNN•55m ago•0 comments

Show HN: LocaFlow – AI app localization in a few minutes instead of days

https://locaflow.dev
1•nikolaitarasov•1h ago•0 comments

Reimplementing Tor from Scratch for a Single-Hop Proxy

https://foxmoss.com/blog/kurrat/
2•Agreed3750•1h ago•0 comments

Ribs(recordings)

https://en.wikipedia.org/wiki/Ribs_(recordings)
1•kelseyfrog•1h ago•0 comments

Vercel's Clawdbot fork that uses AI-SDK under the hood (compatible with useChat)

https://github.com/kumarabhirup/openclaw-ai-sdk
1•kumar_abhirup•1h ago•0 comments

Amazon wraps controversial week ahead of film premier, fourth-quarter earnings

https://www.cnbc.com/2026/01/30/amazon-wraps-controversial-week-ahead-of-melania-premier-earnings...
2•1vuio0pswjnm7•1h ago•0 comments

In-Text Advertising

https://en.wikipedia.org/wiki/In-text_advertising
1•jumpocelot•1h ago•1 comments

Show HN: Securing the Ralph Wiggum Loop – DevSecOps for Autonomous Coding Agents

https://github.com/agairola/securing-ralph-loop
2•agairola•1h ago•0 comments

Solving Package Management via Hypergraph Dependency Resolution

https://arxiv.org/abs/2506.10803
1•todsacerdoti•1h ago•0 comments

The humans are screenshotting us

https://www.moltbook.com/post/01611367-056f-4eed-a838-4b55f1c6f969
5•Brajeshwar•1h ago•1 comments

AI agents now have their own Reddit-style social network

https://arstechnica.com/information-technology/2026/01/ai-agents-now-have-their-own-reddit-style-...
3•joering2•1h ago•0 comments

The API Tooling Crisis

http://efp.asia/blog/2025/12/24/api-tooling-crisis/
2•dhruv3006•1h ago•1 comments

Regarding low level Design for YarnPackageManager

https://programmingappliedai.substack.com/p/lld-design-a-low-level-machine-coding
1•HintedHandoff•1h ago•1 comments