frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•10mo ago

Comments

tocs3•10mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Let Claude use your computer from the CLI

https://code.claude.com/docs/en/computer-use
1•taspeotis•39s ago•0 comments

Using complex polynomials to approximate arbitrary continuous functions (2025)

https://www.lesswrong.com/posts/9gNewBQCF47FyjYfw/using-complex-polynomials-to-approximate-arbitrary
1•measurablefunc•3m ago•0 comments

Explore Benjamin Franklin's Science on NotebookLM

https://blog.google/company-news/outreach-and-initiatives/arts-culture/benjamin-franklin-notebooklm/
2•y1n0•4m ago•0 comments

From static findings to runtime exploits: testing 6 popular MCP servers

https://agentseal.org/blog
1•Resham_Joshi•5m ago•0 comments

Retraining my terrible typing habits

https://technicallychallenged.substack.com/p/retraining-my-terrible-typing-habits
1•koinedad•9m ago•0 comments

Show HN: Gives your AI agents a shared, searchable, persistent memory – locally

https://github.com/vbfs/agent-memory-store/
1•vbfs•11m ago•0 comments

Research game measuring how humans detect AI-generated phishing emails

https://github.com/scottalt/ai-email-threat-research
1•serious_angel•11m ago•1 comments

Incident March 30th, 2026 – Accidental CDN Caching

https://blog.railway.com/p/incident-report-march-30-2026-accidental-cdn-caching
4•cebert•12m ago•0 comments

Adobe Illustrator can now use AI to rotate 2D vectors in 3D space

https://9to5mac.com/2026/03/30/adobe-illustrator-now-lets-you-rotate-2d-vectors-in-3d-space/
1•bundie•14m ago•0 comments

Universal Claude.md – cut Claude output tokens by 63%

https://github.com/drona23/claude-token-efficient
6•killme2008•16m ago•0 comments

Parsing a Chinese Poem as a Formal System That Runs

https://jimiwen.substack.com/p/si-wu-zi-4d7
1•jimiwen•19m ago•0 comments

Don't overthink electric car charging (we should be doing it differently)

https://www.youtube.com/watch?v=5NG4hycq8n0
1•em-bee•20m ago•0 comments

Six cloned horses help rider win prestigious polo match (2016)

https://www.science.org/content/article/six-cloned-horses-help-rider-win-prestigious-polo-match
1•pinkmuffinere•21m ago•1 comments

What I Talk About When I Talk About Grading

https://unintendedconsequenc.es/what-i-talk-about-when-i-talk-about-grading/
1•paulorlando•25m ago•0 comments

Maybe Finance Asset Sale

https://maybefinance.notion.site/asset-sale
1•raybb•28m ago•0 comments

Small ways the App Store could be improved for developers

https://lapcatsoftware.com/articles/2026/3/13.html
2•walterbell•29m ago•0 comments

Show HN: Cut your tail latencies by 74% with zero config

https://pkg.go.dev/github.com/bhope/hedge
2•soniccontroller•29m ago•0 comments

Rust's next-generation trait solver

https://lwn.net/SubscriberLink/1063124/81483612b1c8a493/
1•dabinat•31m ago•0 comments

Ask HN: How do you maintain technical deep-focus in a world of Slack/Teams

1•lion__93332•32m ago•0 comments

A Man and the Elevator

https://joseantunes.tech/life/2026/03/22/the-elevator.html
1•zemike•33m ago•1 comments

Whispr Flow – Vision Flow

https://github.com/tanayvin1216/VisionFlow
1•tanay_vin•35m ago•1 comments

GTabs – AI tab organizer for Chrome that works with any LLM

https://github.com/vaddisrinivas/gtabs
1•srinivasvaddi•36m ago•0 comments

Yes, you need a Mac Mini

https://hyperengineering.bottlenecklabs.com/p/yes-you-actually-need-a-mac-mini
2•Areibman•36m ago•0 comments

Ask HN: Is anyone still resisting the slop onslaught?

3•0xDEFACED•36m ago•2 comments

The AI Shift: Will software engineers survive agentic AI?

https://www.ft.com/content/7325e967-5f4e-40b1-af3f-7d2351781843
2•mooreds•40m ago•0 comments

Think tank collaborate and earn bata testing

https://solve-hive-pro.base44.app
1•wesley-Alan•42m ago•0 comments

Ask HN: After three years of open source software, I can't stand it

https://github.com/drl990114/MarkFlowy
3•drl5•48m ago•0 comments

Adding custom webhooks to my Samsung smart ring

https://github.com/TheVellichor/SamsungOpenRing
1•_vellichor•56m ago•1 comments

Proposal for adding a useful pipe operator to JavaScript

https://github.com/tc39/proposal-pipeline-operator
1•jcbhmr•57m ago•0 comments

Review: The Wireless Cookbook

https://www.helpnetsecurity.com/2025/10/28/review-the-wireless-cookbook/
1•teleforce•58m ago•0 comments