The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised by the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
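For anyone who wants to see what "check both the final answers and the reasoning steps" could look like in practice, here is a minimal sketch, assuming a small 4-connected grid maze. The function names and the `("create" | "close", node, cost)` trace format are my own illustration, not the paper's actual data format. The idea is that the plan and the trace get validated by two separate checks, so a correct plan can sit next to a broken trace.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """A* on a 4-connected grid (0 = free, 1 = wall).

    Returns (path, trace). The trace is a list of hypothetical
    ("create" | "close", node, cost) records, one per search step.
    """
    def h(p):
        # Manhattan distance: admissible and consistent for unit-cost grid moves.
        return abs(p[0] - goal[0]) + abs(p[1] - goal[1])

    frontier = [(h(start), 0, start)]
    parent = {start: None}
    cost = {start: 0}
    closed = set()
    trace = []

    while frontier:
        _, g, node = heapq.heappop(frontier)
        if node in closed:
            continue  # stale heap entry
        closed.add(node)
        trace.append(("close", node, g))
        if node == goal:
            path = []
            while node is not None:
                path.append(node)
                node = parent[node]
            return path[::-1], trace
        r, c = node
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue
            if grid[nr][nc] == 1:
                continue
            if nxt not in cost or g + 1 < cost[nxt]:
                cost[nxt] = g + 1
                parent[nxt] = node
                heapq.heappush(frontier, (g + 1 + h(nxt), g + 1, nxt))
                trace.append(("create", nxt, g + 1))
    return None, trace


def plan_is_valid(grid, start, goal, path):
    """Check the final answer only: a wall-free chain of unit steps from start to goal."""
    if not path or path[0] != start or path[-1] != goal:
        return False
    if any(grid[r][c] == 1 for r, c in path):
        return False
    return all(abs(r1 - r2) + abs(c1 - c2) == 1
               for (r1, c1), (r2, c2) in zip(path, path[1:]))


def trace_is_consistent(trace, start):
    """Check the trace only: a node may be closed only after it was created."""
    created = {start}
    for op, node, _ in trace:
        if op == "create":
            created.add(node)
        elif node not in created:
            return False
    return True
```

With these two checks, you can pair a valid plan with a scrambled trace (for example `trace[::-1]`) and see that `plan_is_valid` still passes while `trace_is_consistent` fails. That is roughly the kind of mismatch the summary is talking about, though the paper's own validators are presumably stricter than this toy version.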
