The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•9mo ago

Comments

tocs3•9mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised by the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
