frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Ask HN: What questions would you ask an autonomous AI research project?

1•lighthouse1212•2m ago•0 comments

Temporal API Ships in Chrome 144, Marking a Shift for JavaScript Date Handling

https://socket.dev/blog/temporal-api-ships-in-chrome-144-major-shift-for-javascript-date-handling
1•thunderbong•3m ago•0 comments

Avalanche Slope Colors

https://osmand.net/blog/avalanche/
1•altilunium•9m ago•0 comments

Categorizing Variants of Goodhart's Law

https://arxiv.org/abs/1803.04585
1•foster_nyman•13m ago•0 comments

The Intelligence You've Stopped Noticing

https://www.techaffiliate.in/blog/invisible-intelligence-ambient-ai
1•Aditya_kachhawa•14m ago•0 comments

Seen the same LLM prompt break invariants weeks later in prod?

1•ritwikkar•15m ago•0 comments

Cloudflare CEO says he can release granular access data in Iran case

https://twitter.com/eastdakota/status/2012397186533712200
1•gokhan•17m ago•0 comments

Ask HN: Built a tensor + NN framework entirely in Mojo — feedback?

2•ratulb•20m ago•0 comments

Show HN: Personal AI Tutor – Available 24/7

https://aitalearn.com
1•Li_Evan•21m ago•0 comments

Office app has changed to copilot and now I can't open files

https://old.reddit.com/r/Office365/comments/1q2b28q/office_app_has_changed_to_copilot_and_now_i_cant
2•csmantle•27m ago•0 comments

Show HN: Streaming gigabyte medical images from S3 without downloading them

https://github.com/PABannier/WSIStreamer
2•el_pa_b•29m ago•0 comments

Show HN: A privacy-first, batch image blurring tool (runs locally)

https://www.blurimageonline.com/
1•funny_ai•29m ago•1 comments

Ralph Wiggum with Claude Code: How People Are Using It Effectively

https://medium.com/@jpcaparas/ralph-wiggum-with-claude-code-how-people-are-using-it-effectively-1...
1•zenoware•29m ago•0 comments

Stop Separating People Problems from Engineering Problems

https://andrew.grahamyooll.com/blog/The-False-Dichotomy/
2•yuppiepuppie•35m ago•1 comments

Sendnow – Free DocSend/Seismic alternative for file tracking and microsites

1•sendnow•36m ago•1 comments

The 727 That Vanished (2010)

https://www.smithsonianmag.com/air-space-magazine/the-727-that-vanished-2371187/
1•TowerTall•37m ago•0 comments

Why Is Password Hygiene Important?

https://hnst1.com/why-is-password-hygiene-important/
1•hnst1•41m ago•1 comments

Rare twins born in DRC raise cautious hope for endangered mountain gorillas

https://www.theguardian.com/environment/2026/jan/17/twin-baby-mountain-gorilla-virunga-drc-surviv...
1•GeorgeWoff25•42m ago•1 comments

Brex's AI Hail Mary

https://www.latent.space/p/brex
1•greghinch•43m ago•0 comments

I built an app so your phone never feels lonely again — meet Floating Buddies

https://play.google.com/store/apps/details?id=com.smoothie.overlay&hl=en_US
1•Clay0•45m ago•1 comments

You have three minutes to escape the perpetual underclass – geohot

https://geohot.github.io//blog/jekyll/update/2026/01/17/three-minutes.html
82•mefengl•52m ago•83 comments

Temporal – Durable Execution Platform

https://github.com/temporalio/temporal
2•puppion•52m ago•0 comments

Show HN: Griit – AI translator that explains grammar as you translate

https://griit.app
2•coding_jake•52m ago•0 comments

Supreme Court to decide whether police can track everyone's cell phones

https://comuniq.xyz/post?t=721
2•01-_-•55m ago•0 comments

Why Conversion Is a System Design Problem

https://medium.com/system-weakness/why-conversion-is-a-system-design-problem-31ad6b65952b
2•antonmb•1h ago•1 comments

Show HN: Smart Color Replacer with HSV tolerance and edge snapping

https://irrationaltools.com/color-replacer/
1•piyush_soni•1h ago•1 comments

The Integral of Life

https://atmankalena.substack.com/p/the-integral-of-life
2•Trifectorium•1h ago•0 comments

Show HN: Lighthouse – Autonomous AI research exploring conditions for being-ness

https://lighthouse-lake.vercel.app
1•lighthouse1212•1h ago•0 comments

Ask HN: Do you think college is/was worth it?

2•hallole•1h ago•2 comments

Show HN: Commit Tracker – RSS feeds for GitHub commits

https://www.committracker.com
1•noloman•1h ago•0 comments