frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•10mo ago

Comments

tocs3•10mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Cppreference.com Update

https://isocpp.org/blog/2026/04/announcement-cppreference.com-update
1•csmantle•41s ago•0 comments

Show HN: The all-in-one workspace for all your projects

https://app.thinky.so
1•almeidarruben•1m ago•0 comments

What Features Did X and xAI Add in 2025?

https://twitter.com/nima_owji/status/1998807709987262928
1•nomilk•1m ago•0 comments

Ask HN: How to program with IDE and LLM on CPU locally?

1•roschdal•1m ago•0 comments

Show HN: Witchcraft and Pickbrain – fast multi-vector semantic search in Rust

https://github.com/dropbox/witchcraft
1•jacobgorm•1m ago•0 comments

Which AI Image Generator Has the Best Character Consistency

https://techstackups.com/comparisons/comparing-image-generators-character-consistency/
1•ritzaco•1m ago•0 comments

Issue SAFEs for free with built-in e-signatures and YC templates

https://withmantle.com/safes
1•sabenamantle•2m ago•0 comments

FSF clarifies its stance on AGPLv3 additional terms

https://lwn.net/Articles/1067771/
1•Brajeshwar•3m ago•0 comments

Designing a Cellular Smartwatch to Replace My Phone

https://www.rist.watch/blog/designing-a-cellular-smartwatch-to-replace-my-phone
1•a_dugan•4m ago•0 comments

IPv4 vs. IPv6 FAQ

https://tailscale.com/docs/reference/faq/ipv6
1•olalonde•5m ago•0 comments

QUIC will soon be as important as TCP – but it's vastly different

https://www.theregister.com/2026/04/16/quic_explained/
1•Velocifyer•10m ago•0 comments

Deathonomics 2.0: Why the System Is Starting to Stall

https://ridl.io/deathonomics-2-0-why-the-system-is-starting-to-stall/
1•Anon84•10m ago•0 comments

Adam Back Pushes for Optional Upgrades to Quantum-Proof Bitcoin

https://decrypt.co/364562/adam-back-pushes-for-optional-upgrades-to-quantum-proof-bitcoin
1•wslh•11m ago•0 comments

AnywhereHired, a remote job board for visa sponsorship and junior-friendly roles

https://anywherehired.com/
1•Xez13•13m ago•1 comments

An album of π(90) songs about a sublinear prime number sieve

https://www.youtube.com/playlist?list=PL87nT6ft09vG6rtDkF_oC1vWW2pfBWAaf
1•oluckyman•13m ago•0 comments

Postgres: One Database to Rule Them All

https://marmotdata.io/blog/postgres-one-database-to-rule-them-all/
1•charlie-haley•13m ago•1 comments

A Guide to Reducing Cognitive Load

https://www.softwaredesign.ing/blog/a-guide-to-reducing-cognitive-load
2•prakhar897•14m ago•0 comments

We deployed a global round‑robin cluster across 3 continents

https://spooksystems.net/
1•danieljameslee•14m ago•1 comments

Show HN: 48 absurd web projects – one every month

1•absurdwebsite•14m ago•2 comments

A proxy routing all webtraffic through Qwen, removing all enshittified crap

https://geohot.github.io//blog/jekyll/update/2026/04/15/zappa-mitmproxy.html
2•MontyCarloHall•15m ago•0 comments

Journalists champion Wayback Machine after news publishers limit archiving

https://www.niemanlab.org/2026/04/journalists-champion-wayback-machine-after-news-publishers-limi...
2•giuliomagnifico•16m ago•0 comments

Context Is King

https://heyrtp.substack.com/p/context-is-king
1•heyheyheyhi•16m ago•0 comments

Show HN: Formal – LLM-driven property checker backed by Lean 4 and Mathlib

https://github.com/yamafaktory/formal
1•yamafaktory•18m ago•0 comments

Cal AI has been removed from the App Store

https://twitter.com/seraleev/status/2044528553488716040
2•wahnfrieden•19m ago•0 comments

Critical Breaking Change in Microsoft.AspNetCore.DataProtection

https://github.com/dotnet/aspnetcore/issues/66335
1•bjarteaarmolund•20m ago•1 comments

Killing Slack Was the Only Way to Make AI Accurate

https://promptql.io/blog/killing-slack-was-the-only-way-to-make-ai-accurate
1•rajoshighosh•20m ago•0 comments

Show HN: Ordia – standup from GitHub and Jira, no human input

1•asahi014•23m ago•0 comments

Target Policy Optimization

https://arxiv.org/abs/2604.06159
1•t55•25m ago•0 comments

I built a native compiler in 2.5 years – and said nothing

https://mica-dev.com/posts/breaking-the-silence/
1•mikepet•26m ago•1 comments

60% of Australian children still using social media despite ban for under 16s

https://mollyrosefoundation.org/more-than-60-of-australian-children-still-using-social-media-desp...
2•speckx•27m ago•1 comments