frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•7mo ago

Comments

tocs3•7mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Show HN: Xweather Live – real-time global weather maps rendered with WebGL

https://live.xweather.com/
1•unstyledcontent•23s ago•0 comments

Diffray – Open-source multi-agent code review CLI

https://github.com/diffray/diffray
1•i_strelov•3m ago•0 comments

Three LLMs in a Trenchcoat

https://buildsharerepeat.substack.com/p/three-llms-in-a-trenchcoat
2•benmann•4m ago•0 comments

The collapse of "Human Signal" on the web

https://agoranet.substack.com/p/the-collapse-of-human-signal
1•kisamoto•4m ago•2 comments

Show HN: Native app to scaffold and build Cursor-ready Next.js projects

https://vibecodingstarterkit.io
1•dpitkevics•5m ago•1 comments

Show HN: Aurora – open-source cross-platform music player (lossless)

https://github.com/bbbneo333/aurora/releases/tag/v1.0.0
1•bbbneo333•5m ago•0 comments

Making AI helpful for everyone, including the planet

https://sustainability.google
1•frizlab•5m ago•0 comments

Show HN: RAGGuard – Permission-aware retrieval for RAG applications

1•maximus242•6m ago•0 comments

Ask HN: Browser Use, Skyvern or Other for Automating Directory Submission

1•onescales•10m ago•0 comments

Leaving the Matrix

https://raccoonland.us/posts/leaving-the-matrix/
2•edent•11m ago•0 comments

Show HN: Haraltd – A cross-platform Bluetooth daemon with a JSON-based RPC

https://github.com/bluetuith-org/haraltd
2•darkhz•13m ago•0 comments

Show HN: Talkolia – An AI chatbot that understands your website

https://www.talkolia.co/
1•kokau•13m ago•0 comments

The J Incunabulum

https://tony-zorman.com/posts/j-incunabulum.html
1•fanf2•14m ago•0 comments

Ask HN: How do you use AI tools when learning unfamiliar code?

1•Rperry2174•15m ago•1 comments

The UK is shaping a future of Precrime and dissent management

https://freedomnews.org.uk/2025/04/11/how-the-uk-is-shaping-a-future-of-precrime-and-dissent-mana...
3•robtherobber•17m ago•0 comments

Fundamental skills and knowledge you must have in 2026 for SWE

https://www.youtube.com/watch?v=Jr2auYrBDA4
1•ghuntley•20m ago•0 comments

The novelists who predicted our present

https://www.theguardian.com/books/2026/jan/10/mass-surveillance-the-metaverse-making-america-grea...
2•bookofjoe•20m ago•0 comments

Same-sex sexual behavior observed in dozens of primate species

https://www.nbcnews.com/science/science-news/primates-same-sex-sexual-behavior-evolution-rcna252693
2•jackmalpo•20m ago•0 comments

What is <input type="text">?

https://twitter.com/wycats/status/1376984460088934400
1•TheAceOfHearts•22m ago•0 comments

Alternative for Microsoft Lens

2•tritiy•26m ago•0 comments

Autotunnel – K8s On-Demand Port Forwarder

https://github.com/atas/autotunnel
1•mesto1•27m ago•0 comments

Show HN: SnackBase – Open-source, GxP-compliant back end for Python teams

https://snackbase.dev
1•lalitgehani•28m ago•0 comments

Lack of isolation in agentic browsers resurfaces old vulnerabilities

https://blog.trailofbits.com/2026/01/13/lack-of-isolation-in-agentic-browsers-resurfaces-old-vuln...
2•ingve•29m ago•0 comments

Making post-publication code checks a first-class research artifact

https://www.arxiv.org/pdf/2601.07189
1•nkko•29m ago•1 comments

Your Dog Might Be Eavesdropping on You

https://www.scientificamerican.com/article/some-dogs-learn-new-words-just-like-toddlers-do/
3•sohkamyung•37m ago•0 comments

Novel AI Method Sharpens 3D X-ray Vision

https://www.bnl.gov/newsroom/news.php?a=222627
2•cl3misch•39m ago•0 comments

Show HN: Where do my taxes go in Berlin? A personal receipt generator

https://berlin-bill.eamag.me
3•eamag•40m ago•0 comments

39C3 – Cracking Open What Makes Apple's Low-Latency WiFi So Fast [video]

https://media.ccc.de/v/39c3-cracking-open-what-makes-apple-s-low-latency-wifi-so-fast
2•amanverasia•41m ago•0 comments

Tik-Tok (Novel)

https://en.wikipedia.org/wiki/Tik-Tok_(novel)
3•firebaze•42m ago•1 comments

Pentagon is embracing Grok AI chatbot as it draws global outcry

https://apnews.com/article/artificial-intelligence-pentagon-hegseth-musk-7f99e5f32ec70d7e39cec92d...
2•geox•45m ago•1 comments