frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•9mo ago

Comments

tocs3•9mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

GGML GGUF File Format Vulnerabilities

https://www.databricks.com/blog/ggml-gguf-file-format-vulnerabilities
1•eatonphil•11s ago•0 comments

Answer to a simple question made me rethink my trust in AI

https://chatgpt.com/c/69922ba7-df38-832d-ad54-d062784c9020
1•johnnyApplePRNG•34s ago•0 comments

Compound Engineering: The AI-native engineering philosophy

https://every.to/guides/compound-engineering
1•Karrot_Kream•2m ago•0 comments

The T Project

https://mumble.net/~jar/tproject/
1•shakna•4m ago•0 comments

Voith Schneider Propeller

https://en.wikipedia.org/wiki/Voith_Schneider_Propeller
1•Luc•4m ago•0 comments

The Four Burner Theory

https://twitter.com/thecurioustales/status/2023042136686682141
1•dsego•6m ago•0 comments

Show HN: Clue (Cluedo) Solver/Assistant

https://github.com/dmd/clue-assistant
1•dmd•7m ago•0 comments

Multi-Agent Teams Hold Experts Back

https://www.arxiv.org/pdf/2602.01011
1•fauigerzigerk•8m ago•0 comments

PicoClaw: Ultra-Efficient AI Assistant in Go

https://github.com/sipeed/picoclaw
1•redbell•9m ago•0 comments

Speed Can Reindustrialize America

https://www.austinvernon.site/blog/manufacturing.html
1•mfiguiere•10m ago•0 comments

Show HN: ContextLedger – CLI to track and handoff context b/w AI coding sessions

https://github.com/manthan787/context-ledger
1•EmTekker•10m ago•1 comments

Possible identification of the Luna 9 Moon landing site using machine learning

https://www.nature.com/articles/s44453-025-00020-x
1•marcodiego•12m ago•0 comments

New and Upcoming IRCv3 Features

https://libera.chat/news/new-and-upcoming-features-3
1•iamnothere•12m ago•0 comments

Karma Engineering

https://aimlbling-about.ninerealmlabs.com/blog/karma-engineering/
1•namnnumbr•14m ago•1 comments

With Apple: Fortify your app: Essential strategies to strengthen security

https://developer.apple.com/events/view/TUHA23T82K/dashboard
9•pjmlp•16m ago•0 comments

AI analysis for UK Parliament bills

https://ukparliament.vercel.app/
1•ArisC•18m ago•3 comments

iPhotron 4.10 Is Released

https://github.com/OliverZhaohaibin/iPhotron-LocalPhotoAlbumManager/releases/tag/v4.1.0
1•main-protect•20m ago•0 comments

Court orders Acer and Asus to stop selling PCs in Germany over H.265 patents

https://videocardz.com/newz/acer-and-asus-are-now-banned-from-selling-pcs-and-laptops-in-germany-...
2•ledoge•20m ago•0 comments

The Prompt of Babel

https://joemclean.github.io/writing/the-prompt-of-babel.html
1•jjjjjjjjoe•22m ago•3 comments

How Can Something Fall Faster Than Gravity? [video]

https://www.youtube.com/watch?v=dosAbCCKXLs
1•zahlman•23m ago•0 comments

Top AI SDR tools analysis

https://revenuesystemslab.substack.com/p/ai-sdr-tools
1•Atbech•24m ago•0 comments

Pentagon threatens to cut off Anthropic in AI safeguards dispute

https://www.reuters.com/technology/pentagon-threatens-cut-off-anthropic-ai-safeguards-dispute-axi...
1•MKais•25m ago•0 comments

Baseband, Bessel and Beyond

https://www.youtube.com/watch?v=0GjWRQMFVA8
1•michh•25m ago•0 comments

Addicted to your phone? Try "bricking" it

https://economist.com/culture/2026/02/15/addicted-to-your-phone-try-bricking-it
1•andsoitis•26m ago•0 comments

Codeberg is why developers are broke

https://sharemygit.com/
2•onesandofgrain•34m ago•1 comments

Show HN: Claude-relais – A plan/build/judge loop mixing Claude with Cursor

https://github.com/clementrog/claude-relais
1•crog•34m ago•0 comments

Can agentic coding raise the quality bar?

https://lpalmieri.com/posts/agentic-coding-raises-quality/
2•LukeMathWalker•35m ago•1 comments

Learning Kubernetes with the official docs and NotebookLM

https://randomwrites.com/
1•mutahirs•35m ago•0 comments

List of Sports Clichés

https://en.wikipedia.org/wiki/List_of_sports_clich%C3%A9s
1•carlos-menezes•36m ago•0 comments

State Attorneys General Want to Tie Online Access to ID

https://reclaimthenet.org/40-attorneys-general-back-ids-online-safety-act
31•computerliker•37m ago•17 comments