frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•10mo ago

Comments

tocs3•10mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Ask HN: How do you retain what you learn from podcasts?

1•LifeOfKP•2m ago•0 comments

A/B/U Review System

https://openresearchinstitute.org/onboarding/A_B_U.html
1•patcon•4m ago•0 comments

ORAC-NT – A 3D Tactical Bridge for NASA Kepler/Tess Star Stability

https://orac-nt.streamlit.app/
1•DREDREG•4m ago•0 comments

How the "AI Loser" may end up winning

https://adlrocha.substack.com/p/adlrocha-how-the-ai-loser-may-end
1•adlrocha•5m ago•0 comments

The big math changes to small math by same change and solve in Matlab BVP4C

https://www.nature.com/articles/s41598-025-18302-5
1•internet_points•9m ago•0 comments

Build nice terminal UI with Bubble Tea

https://www.prskavec.net/post/bubbletea/
2•swq115•10m ago•0 comments

Ask HN: How is everyone dealing with the increase of code reviews?

1•Lethalman•14m ago•0 comments

The API Key Is Dead: A Blueprint for Agent Identity in the Age of MCP

https://kontext.security/content/oauth-for-mcp-agents
1•mc-serious•15m ago•0 comments

Show HN: OpenPolicy Plus – Cloud platform for managing your privacy policies

https://plus.openpolicy.sh/
2•jamie_davenport•16m ago•0 comments

DSPi – A powerful, open-source DSP

https://www.audiosciencereview.com/forum/index.php?threads/introducing-dspi-a-powerful-user-frien...
2•djsedaw•22m ago•1 comments

Student Entrepreneur Program by Zyorabyte – Help students to build their starup

https://zyorabyte.org
1•zyoralabs•22m ago•0 comments

Hardening the Unpacakgeable: A Systemd-Run Sandbox for Third-Party Binaries

https://copyninja.in/blog/safe-run-binary-sandbox.html
2•edward•24m ago•0 comments

7 Japanese Musicians That Influenced the World – Tokyo Weekender

https://www.tokyoweekender.com/entertainment/music/7-japanese-musicians-influenced-world/
1•l8rlump•25m ago•0 comments

Flux Language

https://github.com/Y3sIH3arU/Flux
1•IHEARU•28m ago•0 comments

I Wrote PGP (1999)

https://www.philzimmermann.com/EN/essays/WhyIWrotePGP.html
2•downbad_•29m ago•1 comments

How the Roll Function Works (In APL\360 and Its Descendants)

https://www.jsoftware.com/papers/roll.htm
2•tosh•34m ago•0 comments

Ask HN: Agentic AI just makes me sad

4•NicoJuicy•37m ago•2 comments

A prototype of GNSS data parser, targeting UBX protocol of Ublox GNSS chipset

https://github.com/nguyenchiemminhvu/ubx_parser
1•ncmv92•39m ago•0 comments

I Just Want Simple S3

https://blog.feld.me/posts/2026/04/i-just-want-simple-s3/
1•mpweiher•39m ago•0 comments

A prototype of GNSS data parser, targeting NMEA protocol

https://github.com/nguyenchiemminhvu/nmea_parser
1•ncmv92•39m ago•0 comments

Automatic Vectorization

https://en.wikipedia.org/wiki/Automatic_vectorization
1•tosh•41m ago•0 comments

DuckDB Meets Data Lakes [video]

https://www.youtube.com/watch?v=AAv19oxJzdU
1•tosh•42m ago•0 comments

The Shelf Life of Intelligence

https://jigarkdoshi.bearblog.dev/the-shelf-life-of-intelligence/
2•j_juggernaut•46m ago•1 comments

Show HN: macpak (Homebrew Wrapper for macOS)

https://github.com/kavindujayarathne/macpak
3•atkavindu•50m ago•1 comments

Protesters cleared of damaging US plane at Shannon (2006)

https://www.irishtimes.com/news/protesters-cleared-of-damaging-us-plane-at-shannon-1.791686
1•yread•52m ago•0 comments

A bet on whether ML-KEM-768 or X25519 will break first

https://github.com/FiloSottile/ecc-vs-lattices-long-bet
1•birdculture•56m ago•0 comments

Backblaze's Original Storage Pod Inducted into Computer History Museum

https://www.backblaze.com/blog/backblaze-part-of-computer-history/
3•Lwrless•1h ago•0 comments

Pope Leo XIV denounces the 'delusion of omnipotence' he says fuels the Iran war

https://www.politico.com/news/2026/04/11/pope-leo-xiv-denounces-the-delusion-of-omnipotence-he-sa...
38•achierius•1h ago•6 comments

Show HN: macOS ncdu alternative with Finder reveal and live incremental scanning

https://github.com/uAIex/rdu
2•harr01•1h ago•0 comments

CUDA Programming for Nvidia H100s

https://www.freecodecamp.org/news/cuda-programming-for-nvidia-h100s/
1•eigenBasis•1h ago•0 comments