The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised by the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps, and surprisingly, they still perform about as well as, and sometimes even better than, those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
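
To make the setup in that summary concrete, here is a minimal sketch of how training data of this kind could be built. It is an illustration under stated assumptions, not the paper's code: the grid problems, the `("create" | "close", node, cost, heuristic)` trace format, and the helper names `astar_with_trace` and `make_examples` are all hypothetical. Pairing each problem with another problem's trace is just one assumed way to get traces that are unrelated to the problem while keeping the final plan correct.

```python
import heapq
import random


def astar_with_trace(grid, start, goal):
    """Run A* on a 4-connected grid and return (path, trace).

    grid is a list of strings where '#' marks a wall. The trace lists
    every node the search creates or closes, in order. This trace format
    is an assumption made for illustration.
    """
    def h(p):
        # Manhattan distance: admissible and consistent on this grid.
        return abs(p[0] - goal[0]) + abs(p[1] - goal[1])

    trace = [("create", start, 0, h(start))]
    frontier = [(h(start), 0, start)]
    best_cost = {start: 0}
    parent = {start: None}
    closed = set()
    while frontier:
        _, g, node = heapq.heappop(frontier)
        if node in closed:
            continue  # stale queue entry, already expanded at lower cost
        closed.add(node)
        trace.append(("close", node, g, h(node)))
        if node == goal:
            path = []
            while node is not None:
                path.append(node)
                node = parent[node]
            return path[::-1], trace
        r, c = node
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue
            if grid[nr][nc] == "#" or nxt in closed:
                continue
            if g + 1 < best_cost.get(nxt, float("inf")):
                best_cost[nxt] = g + 1
                parent[nxt] = node
                heapq.heappush(frontier, (g + 1 + h(nxt), g + 1, nxt))
                trace.append(("create", nxt, g + 1, h(nxt)))
    return None, trace  # no path exists


def make_examples(problems, swap_traces=False, seed=0):
    """Build (problem, trace, plan) triples from (grid, start, goal) problems.

    With swap_traces=True every problem keeps its own correct plan but is
    paired with the trace of a different problem (needs >= 2 problems).
    """
    solved = [(p, *astar_with_trace(*p)) for p in problems]
    traces = [t for _, _, t in solved]
    if swap_traces:
        # Rotate by a non-zero offset so no problem keeps its own trace.
        k = random.Random(seed).randrange(1, len(traces))
        traces = traces[k:] + traces[:k]
    return [(p, t, plan) for (p, plan, _), t in zip(solved, traces)]
```

Training one model on `make_examples(problems)` and another on `make_examples(problems, swap_traces=True)` would then give the kind of comparison the summary describes: same final answers, but intermediate tokens that either do or do not correspond to the problem being solved.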
