frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•1y ago

Comments

tocs3•1y ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Show HN: Threshold Concepts in CS – ideas that permanently change how you think

https://github.com/nikitph/awesome-threshold-concepts
1•loaderchips•31s ago•0 comments

Meta workers can opt out of being tracked at work up to 30 min

https://www.bbc.com/news/articles/c93x0k194yno
2•reconnecting•47s ago•0 comments

India's High-Stakes Push for Sovereign AI Faces Reality Check

https://www.bloomberg.com/news/features/2026-06-02/modi-wants-india-to-join-japan-uk-in-ai-superp...
1•jmsflknr•48s ago•0 comments

New York to require 3D printers to be equipped with filter software

https://www.governor.ny.gov/news/keeping-new-yorkers-safe-governor-hochul-signs-legislation-stren...
1•15155•2m ago•0 comments

AI Bots Cite Dentists More Than Fortune 500s. The Data Surprised Us

https://engagemii.com/blog/dentists-vs-fortune-500-ai-citations
1•Greg_engagemii•3m ago•0 comments

Show HN: Python PCAP Analyzer

https://github.com/Raduurjan/Python_PCAP_analyzer
1•RaduUrj•3m ago•0 comments

AI licensing coalition SPUR in expansion

https://pressgazette.co.uk/news/ai-licensing-coalition-spur-in-huge-expansion/
1•thm•4m ago•0 comments

Basic Soldering Lesson 1 – "Solder and Flux" [video]

https://www.youtube.com/watch?v=vIT4ra6Mo0s
1•__natty__•5m ago•0 comments

Europe's First Apple Developer Center to Open in Berlin

https://www.apple.com/uk/newsroom/2026/06/europes-first-apple-developer-center-to-open-in-berlin/
1•thm•5m ago•0 comments

ChatGPT Isn't Just Changing How We Work. It's Harming How We Think

https://thewalrus.ca/chatgpt-isnt-just-changing-how-we-work-its-harming-how-we-think/
1•debo_•6m ago•0 comments

Google's Top DMCA Sender Plateaus at 70M Takedowns per Week

https://torrentfreak.com/googles-top-dmca-sender-plateaus-at-70-million-takedowns-per-week/
1•isaacfrond•7m ago•0 comments

API/MCP to check if a physical product is legal to sell in 103 countries

https://legaldata-public.cleolabs.co/products
1•naomiehl•7m ago•0 comments

EU plots long game against US digital supremacy

https://www.politico.eu/article/eu-plots-long-game-against-us-digital-supremacy/
1•thm•7m ago•0 comments

German startup advancing compressor-free electrocaloric heat pump technology

https://www.pv-magazine.com/2026/06/02/german-startup-advancing-compressor-free-electrocaloric-he...
1•rustoo•7m ago•0 comments

Are Electrons Real?

https://physics.aps.org/articles/v19/70
1•sohkamyung•9m ago•0 comments

Amazon's Ring sued over facial recognition feature

https://www.reuters.com/legal/government/amazons-ring-sued-over-facial-recognition-feature-latest...
2•1vuio0pswjnm7•10m ago•1 comments

Show HN: Self tuning chat exposing it's semantic and agentic cache

https://chat.betterdb.com
2•kivanowbetterdb•11m ago•0 comments

I aint gonna work on Maggie's Datacenter no more [video]

https://www.youtube.com/watch?v=FbUHfsJ44x8
1•gdiamos•11m ago•0 comments

Why 374,000 Californians have dropped their Covered CA insurance

https://ktla.com/news/california/californians-drop-covered-california-health-insurance/
1•Bender•12m ago•0 comments

Browser-based image converters (WASM) and a URL image scanner

https://imagedimensions.com
1•Bishop81•12m ago•0 comments

Ask HN: Simple architecture for group messaging with no need to trust operator?

1•julienreszka•13m ago•1 comments

Open Source Appsec Scanner

https://github.com/sinewaveai/agent-security-scanner-mcp
1•dchitimalla•14m ago•0 comments

Git Has a Variable Named false_but_the_compiler_does_not_know_it

https://blog.codingconfessions.com/p/false-but-the-compiler-does-not-know-it
2•sohkamyung•15m ago•0 comments

I Wore a Smart Fart Wearable for Three Days. Here's What I Learned

https://gizmodo.com/i-wore-a-smart-fart-wearable-for-three-days-heres-what-i-learned-2000760032
1•Bender•16m ago•0 comments

Huub, modern CP+Sat solver written in Rust

https://huub.solutions/
1•knuckleheads•17m ago•0 comments

Landmark cancer trial shows success against 'undruggable' cancer

https://www.nature.com/articles/d41586-026-01760-w
1•Bender•17m ago•0 comments

Uber's $1,500/Month AI Limit Is a Useful Signal for AI Tool Pricing

https://simonwillison.net/2026/Jun/3/uber-caps-usage/
2•pdyc•17m ago•0 comments

Insaion – Automated ROS2 fleet monitoring, now with OpenTelemetry bridging

https://www.insaion.com/blog/insaion-v0-8-4-opentelemetry
1•vicmassy•17m ago•0 comments

Ask HN: Would you use a soundproof mic mask to dictate to AI in public?

1•sodakim•18m ago•0 comments

Build 2026: Union types in C#

https://build.microsoft.com/en-US/sessions/DEM304
1•pjmlp•20m ago•0 comments