The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•1y ago

Comments

tocs3•1y ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps, and surprisingly, they still perform about the same as, and sometimes even better than, those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
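
For anyone who wants to see what "check both the final answers and the reasoning steps" could mean in practice, here is a minimal sketch, not the paper's code. It assumes a small grid maze as the problem, and the `create`/`close` trace format, the function names, and the example grid are all hypothetical choices of mine; the summary above only says that the traces come from A* search. The point is that the trace (what the search did) and the path (the final answer) are separate outputs, and the path can be validated without looking at the trace at all.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """A* on a 4-connected grid; returns (path, trace).

    grid: list of strings, '#' is a wall, anything else is free.
    trace: ("create" | "close", cell, g, h) tuples in the order the
    search performed them. This trace format is an assumption made
    for illustration only.
    """
    def h(cell):
        # Manhattan distance: admissible on a unit-cost 4-connected grid.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    trace = [("create", start, 0, h(start))]
    frontier = [(h(start), 0, start)]  # entries are (f, g, cell)
    best_g = {start: 0}
    parent = {start: None}
    closed = set()

    while frontier:
        _, g, cell = heapq.heappop(frontier)
        if cell in closed:
            continue  # stale queue entry for an already expanded cell
        closed.add(cell)
        trace.append(("close", cell, g, h(cell)))
        if cell == goal:
            # Walk parent links back to the start to recover the path.
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1], trace
        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue
            if grid[nr][nc] == "#":
                continue
            ng = g + 1
            if ng < best_g.get(nxt, float("inf")):
                best_g[nxt] = ng
                parent[nxt] = cell
                heapq.heappush(frontier, (ng + h(nxt), ng, nxt))
                trace.append(("create", nxt, ng, h(nxt)))
    return None, trace  # goal unreachable


def path_is_valid(grid, start, goal, path):
    """Validate the final answer without consulting the trace."""
    if not path or path[0] != start or path[-1] != goal:
        return False
    for (r1, c1), (r2, c2) in zip(path, path[1:]):
        if not (0 <= r2 < len(grid) and 0 <= c2 < len(grid[0])):
            return False
        if abs(r1 - r2) + abs(c1 - c2) != 1 or grid[r2][c2] == "#":
            return False
    return True
```

Calling `astar_with_trace(["..#", "..#", "..."], (0, 0), (2, 2))` returns a five-cell path plus the list of search operations. A model's output could be scored the same way: `path_is_valid` for the answer, and a separate comparison against the A* trace for the intermediate tokens, which is how a correct answer can coexist with an invalid trace.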
