The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•11mo ago

Comments

tocs3•11mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps, and surprisingly, they still perform about the same as, and sometimes even better than, those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
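For anyone who wants to picture the setup the summary describes, here is a minimal sketch of A* search that records its intermediate steps alongside the final path. The grid encoding, the `create`/`close` step labels, and the function names `astar_with_trace` and `shuffled_trace` are all assumptions made for illustration, not the paper's actual data format or corruption procedure. The shuffled trace is only one hypothetical way to produce "reasonless" steps that no longer correspond to a valid search.

```python
import heapq
import random


def astar_with_trace(grid, start, goal):
    """Run A* on a 4-connected grid (0 = free, 1 = wall).

    Returns (path, trace). The trace is a list of
    (step, cell, g, h) tuples; this format is hypothetical.
    """
    def h(cell):
        # Manhattan distance: admissible and consistent for unit-cost moves.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    rows, cols = len(grid), len(grid[0])
    open_heap = [(h(start), 0, start)]
    best_g = {start: 0}
    parent = {start: None}
    closed = set()
    trace = [("create", start, 0, h(start))]

    while open_heap:
        _, g, cell = heapq.heappop(open_heap)
        if cell in closed:
            continue  # stale heap entry for an already expanded cell
        closed.add(cell)
        trace.append(("close", cell, g, h(cell)))
        if cell == goal:
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1], trace
        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if not (0 <= nr < rows and 0 <= nc < cols) or grid[nr][nc] == 1:
                continue
            ng = g + 1
            if ng < best_g.get(nxt, float("inf")):
                best_g[nxt] = ng
                parent[nxt] = cell
                heapq.heappush(open_heap, (ng + h(nxt), ng, nxt))
                trace.append(("create", nxt, ng, h(nxt)))
    return None, trace


def shuffled_trace(trace, seed=0):
    """Hypothetical corruption: same steps, order scrambled."""
    steps = list(trace)
    random.Random(seed).shuffle(steps)
    return steps


grid = [
    [0, 0, 0],
    [1, 1, 0],
    [0, 0, 0],
]
path, trace = astar_with_trace(grid, (0, 0), (2, 0))
noisy = shuffled_trace(trace)
```

A training example in this sketch would pair the problem with either `trace` or `noisy`, followed by `path`. Because the path can be checked independently of the steps, a model's final answer and its intermediate tokens can be scored separately.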
