The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•7mo ago

Comments

tocs3•7mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps, and surprisingly, they still perform about as well as, and sometimes even better than, those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
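
For anyone who wants to see what "correct steps" versus "incorrect steps" could look like as data, here is a rough Python sketch. It is not the paper's code. The grid mazes, the `create`/`close` trace format, and the trick of borrowing a trace from a different maze are all assumptions made for illustration. It runs A* once per maze, records the search trace, and then builds two training examples that share the same valid answer but differ in whether the trace belongs to the problem.

```python
import heapq

def astar_with_trace(grid, start, goal):
    """Run A* on a 4-connected grid and return (trace, plan).

    trace: list of ("create" | "close", cell, g, h) events in search order.
    plan:  list of cells from start to goal, or None if unreachable.
    """
    def h(cell):
        # Manhattan distance, admissible and consistent on a unit-cost grid
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    trace = [("create", start, 0, h(start))]
    frontier = [(h(start), 0, start)]
    best_g = {start: 0}
    parent = {start: None}
    closed = set()
    while frontier:
        _, g, cell = heapq.heappop(frontier)
        if cell in closed:
            continue  # stale queue entry
        closed.add(cell)
        trace.append(("close", cell, g, h(cell)))
        if cell == goal:
            plan = []
            while cell is not None:
                plan.append(cell)
                cell = parent[cell]
            return trace, plan[::-1]
        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue
            if grid[nr][nc] == "#":
                continue
            if g + 1 < best_g.get(nxt, float("inf")):
                best_g[nxt] = g + 1
                parent[nxt] = cell
                heapq.heappush(frontier, (g + 1 + h(nxt), g + 1, nxt))
                trace.append(("create", nxt, g + 1, h(nxt)))
    return trace, None

def plan_is_valid(grid, start, goal, plan):
    """Check the final answer only: a wall-free path of unit steps."""
    if not plan or plan[0] != start or plan[-1] != goal:
        return False
    for (r1, c1), (r2, c2) in zip(plan, plan[1:]):
        if abs(r1 - r2) + abs(c1 - c2) != 1 or grid[r2][c2] == "#":
            return False
    return grid[start[0]][start[1]] != "#"

def trace_touches_wall(grid, trace):
    """Crude trace check: does any event mention a wall cell of this grid?"""
    return any(grid[r][c] == "#" for _, (r, c), _, _ in trace)

# Two hypothetical 3x3 mazes ("#" is a wall) with the same start and goal.
maze_a = ["..#", "...", "#.."]
maze_b = ["...", ".#.", "..."]
start, goal = (0, 0), (2, 2)

trace_a, plan_a = astar_with_trace(maze_a, start, goal)
trace_b, _ = astar_with_trace(maze_b, start, goal)

# Same problem and same answer; only the intermediate tokens differ.
correct = {"problem": maze_a, "trace": trace_a, "plan": plan_a}
swapped = {"problem": maze_a, "trace": trace_b, "plan": plan_a}

print(plan_is_valid(maze_a, start, goal, correct["plan"]))
print(trace_touches_wall(maze_a, correct["trace"]))
print(trace_touches_wall(maze_a, swapped["trace"]))
```

The point of keeping `plan_is_valid` separate from the trace check is that the two can be scored independently: the answer can be right while the steps shown before it have nothing to do with the problem.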
