frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•7mo ago

Comments

tocs3•7mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Sydney Uni data goes walkabout after criminals raid code repo

https://www.theregister.com/2025/12/19/sydney_uni_breach/
1•breve•4s ago•0 comments

Glint AI – Turn text descriptions into animated videos in seconds

https://glintai.org/
1•pmeduri1•6s ago•1 comments

2001

https://www.youtube.com/watch?v=Fwnphd_QUXo
1•ipnon•4m ago•0 comments

CIX releases P1 CPU TRM and developer guides and SDK source code

https://www.cnx-software.com/2025/12/13/cix-releases-p1-cpu-trm-and-developer-guides-for-gpu-ai-a...
1•mocular•6m ago•1 comments

Alma, Elegant AI Provider Orchestration

https://alma.now/
1•jinqueeny•7m ago•0 comments

Seeking honest feedback on production planning and scheduling pain points

https://taktora.ai
1•totallyscout•8m ago•1 comments

TikTok says Chinese owner will retain core US business

https://www.ft.com/content/7a778d46-8bf8-4b11-af4e-5e5bd891cb9d
1•SilverElfin•10m ago•0 comments

CPR: Christmas Present Rush

https://sublevelgames.itch.io/cpr-christmas-present-rush
2•greentec•23m ago•0 comments

Rust and the Price of Ignoring Theory [video]

https://www.youtube.com/watch?v=1iPWt1gvT_w
1•ok123456•23m ago•0 comments

Exposing Game Servers over Tailscale

https://chameth.com/exposing-game-servers-over-tailscale/
2•todsacerdoti•24m ago•0 comments

A Machine Learning Researcher's Notes from 5k Hours of Tekken

1•taha_moji•25m ago•0 comments

Watch these towers get wiggly

https://www.sfgate.com/national-parks/article/national-parks-under-threat-vibrations-21252643.php
1•ubasu•27m ago•0 comments

The Hum

https://en.wikipedia.org/wiki/The_Hum
1•doener•28m ago•0 comments

Text Similarity Search in Postgres

https://blog.kehvyn.dev/blog/pg-trgm-and-text-similarity-search/
1•kehvyn•31m ago•1 comments

Orbital Compute Control Room: A Space-based Data Centre Simulator

https://astrocompute.dev
3•throw0101a•34m ago•0 comments

What's new in Swift: December 2025 Edition

https://swift.org/blog/whats-new-in-swift-december-2025/
1•frizlab•37m ago•0 comments

Don't go to physics grad school and other cautionary tales

https://scottlocklin.wordpress.com/2025/12/19/dont-go-to-physics-grad-school-and-other-cautionary...
3•dxs•38m ago•0 comments

PulseScribe – Open-source voice-to-text for macOS with local AI

https://pulsescribe.me
1•fabszilla•40m ago•1 comments

TAS Explained: Super Mario Bros. 3 in 0.2 seconds [video]

https://www.youtube.com/watch?v=fQYX_AVxGq0
1•Sir_Twist•40m ago•0 comments

Lessons from a year of Postgres CDC in production

https://clickhouse.com/blog/postgres-cdc-year-in-review-2025
1•saisrirampur•42m ago•0 comments

We migrated off Django's storage API to a filesystem-first approach

https://goauthentik.io/blog/2025-12-19-why-we-revamped-file-management/
2•sdko•45m ago•0 comments

Hochul Reaches Deal on A.I. Regulation in New York

https://www.nytimes.com/2025/12/19/nyregion/ai-bill-regulations-ny.html
6•donohoe•47m ago•0 comments

The FOSS community acts like a cult and it's not helping the cause

https://torrent-empress.leaflet.pub/3mackqgyzh22t
6•Aloha•51m ago•11 comments

Ask HN: What was your worst typo?

1•juujian•55m ago•4 comments

Faith in the internet is fading among young Brits

https://www.theregister.com/2025/12/19/internet_bad_for_society/
7•Bender•57m ago•0 comments

Using GraphViz for Claude.md

https://blog.fsck.com/2025/09/29/using-graphviz-for-claudemd/
2•CharlesW•58m ago•0 comments

Show HN: Reavil – Turn qualitative user feedback into structured data

https://reavil.io
1•Jeebz•58m ago•1 comments

Ideatr – build and grow apps at the speed of thought

https://www.ideatr.dev/
1•arjunkshah21•1h ago•1 comments

Deep Time Maps – maps of ancient earth

https://deeptimemaps.com/
1•moultano•1h ago•0 comments

Wow! Signal

https://en.wikipedia.org/wiki/Wow!_signal
4•basilikum•1h ago•0 comments