The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps, and surprisingly, they still perform about as well as, and sometimes even better than, those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
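
For anyone who wants to picture the setup described in the summary, here is a rough sketch in Python. It is not the paper's code: the grid maze, the `("create"/"close", cell)` event format, and the helper names `astar_with_trace`, `trace_is_valid` and `swap_traces` are all assumptions made for illustration. It runs A* and records a step-by-step trace, checks a trace against what A* would really emit, and builds the "wrong trace, right answer" variant by pairing each problem with another problem's trace.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """A* on a 4-connected grid; returns (path, trace).

    grid is a list of strings where '#' is a wall. trace is the list of
    ("create" | "close", cell) events in the order A* performed them.
    """
    def h(cell):
        # Manhattan distance: admissible and consistent for unit step costs.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    open_heap = [(h(start), 0, start)]
    g_cost = {start: 0}
    parent = {start: None}
    closed = set()
    trace = [("create", start)]

    while open_heap:
        _, g, cell = heapq.heappop(open_heap)
        if cell in closed:
            continue  # stale heap entry for an already expanded cell
        closed.add(cell)
        trace.append(("close", cell))
        if cell == goal:
            path = []
            while cell is not None:  # walk parent links back to the start
                path.append(cell)
                cell = parent[cell]
            return path[::-1], trace
        r, c = cell
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue
            if grid[nr][nc] == "#":
                continue
            ng = g + 1
            if ng < g_cost.get((nr, nc), float("inf")):
                g_cost[(nr, nc)] = ng
                parent[(nr, nc)] = cell
                heapq.heappush(open_heap, (ng + h((nr, nc)), ng, (nr, nc)))
                trace.append(("create", (nr, nc)))
    return None, trace  # goal unreachable


def trace_is_valid(grid, start, goal, trace):
    """True if trace is exactly what this A* implementation emits here."""
    _, expected = astar_with_trace(grid, start, goal)
    return trace == expected


def swap_traces(examples):
    """Give each example the trace of the next one, keeping its own plan."""
    n = len(examples)
    return [
        {**ex, "trace": examples[(i + 1) % n]["trace"]}
        for i, ex in enumerate(examples)
    ]
```

Under these assumptions, a training set built with `swap_traces` still has correct final plans, but `trace_is_valid` fails for its traces, which is the kind of mismatch between intermediate steps and answers that the summary says the models were trained on.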
