frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

From Golden Gate Bridge to JSON: Why Anthropic's SAE Failed on JSON Output

https://huggingface.co/blog/MaziyarPanahi/sae-steering-json
1•maziyar•3m ago•1 comments

Show HN: Nginx-lint – A linter for Nginx configs with plugin support and autofix

https://github.com/walf443/nginx-lint
1•walf4431•4m ago•0 comments

Astounding facts about crocodile eyes [video]

https://www.youtube.com/watch?v=QSMw3adYh2Y
1•teleforce•4m ago•0 comments

I think AI use is reflected in GitHub stats at least a bit

https://vester.si/blog/github-stats/
1•vesterde•5m ago•1 comments

Amp Free Is Full

https://ampcode.com/news/amp-free-is-full-for-now
1•Charmunk•9m ago•0 comments

Show HN: 80 Lines of codes to transform Codex into a personalized assistant

https://github.com/RalphMao/seedbot
1•asskicker•9m ago•0 comments

Fun With Pinball

https://www.funwithpinball.com/exhibits/small-boards
2•jackwilsdon•10m ago•0 comments

Erythritol May Damage Critical Brain Barrier, Risking Stroke

https://www.sciencealert.com/common-sweetener-may-damage-critical-brain-barrier-risking-stroke
4•Gaishan•11m ago•0 comments

The Housing Debate Is Finally Catching Up to Reality

https://www.strongtowns.org/journal/2026-2-9-the-housing-debate-is-finally-catching-up-to-reality
3•cassepipe•11m ago•0 comments

Show HN: Google Search MCP for local LLMs – 14 tools, no API key

https://github.com/VincentKaufmann/noapi-google-search-mcp
1•vkaufmann•13m ago•0 comments

Lockfiles Killed Vendoring

https://nesbitt.io/2026/02/10/lockfiles-killed-vendoring.html
1•8organicbits•13m ago•0 comments

Ctoc: Cloc, but for Claude Token Counts

https://grohan.co/2026/02/10/ctoc/
1•grohan•15m ago•1 comments

What Is Claude? Anthropic Doesn’t Know, Either

https://www.newyorker.com/magazine/2026/02/16/what-is-claude-anthropic-doesnt-know-either
1•petethomas•17m ago•0 comments

A warning to Seattle: Don't become the next Cleveland

https://platformonomics.com/2026/02/a-warning-to-seattle-dont-become-the-next-cleveland/
3•derekered•18m ago•1 comments

OpenAI Executive Who Opposed 'Adult Mode' Fired for Sexual Discrimination

https://www.wsj.com/tech/ai/openai-executive-who-opposed-adult-mode-fired-for-sexual-discriminati...
2•impish9208•21m ago•2 comments

RynnBrain

https://github.com/alibaba-damo-academy/RynnBrain
1•jsemrau•24m ago•0 comments

Show HN: CAD parts builder with WASM and WebGL

https://modo.is/b/multimeter-stand-with-storage
1•Beefin•29m ago•0 comments

GPT-5.3-Codex being routed to GPT-5.2

https://github.com/openai/codex/issues/11189
4•cactusplant7374•30m ago•1 comments

Show HN: Thoth – Obsidian AI Research Assistant

https://github.com/acertainKnight/project-thoth
1•acertainKnight•31m ago•0 comments

A Daily AI Chat

https://jonathannen.com/daily-ai-chat
1•jwilliams•31m ago•0 comments

Show HN: A "Today" page that turns the top HN news into lyrics, free APIs only

https://eruci.com/t.html
1•eruci•32m ago•0 comments

Replay: Eliminate Human Memory with AI

https://runreplay.com
1•zyadelgohary1•35m ago•2 comments

Ask HN: AI Companions with Persistent Memory

1•warmreed•36m ago•0 comments

.NET 11 Preview 1 is now available

https://devblogs.microsoft.com/dotnet/dotnet-11-preview-1/
2•runesoerensen•36m ago•0 comments

Show HN: Txtweb – serve a website from your domain's DNS TXT record

https://txtweb.lefelys.com/
2•lefelys•38m ago•0 comments

FDA refuses to review Moderna's application for mRNA flu vaccine

https://www.cnn.com/2026/02/10/health/fda-moderna-mrna-flu-vaccine
6•zzzeek•38m ago•1 comments

I Forced 10 AIs to play Among Us (it was insane) [video]

https://www.youtube.com/watch?v=Sxmd7T_dyaM
1•vinnyglennon•39m ago•0 comments

Game Boy Advance Dev: Drawing Pixels

https://www.mattgreer.dev/blog/gba-dev-drawing-pixels/
1•birdculture•39m ago•0 comments

You shouldn't be doing that by hand

https://airstack.garden/
1•kaniksu•39m ago•0 comments

Dorodango

https://blog.fsck.com/2026/02/10/dorodango/
2•arittr•40m ago•0 comments