frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•9mo ago

Comments

tocs3•9mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Yes, and

https://htmx.org/essays/yes-and/
1•todsacerdoti•58s ago•0 comments

The Self-Driving Codebase

https://background-agents.com/
1•vinhnx•1m ago•0 comments

Scrapling: An adaptive web scraping framework that handles everything

https://github.com/D4Vinci/Scrapling
1•Paddyz•8m ago•0 comments

Show HN: RayClaw – AI agent like OpenClaw, standalone or as a Rust crate

https://github.com/rayclaw/rayclaw
1•stevensu•9m ago•0 comments

I'm in Tehran, what do you think will be happen?

2•pajuhaan•12m ago•2 comments

An open-source, C#-based Windows RAT (Remote Access Tool)

https://github.com/iss4cf0ng/DuplexSpyCS
1•iss4cf0ng•13m ago•1 comments

Show HN: How many hours have you spent with Claude Code? (CLI tool)

2•yurukusa•14m ago•0 comments

The most beautiful formula not enough people understand (3blue1brown) [video]

https://www.youtube.com/watch?v=fsLh-NYhOoU
2•vismit2000•14m ago•0 comments

Show HN: AI Moire Pattern Remover – Free tool for screen photos, scans, fabrics

https://moireremoval.com
1•wyattly•15m ago•0 comments

Unsloth Dynamic 2.0 GGUFs

https://unsloth.ai/docs/basics/unsloth-dynamic-2.0-ggufs
2•tosh•16m ago•0 comments

Qwen3.5 GGUF Benchmarks

https://unsloth.ai/docs/models/qwen3.5/gguf-benchmarks
2•tosh•18m ago•0 comments

Israel and US launch strikes on Iran as Trump confirms 'major combat operations'

https://www.reuters.com/world/iran-crisis-live-explosions-tehran-israel-announces-strike-2026-02-28/
2•TechTechTech•21m ago•0 comments

Dan Simmons, author of Hyperion and more books, dies at 77

https://arstechnica.com/culture/2026/02/hyperion-author-dan-simmons-dies-from-stroke-at-77/
1•geuis•22m ago•0 comments

Open source router for personal AI agents

https://manifest.build/docs/introduction
1•stosssik•23m ago•0 comments

Is an event planning platform UAE suitable for weddings and private parties?

1•tonhurt•25m ago•1 comments

Cloudflare radar shows Iran internet traffic drop to near zero

https://radar.cloudflare.com/traffic/ir?dateRange=1d
2•nodesocket•26m ago•0 comments

Ask HN: Why are some websites locking or using the audio device on Windows?

1•ezconnect•30m ago•0 comments

Iran agreed zero enriched uranium, and israel immediately bombs Tehran

https://twitter.com/muhammadshehad2/status/2027645609621033376
3•proshno•35m ago•1 comments

Leak confirms GrapheneOS and Motorola partnership

https://piunikaweb.com/2026/02/27/leak-confirms-grapheneos-motorola-partnership-for-non-pixel-har...
3•Cider9986•36m ago•0 comments

You might be falling behind without realizing it

1•hoangnnguyen•40m ago•1 comments

Show HN: AI Tutorial Series – Building Neural Networks with Python

https://peertube.cif.su/watari/videos/watch/123456
1•watari_ai_tools•42m ago•0 comments

Show HN: Agent Hand – Tmux session manager for AI coding agents (Rust)

https://weykon.github.io/agent-hand/
1•weykon•50m ago•0 comments

The proposal for generic methods for Go has been officially accepted

https://github.com/golang/go/issues/77273
2•birdculture•50m ago•0 comments

Startup idea validator – Get brutal verdict

https://dontbuild.it/
1•hackerbo•58m ago•0 comments

Students Are Skipping the Hardest Part of Growing Up

https://larrycuban.wordpress.com/2026/02/28/students-are-skipping-the-hardest-part-of-growing-up-...
4•Tomte•1h ago•0 comments

Packaging AI/ML Models as Conda Packages

https://prefix.dev/blog/packaging-ai-ml-models-as-conda-packages
1•droelf•1h ago•0 comments

OpenAI: Food First, Then Morals

1•goloroden•1h ago•0 comments

Show HN: News Pulse – Real-time global news feed, 475 sources, no algorithm

https://news-alert-eta.vercel.app
6•trevwebdev•1h ago•1 comments

Show HN: Standup.so – Paste your commits, get a standup report in seconds

https://standup-so.vercel.app
1•neoloong•1h ago•1 comments

Redirector Privacy Alternatives: forward links to privacy aware front ends

https://github.com/duyfken/RedirectorPrivacyAlternatives
2•Baljhin•1h ago•1 comments