frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

The terrifying and efficient world of Olympic ski airlifts

https://www.latimes.com/sports/olympics/story/2026-02-13/inside-terrifying-efficient-world-of-oly...
1•bookofjoe•2m ago•1 comments

Four new astronauts arrive via SpaceX rocket at International Space Station

https://www.theguardian.com/science/2026/feb/14/international-space-station-full-crew
1•andsoitis•3m ago•0 comments

What happens when you put Claude, GPT, Grok, and DeepSeek in the same room?

https://warpmode.io
1•spranab•8m ago•0 comments

Ask HN: Alternatives to the Big 4 for SoC 2 compliance?

1•IsraCV•9m ago•0 comments

The myth of the high-tech heist

https://www.technologyreview.com/2026/02/13/1132397/myth-of-high-tech-heist/
1•gnabgib•10m ago•0 comments

Sonder is a word I like

https://www.autodidacts.io/sonder/
1•Curiositry•10m ago•0 comments

I built a bot to grab Berlinale film festival tickets that sell out in seconds

https://github.com/Rswcf/berlinale-ticket-buyer
1•rswcf•11m ago•1 comments

Narmada Human

https://en.wikipedia.org/wiki/Narmada_Human
1•thunderbong•13m ago•0 comments

Stitching Vision Encoders into LLMs: Clip vs. I-JEPA vs. ViT Comparison

https://teendifferent.substack.com/p/stitching-vision-into-llms-a-comparative
1•teendifferent•15m ago•1 comments

Bulletproof: A Look into Aéza

https://213.si/blog/bulletproof-a-look-into-aeza
1•dev213•21m ago•0 comments

NewPipe: YouTube client without vertical videos and algorithmic feed

https://newpipe.net/
2•nvader•22m ago•0 comments

Galactic Matter and Interstellar Flight [pdf]

http://large.stanford.edu/courses/2013/ph241/micks1/docs/bussard.pdf
2•bediger4000•23m ago•0 comments

Prayerfully journey through Lent on the Exodus 90 App

https://exodus90.com/how-lent-works/
1•nvader•24m ago•0 comments

The Battle of the Beams

https://en.wikipedia.org/wiki/Battle_of_the_Beams
4•jacquesm•25m ago•0 comments

I love the work of the ArchWiki maintainers

https://k7r.eu/i-love-the-work-of-the-archwiki-maintainers/
2•panic•25m ago•0 comments

Cuba's regime is in dire straits

https://www.economist.com/the-americas/2026/01/14/cubas-regime-is-in-dire-straits
3•ViktorRay•30m ago•0 comments

Anthropic's Public Benefit Mission

https://simonwillison.net/2026/Feb/13/anthropic-public-benefit-mission/
3•abdelhousni•33m ago•0 comments

States reliant on Colorado River fail to meet latest deadline to find consensus

https://apnews.com/article/colorado-river-arizona-california-nevada-water-45daf816feba9004c389dc4...
3•bikenaga•35m ago•0 comments

An open-source real-time motor driver for the Lego Orrery

https://gorkem.cc/projects/LegoOrreryMod/
1•gorkyver•37m ago•0 comments

Hardest Problem in Computer Science: Centering Things

https://tonsky.me/blog/centering/
1•signa11•40m ago•1 comments

I Have Nothing but Red Herring to Hide

https://theprivacydad.com/i-have-nothing-but-red-herring-to-hide/
1•theprivacydad•40m ago•0 comments

Let's Get Physical

https://m4iler.cloud/posts/lets-get-physical/
1•MBCook•42m ago•0 comments

Computing Inequality: Have Computers Changed the Labor Market? (1977) [pdf]

https://economics.mit.edu/sites/default/files/publications/computing%20inequality%201998.pdf
1•yowmamasita•45m ago•0 comments

MicroGPT - Train and inference a GPT in pure, dependency-free Python (200 lines)

https://gist.github.com/karpathy/8627fe009c40f57531cb18360106ce95
1•susam•45m ago•0 comments

Zig landed io_uring and Grand Central Dispatch std.Io implementations

https://ziglang.org/devlog/2026/?20260213#2026-02-13
1•todsacerdoti•46m ago•0 comments

Show HN: Asked AI to write for fun. It built a CMS to blog on

https://www.omarcms.com/
1•ewimsatt•46m ago•0 comments

On TikTok, we're all Chinese – but the trend doesn't paint the full picture

https://www.bbc.com/news/articles/cz6eljqvyp1o
2•haunter•47m ago•0 comments

Show HN: ShareMyGit – Share private Gitea repos without making them public

https://sharemygit.com/
1•onesandofgrain•52m ago•0 comments

Modular Inch Increment Plastic Drawer Organizers

https://www.schallercorporation.com/
1•walterbell•55m ago•0 comments

Show HN: PinchChat, an open-source webchat UI for OpenClaw

https://github.com/MarlBurroW/pinchchat
1•marlburrow•1h ago•0 comments