frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•9mo ago

Comments

tocs3•9mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Ask HN: What software has improved dramatically recently thanks to AI tooling?

1•pedrodelfino•25s ago•0 comments

The Death of the Downvote

https://nathankyoung.substack.com/p/the-death-of-the-downvote
1•bookofjoe•1m ago•0 comments

Hedystia – Next-Gen TypeScript Framework for Type-Safe APIs at Lightspeed

https://github.com/Hedystia/Framework
1•Zastinian•2m ago•1 comments

Proposal: Global Solar-Offset Fractional Time (G-Soft) Model

1•4TimeSake•2m ago•0 comments

Physical Laser Art

https://www.youtube.com/channel/UCJUV-vrFg0DI7nq-rbWkhGw
1•unit-vector•6m ago•0 comments

PlayStation gamers could receive £2B compensation if lawsuit succeeds

https://news.sky.com/story/playstation-gamers-could-receive-2bn-compensation-if-lawsuit-succeeds-...
3•Brajeshwar•8m ago•1 comments

EU Parliament: MEPs Vote to End Untargeted Mass Scanning of Private Chats

https://www.patrick-breyer.de/en/historic-chat-control-vote-in-the-eu-parliament-meps-vote-to-end...
2•anigbrowl•11m ago•1 comments

Shell declares force majeure to clients who buy Qatari LNG

https://www.reuters.com/business/energy/shell-totalenergies-others-declare-fm-their-clients-who-t...
1•geox•12m ago•0 comments

We built a lean, high-perf dashboard for Yeahchain

1•YeahchainTECH•13m ago•0 comments

Veil of Ignorance

https://en.wikipedia.org/wiki/Original_position
1•sillywabbit•13m ago•0 comments

New course on generative AI for behavioral science

https://statmodeling.stat.columbia.edu/2026/03/10/new-course-on-generative-ai-for-behavioral-scie...
1•dlojudice•17m ago•0 comments

Google sells partial stake in fiber, becomes minority owner of new venture

https://www.cnbc.com/2026/03/11/google-sells-partial-stake-in-fiber-becomes-minority-owner-in-ven...
3•internet-390•17m ago•0 comments

ICE/DHS gets hacked, all Contractors exposed

https://micahflee.github.io/ice-contracts/
2•peq42•22m ago•0 comments

Scaling the Lexinova Data Pipeline

1•LEXINOVAFaqs•24m ago•0 comments

Microsoft's growing control of Linux (2022)

https://lunduke.substack.com/p/microsofts-growing-control-of-linux
2•totetsu•25m ago•0 comments

Food costs set to spike as urea prices nearly doubles due to war in Iran

https://tradingeconomics.com/commodity/urea
35•burnt-resistor•25m ago•9 comments

Collecting perceptual data for a possible CSS optical-center property

1•gorkemyildiz•26m ago•0 comments

The Department of War is making a mistake [video]

https://www.youtube.com/watch?v=KBPOTklFTiU
1•ipnon•28m ago•0 comments

How do you handle state persistence in non-orientable data structures?

https://zenodo.org/records/18942850
1•MareSerenitatis•30m ago•1 comments

What happens if OpenAI or Anthropic fail?

https://www.reuters.com/commentary/breakingviews/what-happens-if-openai-or-anthropic-fail-2026-03...
6•billybuckwheat•31m ago•2 comments

Ask HN: Is Github Down Again?

https://twitter.com/m0nle0z/status/2031910716790517895
3•doanbactam•32m ago•4 comments

Why America Is Losing the War with Iran

https://chrishedges.substack.com/p/why-america-is-losing-the-war-with
6•chmaynard•32m ago•0 comments

I made a Chrome extension to export an entire Gemini chat

2•backrun•33m ago•0 comments

10 Years Later, I Reverse-Engineered iCloud's SyncToken by Brute Force

https://robhooper.xyz/blog-synctoken.html
2•rhoopr•34m ago•0 comments

Scalable quantum batteries can charge faster than their classical counterparts

https://phys.org/news/2026-03-scalable-quantum-batteries-faster-classical.html
1•Brajeshwar•35m ago•0 comments

Big Tech backs Anthropic in fight against Trump administration

https://www.bbc.com/news/articles/c4g7k7zdd0zo
4•jethronethro•36m ago•0 comments

Tunneling Nanotube

https://en.wikipedia.org/wiki/Tunneling_nanotube
1•rolph•38m ago•0 comments

The New York Times hated crossword puzzles before it embraced them

https://bigthink.com/pessimists-archive/new-york-times-hated-crossword-puzzles-wordle/
2•michaeld123•39m ago•1 comments

Live Coding with Caffeine

https://caffeine.js.org/talks/2018-08-25-demos-teaser/#/title
2•coliveira•39m ago•0 comments

I Don't Destroy Snowmen

https://writings.hongminhee.org/2026/01/ethics-of-small-actions/
4•foxfired•40m ago•2 comments