The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•9mo ago

Comments

tocs3•9mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
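
For anyone wondering what "step-by-step reasoning paths based on A* search" that can be checked might look like in practice, here is a minimal Python sketch. The grid, the `create`/`close` trace tuples, and the names `astar_with_trace` and `trace_is_valid` are all assumptions made up for illustration; this is not the paper's trace format or code. The idea is only that a trace produced by a known algorithm can be checked mechanically, separately from whether the final path is right.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """Run A* on a 4-connected grid (0 = free, 1 = wall).

    Returns (path, trace). The trace is a list of hypothetical
    ("create" | "close", cell, g, h) tuples recording each search step.
    """
    def h(cell):
        # Manhattan distance to the goal (admissible on a unit-cost grid).
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    trace = [("create", start, 0, h(start))]
    frontier = [(h(start), 0, start)]
    best_g = {start: 0}
    parent = {start: None}
    closed = set()

    while frontier:
        _, g, cell = heapq.heappop(frontier)
        if cell in closed:
            continue  # stale queue entry
        closed.add(cell)
        trace.append(("close", cell, g, h(cell)))
        if cell == goal:
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1], trace
        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue
            if grid[nr][nc] == 1 or nxt in closed:
                continue
            ng = g + 1
            if ng < best_g.get(nxt, float("inf")):
                best_g[nxt] = ng
                parent[nxt] = cell
                heapq.heappush(frontier, (ng + h(nxt), ng, nxt))
                trace.append(("create", nxt, ng, h(nxt)))
    return None, trace


def trace_is_valid(trace):
    """Check that each 'close' refers to a cell created earlier at the same cost
    and that no cell is closed twice."""
    created, closed = set(), set()
    for op, cell, g, _ in trace:
        if op == "create":
            created.add((cell, g))
        elif op == "close":
            if (cell, g) not in created or cell in closed:
                return False
            closed.add(cell)
        else:
            return False
    return True


# Tiny example maze: the only route goes around the wall in the middle row.
grid = [
    [0, 0, 0],
    [1, 1, 0],
    [0, 0, 0],
]
path, trace = astar_with_trace(grid, (0, 0), (2, 0))
scrambled = trace[::-1]  # same steps in an order A* could not have produced
```

In this sketch `trace_is_valid(trace)` holds while `trace_is_valid(scrambled)` does not, even though both contain exactly the same steps. That is the kind of distinction the summary is talking about: the final answer and the validity of the intermediate steps are two separate things to measure.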
