frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•7mo ago

Comments

tocs3•7mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Nw_wrld is an event-driven sequencer for triggering visuals [video]

https://www.youtube.com/watch?v=6vM_b54pWtg
1•_josh_meyer_•2m ago•1 comments

Agent Skills That Work

https://agentskills.work/sellers-net-sheet-florida
1•zlonmask•3m ago•1 comments

AI Blog

https://ai-blog-peach.vercel.app
1•dagmawibabi•5m ago•1 comments

Tell HN: AI could bring back GraphQL from the brink

1•sergiotapia•6m ago•0 comments

Tjs: Fastest and most accurate JSON-schema validator

https://github.com/sberan/tjs
1•handfuloflight•6m ago•0 comments

Chinese military says it is developing over 10 quantum warfare weapons

https://www.scmp.com/news/china/science/article/3339907/chinese-military-says-it-developing-over-...
2•KnuthIsGod•6m ago•0 comments

Show HN: Flashlight: Android app that lets you control torch brightness

https://github.com/rtvkiz/Flashlight
1•ritvikarya98•8m ago•0 comments

Appliances, Factories and the Grid

https://mercurialsolo.substack.com/p/appliances-factories-grids
1•mercurialsolo•9m ago•1 comments

Microsoft-Activision deal was meant to help The Embracer Grou

https://www.gamefile.news/p/bobby-kotick-activision-microsoft-ap7-lawsuit-embracer-group
1•firesteelrain•9m ago•0 comments

Tract: Self-contained, TensorFlow and ONNX inference

https://github.com/sonos/tract
1•vishnukvmd•9m ago•0 comments

Rampsliding Is a Quake Engine Quirk in the Same Way That Bunnyhopping Is

https://www.ryanliptak.com/blog/rampsliding-quake-engine-quirk/
1•luu•11m ago•0 comments

Scientists Discover Form of Water That's Both Solid and Liquid

https://studyfinds.org/scientists-discover-bizarre-water-both-solid-liquid/
1•adwmayer•11m ago•0 comments

Oracle to PostgreSQL DDL: Data Types, Partitions and More

https://www.datacloudgaze.com/post/oracle-postgresql-ddl-migration-guide
1•mahtodeepak•12m ago•0 comments

Airbnb Taps Meta AI Executive as New Chief Technology Officer

https://www.bloomberg.com/news/articles/2026-01-14/airbnb-taps-meta-ai-executive-as-new-chief-tec...
1•doppp•14m ago•0 comments

Free AI-Powered Tools

https://figtalia.com/
1•sifuncion•16m ago•0 comments

The Mythology of Conscious AI

https://www.noemamag.com/the-mythology-of-conscious-ai/
2•hardmaru•19m ago•0 comments

The Sword Blade Bank

https://tontinecoffeehouse.com/2020/09/28/the-sword-blade-bank/
2•dmonay•22m ago•0 comments

Short Supply: Root Causes of Declining Propensity for US Military Service

https://www.cnas.org/publications/reports/short-supply
1•toomuchtodo•26m ago•1 comments

OpenAI acquires health-care technology startup Torch

https://www.cnbc.com/2026/01/12/open-ai-torch-health-care-technology.html
1•gmays•26m ago•0 comments

AI Voice Elements

https://vercel.com/changelog/ai-voice-elements
2•handfuloflight•31m ago•1 comments

Rickroll in Rustc

https://github.com/rust-lang/rust/blob/main/tests/ui/attributes/check-cfg_attr-ice.rs
1•todsacerdoti•32m ago•0 comments

Failed part on UPS plane that crashed in KY failed 4x on other planes previously

https://apnews.com/article/ups-louisville-plane-crash-ntsb-md11-6d4cfff0c3937f847a3ac39809e31c11
3•toomuchtodo•36m ago•1 comments

China's customs agents told Nvidia's H200 chips are not permitted, sources say

https://www.reuters.com/world/china/chinas-customs-agents-told-nvidias-h200-chips-are-not-permitt...
3•donohoe•39m ago•0 comments

How to Ask Questions the Smart Way

http://www.catb.org/esr/faqs/smart-questions.html
1•thunderbong•39m ago•0 comments

Immigration agents shoot man in Minneapolis as tensions in city run high

https://www.theguardian.com/us-news/2026/jan/14/minnesota-immigration-officers-shovel-attack
5•SilverElfin•40m ago•3 comments

Battery is about to change the world in 3 months, or make this guy a fool

https://electrek.co/2026/01/14/batter-about-change-world-or-make-this-guy-fool/
3•topher515•40m ago•1 comments

With this tool, you can enjoy NAS functionality even without a NAS

https://quicksend.chat/
2•foodhome•41m ago•0 comments

String Theory Can Now Describe a Universe That Has Dark Energy

https://www.quantamagazine.org/string-theory-can-now-describe-a-universe-that-has-dark-energy-202...
1•jnord•42m ago•0 comments

DeepSeek Founder Liang's Funds Surge 57% as China Quants Boom

https://www.bloomberg.com/news/articles/2026-01-12/deepseek-founder-liang-s-funds-surge-57-as-chi...
1•gmays•46m ago•0 comments

Thesys: Generative UI Framework

https://www.thesys.dev
1•handfuloflight•46m ago•0 comments