The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•1y ago

Comments

tocs3•1y ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
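
For anyone wondering what checking "both the final answers and the reasoning steps" could look like in practice, here is a minimal sketch. It runs A* on a small grid, logs each step as a trace, and validates the final plan separately from that trace. The grid world, the "create"/"close" trace format, and the function names are my own illustrative assumptions, not details taken from the paper.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """A* on a 4-connected grid (1 = wall). Returns (path, trace).

    The trace is a list of (operation, cell, cost) tuples, a hypothetical
    stand-in for the intermediate tokens a model would be trained on.
    """
    def h(cell):
        # Manhattan distance, admissible for unit-cost 4-connected moves.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    open_heap = [(h(start), 0, start)]
    came_from = {start: None}
    best_g = {start: 0}
    closed = set()
    trace = []

    while open_heap:
        _, g, cell = heapq.heappop(open_heap)
        if cell in closed:
            continue  # stale heap entry
        closed.add(cell)
        trace.append(("close", cell, g))
        if cell == goal:
            path = []
            while cell is not None:
                path.append(cell)
                cell = came_from[cell]
            return path[::-1], trace
        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue
            if grid[nr][nc] == 1:
                continue
            ng = g + 1
            if ng < best_g.get(nxt, float("inf")):
                best_g[nxt] = ng
                came_from[nxt] = cell
                heapq.heappush(open_heap, (ng + h(nxt), ng, nxt))
                trace.append(("create", nxt, ng))
    return None, trace


def plan_is_valid(grid, start, goal, path):
    """Check the final answer only: endpoints, adjacency, no walls."""
    if not path or path[0] != start or path[-1] != goal:
        return False
    for (r1, c1), (r2, c2) in zip(path, path[1:]):
        if abs(r1 - r2) + abs(c1 - c2) != 1:
            return False
    return all(grid[r][c] == 0 for r, c in path)


grid = [
    [0, 0, 0],
    [1, 1, 0],
    [0, 0, 0],
]
path, trace = astar_with_trace(grid, (0, 0), (2, 0))
```

The point of keeping `plan_is_valid` separate from the trace is that a plan can pass this check even if the logged steps before it were wrong or random, which is the gap the summary above describes.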

iPhone Deployment of End-to-End Perception via Auto-Labeled Synthetic Data

https://arxiv.org/abs/2604.25949
1•PaulHoule•43s ago•0 comments

VibeOS

https://en.wikipedia.org/wiki/VibeOS
1•maayank•2m ago•0 comments

Show HN: TuringLLM – a LLM-powered Universal Turing machine

https://github.com/gmlion/TuringLLM
1•gmlion•2m ago•0 comments

Apple Silicon's on-device AI bet hasn't moved – only the chip range that runs it

https://tbreak.com/apple-silicon-on-device-ai-doug-brooks-wwdc/
1•Austin_Conlon•3m ago•0 comments

Three of our worst VC stories

https://twitter.com/eastdakota/status/2062860530360959273
2•orgonon•3m ago•0 comments

Nvidia DGX Spark GB10 – AI Models and Guide with vLLM and Autonomous Script

https://github.com/omnia-projetcs/spark-dgx
1•nico248•6m ago•0 comments

What people don't get about safety at Anthropic

https://twitter.com/kevins8/status/2062969935379513431
1•kevinatac•7m ago•0 comments

How Elon Musk Killed Hundreds of Thousands of People

https://www.currentaffairs.org/news/how-elon-musk-killed-hundreds-of-thousands-of-people
3•tastyface•7m ago•0 comments

S&P 500 rejects SpaceX, also blocking entry for OpenAI and Anthropic

https://arstechnica.com/tech-policy/2026/06/sp-500-blocks-fast-spacex-entry-wont-waive-rule-for-u...
1•AndrewDucker•11m ago•0 comments

How to Stop Shipping Low-Quality RL Environments (With Examples)

https://www.latent.space/p/bad-envs
1•swyx•12m ago•0 comments

Show HN: Busbar – every LLM behind one URL, in a single Rust binary

https://github.com/MattJackson/busbarAI
1•mattjackson86•13m ago•0 comments

UK orders Google to allow publishers to opt out of AI scraping

https://apnews.com/article/google-britain-ai-competition-regulation-ce2016a4519fbe234799e009bac8f120
3•1vuio0pswjnm7•15m ago•0 comments

The effects of foods on LDL cholesterol levels

https://www.sciencedirect.com/science/article/pii/S0939475321000028
1•brandonb•16m ago•0 comments

I Put ChatGPT Browser Inside My Terminal [video]

https://www.youtube.com/watch?v=YErIWOPytuc
1•tomerbd•17m ago•0 comments

The Wrath of the Killdozer (2009)

https://www.damninteresting.com/the-wrath-of-the-killdozer/
1•bookofjoe•18m ago•0 comments

Data Centers Have a New Adversary: Tigers and Leopards at a Zoo

https://www.bloomberg.com/news/articles/2026-06-05/data-centers-have-a-new-adversary-tigers-and-l...
1•1vuio0pswjnm7•19m ago•0 comments

Amazon Employees Show Up to City Council Meetings, Demand Limits on Data Centers

https://www.wired.com/story/amazon-employees-publicly-demand-regulations-on-data-centers/
5•1vuio0pswjnm7•21m ago•0 comments

We Built Plainform and What It Means for Your Next Project

https://plainform.dev
1•eradon•21m ago•0 comments

Transformers Are Inherently Succinct

https://openreview.net/pdf?id=Yxz92UuPLQ
3•brandonb•21m ago•3 comments

Jax Back Ends and Devices

https://www.gilesthomas.com/2026/06/jax-backends-and-devices
1•gpjt•22m ago•0 comments

Tech sovereignty package to strengthen Europe's digital autonomy and resilience

https://ec.europa.eu/commission/presscorner/home/en
3•andrewstetsenko•22m ago•0 comments

Show HN: SupXML, modern memory-safe XML parser replacement for libxml2

https://supso.org/projects/sup-xml/docs
1•jrpt•23m ago•0 comments

Against an Increasingly User-Hostile Web (2017)

https://neustadt.fr/essays/against-a-user-hostile-web/
3•arunc•27m ago•0 comments

Pasteur, a zero-knowledge pastebin as an unikernel in OCaml

https://github.com/dinosaure/pasteur
2•dinosaure•30m ago•0 comments

Employees aren't resisting AI – they're resisting fear

https://www.fastcompany.com/91541703/employees-arent-resisting-ai-theyre-resisting-fear-ai-employ...
1•berlianta•31m ago•0 comments

OpenClaw Got Safer in Public

https://openclaw.ai/blog/openclaw-security-in-public
1•cryptoking1106•32m ago•0 comments

Digital Dead Man's Switch for Your Files

https://trustbourne.com/
1•BerislavLopac•32m ago•0 comments

What is my IP address?

https://ip.hny.io
1•astrochicken•34m ago•0 comments

Show HN: Lazarus, a coding agent for long-horizon tasks

https://github.com/ExpressGradient/lazarus
1•Sai_Praneeth•34m ago•0 comments

Are Memories Transferable – Or Edible?

https://www.quantamagazine.org/are-memories-transferable-or-edible-20260605/
2•kiwicopple•35m ago•0 comments