frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•7mo ago

Comments

tocs3•7mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Apple Excludes iOS 18.7.3 from Compatible Devices

https://old.reddit.com/r/apple/comments/1psbpzs/apple_excludes_ios_1873_from_compatible_devices/
1•brie22•3m ago•0 comments

ICE sending immigrants from continental U.S. to Hawaii, and no one knows why

https://www.hawaiipublicradio.org/local-news/2025-12-18/ice-has-been-sending-immigrants-from-the-...
2•felineflock•4m ago•0 comments

JetBrains abandons Fleet, pins hopes on forthcoming Air agentic development tool

https://devclass.com/2025/12/09/jetbrains-abandons-fleet-ide-pins-hopes-on-forthcoming-air-agenti...
1•doppp•8m ago•0 comments

'He Was Poisoned.' Toxic Fumes on Planes Blamed for Deaths of Pilots and Crew

https://www.wsj.com/business/airlines/toxic-fumes-airplane-pilot-crew-death-739fa3bb
1•appreciatorBus•9m ago•0 comments

NIST NTP clock crisis averted for now

https://groups.google.com/a/list.nist.gov/g/internet-time-service/c/OHOO_1OYjLY
1•geerlingguy•12m ago•0 comments

'Slightly haunted but manageable': new signs cause confusion in Christchurch

https://www.theguardian.com/world/2025/dec/22/new-zealand-christchurch-spoof-absurdist-road-signs
1•n1b0m•14m ago•0 comments

Show HN: Mac app to keep windows always on top

https://alwaysontop.app
1•kamranahmedse•16m ago•0 comments

Electronic Commerce: The Future of Fraud (1998)

https://www.schneier.com/crypto-gram/archives/1998/1115.html
1•101008•19m ago•0 comments

(Science) Frontiers 2025

https://frontier2025.netlify.app/
1•anjel•22m ago•0 comments

Make Change Cheap

https://johnocens.com/soothfare/makechangecheap
2•wonderbar•26m ago•0 comments

Where should outbound request decision logic live in application code?

1•siva_CEO•27m ago•0 comments

Show HN: Eze – AI startup roadmap co‑pilot (Day 2 update)

https://eze.lovable.app/
1•foolmarshal•34m ago•0 comments

Show HN: Sentence Starters – Phrases for academic and professional writing

https://sentencestarters.net
2•superhuang•38m ago•2 comments

The Coalition for Content Provenance and Authenticity

https://c2pa.org/
1•mooreds•39m ago•0 comments

The Pointe Shoe Makers of Hackney

https://spitalfieldslife.com/2018/01/25/the-pointe-shoe-makers-of-hackney-x/
1•zeech•39m ago•1 comments

Beautiful Rails confirmation dialogs (with zero JavaScript)

https://boringrails.com/articles/data-turbo-confirm-beautiful-dialog/
2•mooreds•41m ago•0 comments

Can India catch up with the US, Taiwan and China in the global chip race?

https://www.aljazeera.com/economy/2025/12/18/can-india-catch-up-with-the-us-taiwan-and-china-in-t...
1•mooreds•42m ago•0 comments

How to Optimize Your Usage: The Best AI Models to Use

https://forum.cursor.com/t/how-to-optimize-your-usage-the-best-ai-models-to-use-version-3-0/145657
1•behnamoh•43m ago•0 comments

Light Years Ahead – The 1969 Apollo Guidance Computer [video]

https://www.youtube.com/watch?v=B1J2RMorJXM
1•nill0•45m ago•0 comments

One of Elon Musk's Old Enemies Joins the Race to Run GM

https://www.wsj.com/business/autos/sterling-anderson-gm-ceo-0c493061
2•JumpCrisscross•45m ago•0 comments

OntoMesh – A Structural Map of a Bounded Meta-Architecture

https://zenodo.org/records/18012519
2•nettalk83•45m ago•1 comments

Disney's Living Characters: A Broken Promise [video]

https://www.youtube.com/watch?v=NyIgV84fudM
1•brson•45m ago•0 comments

AI Datacenters in Space [video]

https://www.youtube.com/watch?v=DCto6UkBJoI
1•xqcgrek2•59m ago•0 comments

To sign or not to sign: Practical vulnerabilities in GPG and friends

https://fahrplan.events.ccc.de/congress/2025/fahrplan/event/to-sign-or-not-to-sign-practical-vuln...
1•RGBCube•1h ago•0 comments

Identity Theft in AI Conference Peer Review

https://cacm.acm.org/opinion/identity-theft-in-ai-conference-peer-review/
1•pcfwik•1h ago•0 comments

Build Android apps using Rust and iced

https://github.com/ibaryshnikov/android-iced-example
2•rekireki•1h ago•1 comments

Show HN: I built a zero-config CLI to stop leaked secrets locally

https://www.accord-os.com/
1•mrsoltan•1h ago•0 comments

Power and Greed Ruined Adobe

https://www.youtube.com/watch?v=OtYNmANylro
2•fallinditch•1h ago•0 comments

Amazon Is Filled with AI Book Slop

https://www.rollingstone.com/culture/culture-features/amazon-ai-book-knockoffs-1235450690/
10•chrsw•1h ago•2 comments

Generic Compression Benchmark

https://www.mattmahoney.net/dc/uiq/
2•optimalsolver•1h ago•0 comments