frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•7mo ago

Comments

tocs3•7mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Title: A simple dinner meeting led to a sophisticated iOS eKYC bypass

https://medium.com/@ryu360i/potential-for-iphone-ekyc-face-id-hacking-how-passcode-shouldering-se...
1•ryuzaburo•47s ago•1 comments

India Issues Final Warning to Apple in Ongoing Antitrust Case

https://www.macobserver.com/news/india-issues-final-warning-to-apple-in-ongoing-antitrust-case/
1•Brajeshwar•1m ago•0 comments

The Executive Assistant Paradox: Why AI Makes This Role Critical, Not Obsolete

https://vleech.substack.com/p/the-executive-assistant-paradox-why
1•connor11528•3m ago•0 comments

100's of hours of GTM research in seconds

https://dev.dashboard.chainfuse.ai/team/01981bfb-b6fa-78a8-937e-822018bddac0/dataspace/019a6f6c-1...
2•sushidata•4m ago•1 comments

Show HN: The viral speed read at 900wpm app

https://wordblip.com
1•Gillinghammer•4m ago•0 comments

Learning Latent Action World Models in the Wild

https://arxiv.org/abs/2601.05230
1•saswatms•5m ago•0 comments

Antigravity down for Ultra plan accounts

https://discuss.ai.google.dev/t/antigravity-broken-getting-only-agent-execution-terminated-due-to...
1•fmnxl•6m ago•1 comments

European Sovereign Cloud

https://www.chrisfarris.com/
1•weinzierl•11m ago•0 comments

OpenAI Codex with Ollama

https://ollama.com/blog/codex
1•meetpateltech•11m ago•0 comments

OpenAI Used Kenyan Workers on Less Than $2 per Hour to Make ChatGPT Less Toxic

https://time.com/6247678/openai-chatgpt-kenya-workers/
2•pabs3•13m ago•0 comments

I tricked my partner into caring about finances

https://www.indiehackers.com/post/how-i-tricked-my-partner-into-caring-about-finances-dff051a4cb
1•abbster52•21m ago•0 comments

Simulation: Jupiter holds 1.5 times more oxygen than the sun

https://phys.org/news/2026-01-jupiter-hidden-depths-simulation-planet.html
1•wglb•22m ago•1 comments

Behind Trump vs. Powell Is a Battle over US Empire's Future

https://jacobin.com/2026/01/trump-powell-fed-europe-dollars
2•kaycebasques•27m ago•0 comments

It Can Apply and Positive in Favor the Newton III Law on an Engine System Device

1•monterrey•29m ago•0 comments

State Ofthe Art Novel InFlow 1Gearturbine/Reaction 2Imploturbocompressor/Impulse

1•monterrey•32m ago•0 comments

San Francisco to offer free childcare to people making up to $230000

https://www.theguardian.com/us-news/2026/jan/15/san-francisco-childcare-families
6•darth_avocado•33m ago•2 comments

Podcasting Could Use a Good Asteroid

https://www.joanwestenberg.com/podcasting-could-use-a-good-asteroid/
2•zdw•34m ago•0 comments

Ask HN: What are Claude's skills/what skills does Claude possess?

1•Obscurity4340•36m ago•0 comments

Glyphhanger – Your web font utility belt

https://www.zachleat.com/web/glyphhanger/
1•doodlesdev•38m ago•0 comments

The Myth of the ThinkPad

https://innovintageblog.wordpress.com/2026/01/08/the-myth-of-the-thinkpad/
2•volemo•39m ago•2 comments

Jeff Bezos Needs to Speak Up

https://www.theatlantic.com/ideas/2026/01/raid-washington-post/685621/
3•JumpCrisscross•41m ago•2 comments

Ericsson Silent Layoffs in the US

3•allabouttech•45m ago•1 comments

Trump Moves to Make Tech Giants Pay for Surging Power Costs

https://www.bloomberg.com/news/articles/2026-01-15/trump-to-direct-key-us-grid-operator-to-hold-e...
3•jmcdonald-ut•45m ago•1 comments

America's Throwaway Spies: How the CIA Failed Iranian Informants in Tehran

https://www.reuters.com/investigates/special-report/usa-spies-iran/
4•koolhead17•46m ago•0 comments

Mark Carney and Xi Jinping meet to mend ties as Donald Trump disrupts globe

https://www.ft.com/content/9eeff245-2081-4f97-bc8e-6bbdaf59074e
3•KnuthIsGod•49m ago•0 comments

Fontello – Combine icon webfonts for your own project

https://github.com/fontello/fontello
1•doodlesdev•49m ago•0 comments

Is there any way we can help Stack Overflow Website get back up?

https://stackoverflow.com/questions/79867766/is-there-any-way-we-can-help-stack-overflow-website-...
1•nomilk•49m ago•0 comments

AI as a Compression Problem

https://dkg.fifthhorseman.net/blog/2025-ai-and-compression.html
1•pabs3•50m ago•0 comments

PanoptiCity – interactive map reveals the scale of mass surveillance worldwide

https://panopticity.fr/
2•pabs3•51m ago•0 comments

How Safe Is the Rust Ecosystem? A Deep Dive into Crates.io

https://mr-leshiy-blog.web.app/blog/crates_io_analysis/
1•RustSupremacist•56m ago•0 comments