frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•10mo ago

Comments

tocs3•10mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

1•DarrylLinington•10s ago

Can anyone confirm if this is HappyHorse?

https://www.happyhorse-ai.app/
1•watree•1m ago•0 comments

Feds Try Secret Grand Jury to Unmask Reddit ICE Critic

https://theintercept.com/2026/04/10/reddit-ice-protest-grand-jury/
2•cdrnsf•1m ago•0 comments

Old habits die hard: Microsoft tries to limit our options, this time with AI

https://blog.mozilla.org/en/mozilla/ai/microsoft-copilot-ai-user-choice/
2•01-_-•3m ago•0 comments

FBI used iPhone notification data to retrieve deleted Signal messages

https://9to5mac.com/2026/04/09/fbi-used-iphone-notification-data-to-retrieve-deleted-signal-messa...
3•01-_-•4m ago•0 comments

One of the biggest game makers reveals 'uncomfortable truth' about AI

https://www.thegamebusiness.com/p/one-of-the-worlds-biggest-game-makers
2•dude250711•5m ago•0 comments

Why Recycling Nuclear Waste Isn't a Silver Bullet [video]

https://www.youtube.com/watch?v=FWzI72snmE0
1•leonidasrup•6m ago•0 comments

Did the Selective Service Harvest Names from a 'Free Ice Cream' List? (1998)

https://www.snopes.com/fact-check/ice-cream-registration-notice/
1•thunderbong•9m ago•0 comments

Stop Registration Spam with Identity Pre-Verification

https://fusionauth.io/blog/identity-pre-verification
1•mooreds•11m ago•0 comments

Build the thing you wish to see in the world (2025)

https://brittanyellich.com/build-the-thing/
1•mooreds•12m ago•0 comments

Microsoft suspends dev accounts for high-profile open source projects

https://www.bleepingcomputer.com/news/microsoft/microsoft-suspends-dev-accounts-for-high-profile-...
12•N19PEDL2•12m ago•1 comments

DeepInverse: PyTorch Library for Inverse Problems

https://github.com/deepinv/deepinv
1•cl3misch•13m ago•0 comments

A name is succession, legacy and celebration in Japan's Kabuki theater

https://apnews.com/article/kabuki-name-succession-japan-tradition-theater-d1e9621bc91385498314f36...
1•mooreds•13m ago•0 comments

Raon-SpeechChat-9B: A Full-Duplex Speech Language Model

https://huggingface.co/KRAFTON/Raon-SpeechChat-9B
1•BrutalCoding•15m ago•0 comments

I replaced Apple Migration Assistant with a local 122B LLM

https://github.com/genedeng-ca/ai-mac-migration
2•genedeng•15m ago•0 comments

Less Net Work for Networks

https://n0.computer/
1•janandonly•20m ago•0 comments

Is it just me, or Opus 4.6 is sounding bit dumb lately

3•rambrrest•23m ago•1 comments

OmniSearch: Fast Windows file search built with Tauri, Rust, and C++

https://github.com/Eul45/omni-search
2•eyuel_engida•23m ago•0 comments

SKHynix Announces "Strategic Investment" in European Semiconductor Startup

https://www.hpcwire.com/off-the-wire/semidynamics-secures-sk-hynix-investment-to-advance-memory-c...
1•Bluextend•26m ago•0 comments

Amazon Is Pulling Support for Kindles from 2012 or Earlier. What to Do Now

https://www.cnet.com/tech/computing/amazon-pulls-the-plug-on-older-kindle-models/
3•bookmtn•26m ago•0 comments

We put all 4 Gemma 4 models in one Telegram bot. Try it and see how we built it

https://seqpu.com/UseGemma4In60Seconds/
1•fredmendoza•28m ago•1 comments

Essay explainging OpenAI's safety collapse [video]

https://www.youtube.com/watch?v=bta18wTOr_k
1•vincentkriek•28m ago•0 comments

Wild chimpanzees waging 'civil war' with coordinated attacks between groups

https://www.theguardian.com/environment/2026/apr/09/civil-war-chimpanzee-group-closer-to-human-co...
2•cebert•29m ago•0 comments

Show HN: A native Compose Desktop host for macOS

https://github.com/letmutex/compose-native-host
1•letmutex•30m ago•0 comments

Python Is Dead

https://calebfenton.substack.com/p/python-is-dead
6•calebfenton•32m ago•3 comments

Show HN: Creating OCD versions of a yearly calendar

https://calendar-architect.pages.dev
1•szemy2•34m ago•1 comments

PostgreSQL REST API from SQL Scripts

https://npgsqlrest.github.io/blog/sql-rest-api.html
1•vbilopav•38m ago•0 comments

FIFA World Cup's best shirts are 30 years old

https://www.cnn.com/2026/04/10/style/fifa-world-cup-best-shirts
1•Tomte•40m ago•0 comments

Macs crash after 49 days of uptime?

https://sixcolors.com/link/2026/04/macs-crash-after-49-days-of-uptime/
3•jmsflknr•40m ago•0 comments

C++23 Support in MSVC Build Tools 14.51

https://devblogs.microsoft.com/cppblog/c23-support-in-msvc-build-tools-14-51/
1•pjmlp•41m ago•0 comments