The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•11mo ago

Comments

tocs3•11mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised by the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
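For anyone wondering what a "reasoning path based on A* search" could look like as training data, here is a minimal sketch. It assumes a small 4-connected grid maze with unit step costs and a Manhattan-distance heuristic. The function name `astar_with_trace` and the `("open", cell)` / `("close", cell)` trace format are made up for illustration and are not taken from the paper.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """Run A* on a rectangular grid and record every search step.

    grid  -- list of equal-length strings; '#' marks a wall (assumed format)
    start -- (row, col) tuple
    goal  -- (row, col) tuple
    Returns (path, trace). path is None if the goal is unreachable.
    The trace is a hypothetical stand-in for "intermediate tokens":
    ("open", cell) when a cell is pushed, ("close", cell) when expanded.
    """
    def h(cell):
        # Manhattan distance: admissible and consistent for unit-cost grids.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    frontier = [(h(start), 0, start)]  # entries are (f, g, cell)
    came_from = {start: None}
    best_g = {start: 0}
    closed = set()
    trace = [("open", start)]

    while frontier:
        _, g, cur = heapq.heappop(frontier)
        if cur in closed:
            continue  # stale queue entry for an already expanded cell
        closed.add(cur)
        trace.append(("close", cur))

        if cur == goal:
            # Walk the parent links back to the start to rebuild the path.
            path = []
            while cur is not None:
                path.append(cur)
                cur = came_from[cur]
            return path[::-1], trace

        r, c = cur
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue
            if grid[nr][nc] == "#":
                continue
            new_g = g + 1
            if new_g < best_g.get((nr, nc), float("inf")):
                best_g[(nr, nc)] = new_g
                came_from[(nr, nc)] = cur
                heapq.heappush(frontier, (new_g + h((nr, nc)), new_g, (nr, nc)))
                trace.append(("open", (nr, nc)))

    return None, trace
```

In this framing, the path plays the role of the final answer and the trace plays the role of the step-by-step "thoughts". Because the trace comes from a known algorithm, it can be checked for correctness independently of the answer, which is the kind of comparison the summary describes.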
