frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•11mo ago

Comments

tocs3•11mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Wine 11.8 Improves VBScript Compatibility Fixes Microsoft Golf 1999

https://www.phoronix.com/news/Wine-11.8-Released
1•Bender•21s ago•0 comments

Job Postings for Software Engineers Are Rapidly Rising

https://www.citadelsecurities.com/news-and-insights/2026-global-intelligence-crisis/
1•delichon•1m ago•0 comments

Why Footbinding Persisted in China for a Millennium (2015)

https://www.smithsonianmag.com/history/why-footbinding-persisted-china-millennium-180953971/
1•thomassmith65•8m ago•0 comments

Robot fighting league has new home in S.F. you can watch – and buy – 'humanoids'

https://www.sfchronicle.com/tech/article/buy-fighting-robots-sf-22231111.php
1•iancmceachern•8m ago•0 comments

OpenX32: Open Linux kernel for Behringer X32 mixer

https://github.com/OpenMixerProject/OpenX32
2•brudgers•10m ago•0 comments

We Built an Authorization Engine That the Fortune 100 Bet On

https://www.technology.org/2026/04/30/we-built-an-authorization-engine-that-the-fortune-100-bet-o...
1•mooreds•16m ago•0 comments

Trump issues executive order against Cuba

https://www.cbsnews.com/miami/news/trump-executive-order-tightening-u-s-sanctions-on-cuba/
2•mark336•20m ago•1 comments

Echon – A privacy-first Discord alternative built in Tauri/Rust

https://echon-voice.com
1•highest678•23m ago•0 comments

Apple Is Holding My Pictures Hostage Until I Accept Their New Terms of Service

https://probablydance.com/2026/05/01/apple-is-holding-my-pictures-hostage-until-i-accept-their-ne...
4•akkartik•31m ago•0 comments

Gut bacteria may play a role in the rise in colon cancer in young adults

https://www.npr.org/sections/shots-health-news/2025/04/25/g-s1-62623/colon-cancer-gut-health-micr...
2•littlexsparkee•37m ago•0 comments

There's a lot of hype about Chinese EVs, is any of it true?

https://arstechnica.com/cars/2026/05/theres-a-lot-of-hype-about-chinese-evs-is-any-of-it-true/
2•Bender•43m ago•0 comments

Show HN: Transparent Tor Proxy – Route all Linux system traffic through Tor

https://github.com/onyks-os/TransparentTorProxy
1•onyks•45m ago•0 comments

xAI Has Used OpenAI's Models to Train Its Own

https://www.wired.com/story/elon-musk-distill-openai-models-partly-xai/
1•bhouston•48m ago•0 comments

Show HN: MemHub, Turn Your GPT/Claude/Gemini History into LLM-Wiki Mindmap

https://github.com/XTraceAI/memhub-llm-wiki-guide
3•TristanX•51m ago•0 comments

Hobson vs. Hansen and the Decline of D.C. Schools

https://www.educationprogress.org/p/hobson-v-hansen-and-the-decline-of
1•barry-cotter•56m ago•1 comments

Finny – AI trading agent that runs in your terminal

https://www.finnyai.tech/
1•jaimin67•1h ago•0 comments

Kubernetes Secret Extraction via ArgoCD ServerSideDiff

https://github.com/argoproj/argo-cd/security/advisories/GHSA-3v3m-wc6v-x4x3
1•milkglass•1h ago•0 comments

TanStack Start: Full-Stack Framework Powered by TanStack Router for React+Solid

https://tanstack.com/start/latest
1•doener•1h ago•0 comments

TanStack Router: modern router designed for type safety, data‑driven navigation

https://github.com/TanStack/router
1•doener•1h ago•0 comments

Show HN: Hackamaps – A global hackathon map I build after hitting Lovable Limits

1•ostenjap•1h ago•0 comments

So, About That AI Bubble

https://www.theatlantic.com/economy/2026/05/ai-bubble-revenue-anthropic/687022/
2•JumpCrisscross•1h ago•1 comments

Vite: Next Generation Front End Tooling

https://github.com/vitejs/vite
1•doener•1h ago•0 comments

Agsist – free real-time grain market dashboard for US producers

https://agsist.com/
1•dnilgis•1h ago•0 comments

The Half of CI We Forgot to Automate

https://blog.sebastiansastre.co/posts/the-half-of-ci-we-forgot-to-automate/
1•sebastianconcpt•1h ago•2 comments

GitHub Enshittification

https://pablotron.org/2026/04/30/github-enshittification/
2•birdculture•1h ago•0 comments

Russian forces defeated in Saharan stronghold after wave of attacks

https://www.ft.com/content/e40ba9d0-7430-4545-8c26-c99cde5caf33
4•JumpCrisscross•1h ago•1 comments

Good developers learn to program. Most courses teach a language

https://evilgeniuslabs.ca/blog/good-developers-learn-to-program-not-a-language
9•andsoitis•1h ago•2 comments

Probability Sampling

https://www150.statcan.gc.ca/n1/edu/power-pouvoir/ch13/prob/5214899-eng.htm
1•firasd•1h ago•0 comments

History of Visual Basic (Chapter 1)

https://evilgeniuslabs.ca/books/visual-basic-history
2•andsoitis•1h ago•0 comments

A Physics Engine with Incremental Rollback for Multiplayer Games

https://easel.games/blog/2026-rollback-physics
1•BSTRhino•1h ago•1 comments