The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•10mo ago

Comments

tocs3•10mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps, and surprisingly, they still perform about the same as, and sometimes even better than, those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
