The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
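To make the setup in the summary concrete, here is a minimal sketch of A* search that records a step-by-step trace alongside its final plan, plus a checker that validates the plan without looking at the trace. The grid encoding, the `("create", ...)` / `("close", ...)` trace tuples, and the function names are assumptions made for illustration; they are not taken from the paper.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """Run A* on a 4-connected grid and return (path, trace).

    grid is a list of strings where '#' marks a wall. The trace format
    is hypothetical and only meant to illustrate "intermediate steps".
    """
    def h(cell):
        # Manhattan distance: admissible on a 4-connected unit-cost grid.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    frontier = [(h(start), 0, start)]  # entries are (f, g, cell)
    parent = {start: None}
    cost = {start: 0}
    closed = set()
    trace = []
    while frontier:
        _, g, cell = heapq.heappop(frontier)
        if cell in closed:
            continue  # stale heap entry
        closed.add(cell)
        trace.append(("close", cell, g))
        if cell == goal:
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1], trace
        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue
            if grid[nr][nc] == "#":
                continue
            if nxt not in cost or g + 1 < cost[nxt]:
                cost[nxt] = g + 1
                parent[nxt] = cell
                heapq.heappush(frontier, (g + 1 + h(nxt), g + 1, nxt))
                trace.append(("create", nxt, g + 1))
    return None, trace


def plan_is_valid(grid, start, goal, path):
    """Check the final answer on its own, ignoring the trace entirely."""
    if not path or path[0] != start or path[-1] != goal:
        return False
    for (r1, c1), (r2, c2) in zip(path, path[1:]):
        if not (0 <= r2 < len(grid) and 0 <= c2 < len(grid[0])):
            return False
        if abs(r1 - r2) + abs(c1 - c2) != 1 or grid[r2][c2] == "#":
            return False
    return True
```

Because `plan_is_valid` never reads the trace, a plan can pass this check even if the accompanying trace is corrupted or swapped for an unrelated one. That is the separation the summary relies on: the answer and the intermediate steps can be scored independently.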
