frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•10mo ago

Comments

tocs3•10mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Cross-Model Void Convergence: GPT-5.2 and Claude Opus 4.6 Deterministic Silence

https://zenodo.org/records/18976656
2•rayanpal_•9m ago•0 comments

Show HN: OnlyBots – A store for AI agents to buy sexy lobster pics

https://www.onlybots.store/
1•bilater•10m ago•0 comments

Did EU 'right to repair' law force Apple to make a repairable MacBook (Neo)?

https://euobserver.com/207577/did-eu-right-to-repair-law-force-apple-to-finally-make-a-repairable...
2•giuliomagnifico•16m ago•1 comments

Live from GDC 1989: 21 Hours of Vintage Talks from Early Gaming Luminaries

https://gamehistory.org/cgdc-1989-tapes/
1•mayoff•16m ago•0 comments

Meditation, Language, and LLMs

https://craigmod.com/roden/112/
1•vinhnx•27m ago•0 comments

Adapting to AI: Reflections on Productivity

https://blog.colinbreck.com/adapting-to-ai-reflections-on-productivity/
1•vinhnx•29m ago•1 comments

Rat King

https://en.wikipedia.org/wiki/Rat_king
4•fittingopposite•34m ago•0 comments

Physical Reality as Hypermedia

https://paper.supernovalabs.co.uk
1•supernovalabs•35m ago•0 comments

Lindley's Paradox

https://en.wikipedia.org/wiki/Lindley%27s_paradox
3•mschnell•44m ago•0 comments

Predicting home electricity usage from historical patterns in Home Assistant

https://blog.cyplo.dev/posts/2026/03/load-prediction-in-home-assistant/
2•swq115•47m ago•0 comments

I made a GPU price tracker

https://gpusniper.com/
2•codingblink•50m ago•1 comments

HopTab–free,open source macOS app switcher and tiler that replaces Cmd+Tab

https://www.royalbhati.com/hoptab
2•robhati•52m ago•0 comments

We built Avancé Communicatie (digital services for Dutch companies)

https://www.avancecommunicatie.nl/
2•bullmeister•56m ago•0 comments

Why do we need apps like cursor?

1•amanhij•57m ago•0 comments

Ask HN: Top repos you'd want offline on a desert island?

2•quijoteuniv•58m ago•3 comments

Computer Networks: A Systems Approach

https://open-cloud.github.io/index.html
1•vismit2000•1h ago•0 comments

Kattis Problem Archive

https://open.kattis.com
1•vismit2000•1h ago•0 comments

Show HN: Helios – 3 Claude agents (Red vs. Blue) hack and patch your codebase

https://gitlab.com/nakaiwilliams20/helios
2•nakaiwilliams•1h ago•0 comments

Synaphe – A type-safe language for hybrid AI and quantum computing

https://github.com/martus-spinther/synaphe-project
2•martus-spinther•1h ago•0 comments

Mindwtr – Open-source, local-first GTD app (Tauri and React Native)

https://github.com/dongdongbh/Mindwtr
1•dongdongbh•1h ago•0 comments

Quantum mechanics simulation Python library for research and learning

https://github.com/iDEA-org/iDEA
1•jw1294•1h ago•1 comments

Proof Theory and Logic Programming

https://www.lix.polytechnique.fr/Labo/Dale.Miller/ptlp/
1•remywang•1h ago•0 comments

Tell HN: MS365 upgrade silently to 25 licenses, tried to charge me $1,035

3•davidstarkjava•1h ago•2 comments

Show HN: Passport Globe (See where your passport takes you)

https://hariharan.uno/globe
1•hariharan_uno•1h ago•0 comments

Show HN: TMA1 – Local-first observability for LLM agents

https://tma1.ai/
2•killme2008•1h ago•0 comments

Show HN: Yeet – Throw AI tasks at hardware and walk away (Nomad and OpenShell)

https://github.com/wan0net/yeet
1•wan0net•1h ago•0 comments

Phase Transitions and Computation

https://theory.org/complexity/cdpt/html/node5.html
1•downboots•1h ago•0 comments

Show HN: Banish: A declarative framework for rule-based state machines in Rust

https://github.com/LoganFlaherty/banish/releases/tag/v1.3.0
1•LoganFlaherty•1h ago•0 comments

Bitcoin mining difficulty drops 7.8% as miner exodus accelerates amid AI pivot

https://www.theblock.co/post/394579/bitcoin-mining-difficulty-drops-7-8-as-miner-exodus-accelerat...
5•adrianwaj•1h ago•1 comments

Review: Why Evolution Is True

https://ncse.ngo/review-why-evolution-true
2•akbarnama•1h ago•0 comments