The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•9mo ago

Comments

tocs3•9mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
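
To make the setup in that summary more concrete, here is a minimal sketch of what "checking both the final answers and the reasoning steps" could look like for A* search. This is an illustration under my own assumptions, not the paper's code: the grid format, the `create`/`close` trace events, and the function names `astar_with_trace`, `plan_is_valid`, and `trace_is_valid` are all hypothetical, and the trace check is a simplified consistency test rather than the authors' validator.

```python
import heapq

def is_free(grid, cell):
    """True if cell is inside the grid and not a wall ('#')."""
    r, c = cell
    return 0 <= r < len(grid) and 0 <= c < len(grid[0]) and grid[r][c] != "#"

def adjacent(a, b):
    """True if two cells are one 4-connected step apart."""
    return abs(a[0] - b[0]) + abs(a[1] - b[1]) == 1

def astar_with_trace(grid, start, goal):
    """A* on a 4-connected grid with unit step costs.

    Returns (path, trace). The trace is a list of ("create" | "close", cell)
    events, standing in for the intermediate tokens (hypothetical format).
    """
    def h(p):  # Manhattan distance heuristic
        return abs(p[0] - goal[0]) + abs(p[1] - goal[1])

    open_heap = [(h(start), 0, start)]
    g_cost, parent, closed = {start: 0}, {start: None}, set()
    trace = [("create", start)]
    while open_heap:
        _, g, node = heapq.heappop(open_heap)
        if node in closed:
            continue  # stale heap entry
        closed.add(node)
        trace.append(("close", node))
        if node == goal:
            path = []
            while node is not None:
                path.append(node)
                node = parent[node]
            return path[::-1], trace
        r, c = node
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if not is_free(grid, nxt) or nxt in closed:
                continue
            if g + 1 < g_cost.get(nxt, float("inf")):
                g_cost[nxt] = g + 1
                parent[nxt] = node
                heapq.heappush(open_heap, (g + 1 + h(nxt), g + 1, nxt))
                trace.append(("create", nxt))
    return None, trace

def plan_is_valid(grid, start, goal, path):
    """Check the final answer only: a wall-free, step-by-step path."""
    if not path or path[0] != start or path[-1] != goal:
        return False
    if not all(is_free(grid, cell) for cell in path):
        return False
    return all(adjacent(a, b) for a, b in zip(path, path[1:]))

def trace_is_valid(grid, trace):
    """Check the intermediate steps only (simplified): a cell can be closed
    only after it was created, and every create after the first must be
    next to the most recently closed cell."""
    created, closed, last_closed = set(), set(), None
    for i, (op, node) in enumerate(trace):
        if not is_free(grid, node):
            return False
        if op == "create":
            if i > 0 and (last_closed is None or not adjacent(node, last_closed)):
                return False
            created.add(node)
        elif op == "close":
            if node not in created or node in closed:
                return False
            closed.add(node)
            last_closed = node
        else:
            return False
    return True

grid = ["..#", "...", "#.."]
path, trace = astar_with_trace(grid, (0, 0), (2, 2))
scrambled = trace[::-1]  # same events, meaningless order
```

The point of keeping the two checks separate is that they can disagree: `plan_is_valid` passes for `path` whether it is paired with `trace` or with `scrambled`, while `trace_is_valid` accepts only the former. As I read the summary, that gap between a correct answer and a correct-looking trace is what the paper measures.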

What the Defense Production Act Can and Can't Do to Anthropic

https://www.lawfaremedia.org/article/what-the-defense-production-act-can-and-can%27t-do-to-anthropic
1•verdverm•3m ago•0 comments

Show HN: CVJ-1 VJ Deck / VISUALZ audio-reactive VJ software

https://www.visualzstudio.com/vj-deck
1•madchops1•4m ago•0 comments

Will A.I. Take Away Our Basic Skills?

https://paperrobots.substack.com/p/will-ai-take-away-our-basic-skills
1•NomNew•6m ago•0 comments

Show HN: Free online audio translator that translates voice instantly

https://audioconvert.ai/audio-translator
1•Katherine603•6m ago•0 comments

Plugin to give Claude Code perception (screen, system audio and mic context)

https://twitter.com/ashu_trv/status/2026296815860203888/
1•ash-ishh•8m ago•0 comments

Show HN: Squidy – How I stopped losing AI agent context mid-project

https://rendernet.com.br/squidyrun/
1•marcfox182•12m ago•0 comments

Show HN: Easyemailfinder.com (5 Free Credits)

https://easyemailfinder.com
1•faalbane•16m ago•0 comments

The Internet Was Weeks Away from Disaster and No One Knew [video]

https://www.youtube.com/watch?v=aoag03mSuXQ
1•trinsic2•22m ago•1 comments

Tesla Lab – 20 computational experiments

https://github.com/consigcody94/tesla-lab
1•sentinelowl•23m ago•1 comments

Show HN: NovelStar – a functional novel writing suite in a single HTML file

https://github.com/pixeldude84/novelstar
1•pixeldude84•24m ago•0 comments

Claude Code Anywhere

https://happy.engineering
1•vismit2000•28m ago•0 comments

Detecting AI scammers and bringing back the control to humans

https://veritrue.ai/
1•cheroll•32m ago•2 comments

I hacked ChatGPT and Google's AI – and it only took 20 minutes

https://www.bbc.com/future/article/20260218-i-hacked-chatgpt-and-googles-ai-and-it-only-took-20-m...
3•leephillips•34m ago•1 comments

RSA-signed prompt envelopes for OpenClaw agents

https://github.com/Mediocr3Mik3/open-claw-spa
1•Mediocr3Mik3•36m ago•1 comments

Connectors: Discord, Notion, and Slack Now Wired into Every Debate

https://www.askverdict.ai/updates/connectors-notion-discord-slack
1•thegdsks•36m ago•0 comments

A Computational Perspective on NeuroAI and Synthetic Biological Intelligence

https://arxiv.org/abs/2509.23896
1•andsoitis•37m ago•0 comments

A faithful, native Windows Notepad clone built in Zig using raw Win32 APIs

https://github.com/leebase/lfznotepad
1•garbagepatch•37m ago•1 comments

Optimism Engine – The first AI engine with a deterministic Safety Layer

https://optimism-engine.vercel.app/
1•sucharithan•37m ago•1 comments

Worried Europeans can now cut Azure's phone cord completely

https://www.theregister.com/2026/02/25/microsoft_azure_local/
2•abdelhousni•39m ago•0 comments

Show HN: Marcus – AI math tutor that guides you to answers instead of giving them

https://marcusmath.com
1•sbharadwaj•39m ago•2 comments

Show HN: I built a persistent LSM-Tree storage engine in Go from scratch

1•Jyotishmoy•40m ago•0 comments

Human brain cells playing Doom

https://www.youtube.com/watch?v=yRV8fSw6HaE
1•noosphr•41m ago•1 comments

Add repo line count to coverage drip emails

https://gitauto.ai/blog/what-are-dora-metrics
1•nishiohiroshi•43m ago•0 comments

I don't know how you get here from "predict the next word."

https://www.grumpy-economist.com/p/refine
3•qsi•44m ago•0 comments

A high-quality OSS graphical session manager and dashboard for pi.dev agent

https://dwsy.github.io/pi-session-manager/en/
1•sinenomine•45m ago•0 comments

Show HN: AI-assert – Constraint verification for LLM outputs (278 lines, Python)

https://github.com/kaantahti/ai-assert
1•kaantahti•49m ago•0 comments

US farmers are rejecting multimillion-dollar datacenter bids for their land

https://www.theguardian.com/technology/2026/feb/21/us-farmers-datacenters
6•carabiner•50m ago•2 comments

Show HN: Prince Cloud – Create PDFs with AI Agents

https://prince.cloud
2•mikeday•52m ago•0 comments

What I Saw Inside Apple's U.S. Chip Supply Chain

https://www.wsj.com/tech/what-i-saw-inside-apples-effort-to-rebuild-the-u-s-chip-supply-chain-28f...
5•Brajeshwar•52m ago•0 comments

Apple Needs to Copy Samsung's New Security Smartphone Screen ASAP

https://www.wsj.com/tech/personal-tech/samsung-galaxy-s26-privacy-display-d5bce9ab
9•Brajeshwar•53m ago•4 comments