The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•1y ago

Comments

tocs3•1y ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised by the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
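
For anyone wondering what an A* "reasoning trace" could look like in practice, here is a minimal Python sketch. The trace format (`create`/`close` events carrying a cost and a heuristic value), the names `astar_with_trace` and `shuffled_trace`, and the toy grid are all assumptions made for illustration and are not taken from the paper. Shuffling is only one simple stand-in for "random and incorrect" steps.

```python
import heapq
import random


def astar_with_trace(grid, start, goal):
    """Run A* on a 4-connected grid (0 = free, 1 = wall).

    Returns (path, trace). The trace is a list of hypothetical
    ("create" | "close", cell, g, h) events, one per search step.
    """
    def h(cell):
        # Manhattan distance: admissible and consistent for unit-cost grid moves.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    rows, cols = len(grid), len(grid[0])
    best_g = {start: 0}
    parent = {start: None}
    frontier = [(h(start), 0, start)]
    trace = [("create", start, 0, h(start))]
    closed = set()

    while frontier:
        _, g, cell = heapq.heappop(frontier)
        if cell in closed:
            continue  # stale queue entry for an already expanded cell
        closed.add(cell)
        trace.append(("close", cell, g, h(cell)))

        if cell == goal:
            # Walk parent links back to the start to recover the plan.
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1], trace

        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if not (0 <= nr < rows and 0 <= nc < cols) or grid[nr][nc] == 1:
                continue
            ng = g + 1
            if ng < best_g.get(nxt, float("inf")):
                best_g[nxt] = ng
                parent[nxt] = cell
                heapq.heappush(frontier, (ng + h(nxt), ng, nxt))
                trace.append(("create", nxt, ng, h(nxt)))

    return None, trace  # goal unreachable


def shuffled_trace(trace, seed=0):
    """Return a copy of the trace with its steps in random order (illustrative only)."""
    steps = list(trace)
    random.Random(seed).shuffle(steps)
    return steps


GRID = [
    [0, 0, 0],
    [1, 1, 0],
    [0, 0, 0],
]
path, trace = astar_with_trace(GRID, (0, 0), (2, 0))
noisy = shuffled_trace(trace)
```

A training example in this kind of setup would pair the problem with the trace and then the final `path`. Swapping `trace` for `noisy` keeps the answer the same while making the intermediate steps meaningless, which is roughly the comparison the summary above describes.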
