frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•6mo ago

Comments

tocs3•6mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Layers of Meaning

https://claude.ai/share/dee4a3eb-ca54-4d83-9450-b9c270102ba5
1•stoicfungi•21s ago•0 comments

Greenlandic women's victory in legal fight with Denmark over forced IUD scandal

https://www.theguardian.com/world/2025/dec/11/greenlandic-women-claim-victory-in-legal-fight-with...
1•binning•1m ago•0 comments

The Open Web Index

https://openwebindex.eu/
1•bilegeek•1m ago•0 comments

Rethinking Mastodon's post visibility UX

https://thatshubham.com/blog/mas
1•DorkyPup•1m ago•0 comments

Startups on hard mode: Oxide. Part 1: Hardware (2024)

https://newsletter.pragmaticengineer.com/p/oxide
1•tosh•2m ago•0 comments

Show HN: CodeProt – Filter PR nitpicks using AST and context-aware analysis

https://codeprot.com
1•allenz_cheung•3m ago•0 comments

Over 10k Docker Hub images found leaking credentials, auth keys

https://www.bleepingcomputer.com/news/security/over-10-000-docker-hub-images-found-leaking-creden...
1•pabs3•4m ago•0 comments

'The patriarchy runs deep': women still getting a raw deal in the workplace

https://www.theguardian.com/global-development/2025/dec/10/women-workplace-equality-gender-world-...
2•binning•4m ago•0 comments

Trump signs executive order blocking states from regulating AI

https://www.theguardian.com/us-news/2025/dec/11/trump-executive-order-artificial-intelligence
3•pera•7m ago•0 comments

How My Small Personal Blog Hit 100K – and the Posts That Made It Happen

https://www.michaelshoe.com/how-my-small-personal-blog-hit-100k-impressions-and-the-strange-posts...
1•michaelshoe•12m ago•0 comments

The Hallway of the Mountain King

https://medium.com/luminasticity/the-hallway-of-the-mountain-king-3b676c31ce9e
1•bryanrasmussen•12m ago•0 comments

Show HN: Jottings; Anti-social microblog for your thoughts

https://jottings.me/
4•vishalvshekkar•15m ago•0 comments

Show HN: Vibe CADing in the cloud with open source tools

https://foundry.siameseai.com/
1•mister_jmm•17m ago•0 comments

Cuba blames online news site 'elTOQUE' for the country's economic chaos

https://english.elpais.com/international/2025-12-02/cuba-blames-online-news-site-eltoque-for-the-...
2•PaulHoule•18m ago•0 comments

Critical Materials: A Strategic Analysis

https://twitter.com/ctindale/status/1997471488514134481
1•obiefernandez•19m ago•0 comments

Neural and molecular changes during placebo healing intervention

https://www.nature.com/articles/s42003-025-09088-3
2•bryanrasmussen•19m ago•1 comments

Show HN: VideoMaker AI – Turn text into professional videos in minutes

https://videomakerai.app/
1•thenextechtrade•23m ago•1 comments

Journalism students expose Russian-linked vessels off the Dutch and German coast

https://www.digitaldigging.org/p/they-droned-back
5•harshreality•23m ago•0 comments

Ask HN: How can I delete a Substack account in Australia?

2•freefrog334433•24m ago•1 comments

Roman occupation of Britain damaged the population's health

https://www.newscientist.com/article/2508181-roman-occupation-of-britain-damaged-the-populations-...
2•Brajeshwar•29m ago•0 comments

Divinity – Cinematic Announcement Trailer

https://www.youtube.com/watch?v=VxzyVeAG00w
1•doener•35m ago•0 comments

Deleting Substack account after Australia age laws

1•freefrog334433•40m ago•0 comments

Agentic coding tools should give more control over message queueing

https://solmaz.io/agentic-coding-tools-message-queueing
1•hosolmaz•40m ago•0 comments

Tumbleweeds inspire this rolling, resilient robot

https://www.popsci.com/technology/tumbleweed-robot-hermes/
2•Brajeshwar•41m ago•0 comments

Beyond Disagree and Commit

https://duncan.dev/post/beyond-disagree-and-commit
1•gpi•41m ago•0 comments

I Migrated an Oracle Schema to YugabyteDB

https://hexacluster.ai/blog/migrating-schema-from-oracle-to-yugabytedb-using-hexarocket
3•jones_david•42m ago•1 comments

Mini Brains Grown from Stem Cells Developed Light-Sensitive, Eye-Like Features

https://www.smithsonianmag.com/smart-news/mini-brains-grown-stem-cells-developed-eyes-can-sense-l...
3•thunderbong•45m ago•0 comments

Europe must be ready when the AI bubble bursts

https://www.ft.com/content/0308f405-19ba-4aa8-9df1-40032e5ddc4e
7•Brajeshwar•47m ago•2 comments

Guarding My Git Forge Against AI Scrapers

https://vulpinecitrus.info/blog/guarding-git-forge-ai-scrapers/
7•todsacerdoti•56m ago•1 comments

Let's Embed a Go Program into the Linux Kernel

https://sigma-star.at/blog/2023/07/embedded-go-prog/
2•birdculture•57m ago•0 comments