The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•11mo ago

Comments

tocs3•11mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised by the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
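To make the setup in the summary concrete, here is a minimal sketch of how an A* run can yield both a step-by-step trace and a final plan, and how a "wrong trace, right answer" training example could be assembled. It is not the paper's code: the grid encoding, the `create`/`close` trace format, and the names `astar_with_trace`, `plan_is_valid`, and `make_example` are all assumptions made for illustration.

```python
import heapq

def astar_with_trace(grid, start, goal):
    """Run A* on a 4-connected grid ('#' marks a wall).

    Returns (trace, plan). The trace is a list of (op, cell, g, h) tuples,
    a hypothetical stand-in for the "intermediate tokens" in the summary.
    """
    def h(cell):
        # Manhattan distance: admissible and consistent on a unit-cost grid.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    trace = [("create", start, 0, h(start))]
    frontier = [(h(start), 0, start)]
    best_g = {start: 0}
    parent = {start: None}
    closed = set()
    while frontier:
        _, g, cell = heapq.heappop(frontier)
        if cell in closed:
            continue  # stale heap entry
        closed.add(cell)
        trace.append(("close", cell, g, h(cell)))
        if cell == goal:
            plan = []
            while cell is not None:
                plan.append(cell)
                cell = parent[cell]
            return trace, plan[::-1]
        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue
            if grid[nr][nc] == "#":
                continue
            ng = g + 1
            if ng < best_g.get(nxt, float("inf")):
                best_g[nxt] = ng
                parent[nxt] = cell
                heapq.heappush(frontier, (ng + h(nxt), ng, nxt))
                trace.append(("create", nxt, ng, h(nxt)))
    return trace, None

def plan_is_valid(grid, start, goal, plan):
    """Check the final answer on its own, independently of any trace."""
    if not plan or plan[0] != start or plan[-1] != goal:
        return False
    for (r1, c1), (r2, c2) in zip(plan, plan[1:]):
        if abs(r1 - r2) + abs(c1 - c2) != 1 or grid[r2][c2] == "#":
            return False
    return True

def make_example(grid, trace, plan):
    """Bundle a problem, a trace, and an answer into one training record."""
    return {"prompt": grid, "trace": trace, "answer": plan}

# Two small illustrative problems (made up for this sketch).
grid_a = ["..#", "...", "#.."]
grid_b = ["...", ".#.", "..."]
start, goal = (0, 0), (2, 2)

trace_a, plan_a = astar_with_trace(grid_a, start, goal)
trace_b, plan_b = astar_with_trace(grid_b, start, goal)

# "Correct" example: trace and answer come from the same problem.
clean = make_example(grid_a, trace_a, plan_a)
# "Corrupted" example: the answer is still right for grid_a,
# but the trace was generated for an unrelated problem.
swapped = make_example(grid_a, trace_b, plan_a)
```

Because the plan can be validated separately from the trace, a setup like this allows the two to be scored independently, which is the kind of comparison the summary describes.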
