The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised by the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same as, and sometimes even better than, those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
