frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•11mo ago

Comments

tocs3•11mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

SubQ: Sub-quadratic LLM built for 12M-token context

https://subq.ai/
1•gagan2020•1m ago•0 comments

GPU Spot Prices Surge 114% in Six Weeks

https://tomtunguz.com/b200-gpu-pricing-spot-market-model-releases/
1•gmays•1m ago•0 comments

Embodied AI with Claude, Raspberry Pi and Arduino

https://github.com/harshaneo17/agentic_arduino
1•harshaarya17•1m ago•0 comments

Ask HN: Is there a term for feeling sad about forced AI adoption?

2•ge96•3m ago•0 comments

Zuckerberg 'Personally Authorized and Encouraged' Meta's Copyright Infringement

https://variety.com/2026/digital/news/meta-ai-mark-zuckerberg-copyright-infringement-lawsuit-publ...
1•spankibalt•3m ago•0 comments

Clarification on the Notepad++ Trademark Issue

https://notepad-plus-plus.org/news/clarify-npp-trademark-infringement/
1•minimaxir•3m ago•0 comments

Apple Slammed in Calif. Federal Court over AirTag's Alleged Use in Stalking

https://www.law.com/therecorder/2026/05/04/apple-slammed-in-calif-federal-court-with-lawsuits-ove...
1•1vuio0pswjnm7•3m ago•0 comments

Ask HN: How are you structuring your .md docs to facilitate agentic development?

1•lepuski•4m ago•0 comments

"Big Boy" Power for Every User

https://github.com/Mexor-dev/Gator
1•Mexor•6m ago•0 comments

Charts = Tables

https://ia.net/topics/charts-and-tables
1•surprisetalk•8m ago•0 comments

Ask HN: Should I continue this project ? (Being able to change AI harness)

https://github.com/charles-azam/OmniAgents
1•couAUIA•9m ago•0 comments

Show HN: AgentSearch-Self-hosted search API for AI agents and optional Tor stack

https://github.com/brcrusoe72/agent-search
1•bricrusoe•9m ago•0 comments

GitHub incident May 5, 2026

https://www.githubstatus.com/incidents/8kn8t67gdy36
1•krvajal•10m ago•0 comments

ServiceNow just unveiled an AI workforce that can run your company

https://fortune.com/2026/05/05/servicenow-knowledge-2026-autonomous-workforce-microsoft-nvidia-ai...
1•ryan_j_naughton•11m ago•0 comments

U.S. ramps up frontier AI testing as White House pivots toward safety

https://www.axios.com/2026/05/05/us-frontier-ai-testing-white-house-pivots-safety
1•gmays•12m ago•0 comments

GitHub Action Runner Alternatives

https://binhong.me/blog/github-action-runner-alternatives/
1•8organicbits•12m ago•0 comments

Apple to let users choose rival AI models across iOS 27 features

https://www.reuters.com/technology/apple-let-users-choose-rival-ai-models-across-ios-27-features-...
1•thm•12m ago•0 comments

Should You Be Token-Maxxing?

https://speedrun.substack.com/p/should-you-be-token-maxxing
1•7777777phil•14m ago•0 comments

Ask HN: How do you pilot a service company full of AI agents?

2•louismalingrey•14m ago•0 comments

GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents

https://arxiv.org/abs/2604.26752
13•gmays•15m ago•2 comments

Moving to mainframe can be cheaper than sticking with VMware

https://www.theregister.com/2026/05/04/gartner_state_of_mainframes/
1•johnbarron•15m ago•0 comments

AI, your way: introducing the Poolside Platform

https://poolside.ai/blog/introducing-the-poolside-platform
1•icey•17m ago•0 comments

10 years helping Rails devs reach App Store. Today, someone shipped without me

https://masilotti.com/shipped-without-me/
1•joemasilotti•18m ago•0 comments

Richard Dawkins concludes AI is conscious, even if it doesn't know it

https://www.theguardian.com/technology/2026/may/05/richard-dawkins-ai-consciousness-anthropic-cla...
3•alefalfa•19m ago•5 comments

Oil 101, Second Edition

https://oil101.morgandowney.com
2•mxschumacher•20m ago•0 comments

An Open Letter to Jay Bhattacharya

https://www.science.org/content/blog-post/open-letter-jay-bhattacharya
3•jeromechoo•21m ago•0 comments

Show HN: I built a spoiler-free WWE dashboard for 2001-2019 with 15,000 matches

https://warner-wvez.github.io/wrestling-dashboard/
2•wvez22•21m ago•0 comments

PostHog Code

https://posthog.com/code
4•bewal416•22m ago•0 comments

Nostr Mail – Nostr Mail Documentation

https://nogringo.github.io/nostr-mail/#what-is-nostr-mail
5•janandonly•23m ago•0 comments

Spaces Protocol May 2026 Update

https://spacesprotocol.org/blog/may-2026-update/
1•ca98am79•23m ago•0 comments