frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•9mo ago

Comments

tocs3•9mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

AfroTools – 368 free tools for all African countries

https://afrotools.com/
1•Ozaveshe•2m ago•0 comments

Show HN: Free library of 2k martial arts books – read in the browser

https://fightencyclopedia.com/library
1•acenji•4m ago•0 comments

The Power of Playtesting in the Classroom

https://landenlove.com/the-power-of-playtesting-in-the-classroom/
1•LandenLove•6m ago•0 comments

Drafting Earnout Agreements to Minimize Disputes After Sale of Private Companies

https://natlawreview.com/article/earnout-burnout-drafting-earnout-agreements-minimize-disputes-fo...
1•petethomas•10m ago•0 comments

Zenclora OS

https://zenclora.org/
1•debo_•12m ago•0 comments

SEC Prepares Proposal to Eliminate Quarterly Reporting Requirement

https://www.wsj.com/finance/regulation/sec-prepares-proposal-to-eliminate-quarterly-reporting-req...
2•jonbaer•18m ago•0 comments

The third agent is me [video]

https://www.youtube.com/watch?v=HbWu_eYIHKQ
1•BeenSolo•19m ago•0 comments

Monkey Island for Commodore 64 Ground Up

https://pixeldust.se/monkey-island-project
1•aresant•22m ago•0 comments

Reverse engineering a no-name Chinese smartwatch BLE protocol (Jieli chipset)

https://github.com/TruthGh0st/C30-20-Pro-BLE-Reverse-Engineering
1•Truth_Gh0st•28m ago•1 comments

French Bees Are Making M&M-Contaminated Blue and Green Honey (2012)

https://www.smithsonianmag.com/smart-news/french-bees-are-making-mm-contaminated-blue-and-green-h...
2•thunderbong•32m ago•0 comments

The Billionaire Backlash Against a Philanthropic Dream

https://www.nytimes.com/2026/03/15/business/the-billionaire-backlash-against-a-philanthropic-drea...
1•627467•33m ago•0 comments

Boot ROM Security on Silicon Macs (M1/M2/M3)

https://oliviagallucci.com/boot-rom-security-on-silicon-macs-m1-m2-m3/
1•0xkato•35m ago•0 comments

The Building Blocks of Agentic AI

https://ai.meta.com/blog/introducing-pytorch-native-agentic-stack/?_fb_noscript=1
1•werinly•35m ago•0 comments

Cloud Appreciation Society

https://cloudappreciationsociety.org/
1•striking•36m ago•0 comments

Jepsen: MariaDB Galera Cluster 12.1.2

https://jepsen.io/analyses/mariadb-galera-cluster-12.1.2
14•aphyr•36m ago•0 comments

Apollo Lunar Module FDAI Restoration [video]

https://www.youtube.com/watch?v=PZy12ccXQm0
2•twalichiewicz•42m ago•1 comments

Judge blocks US Government from slimming down vaccine recommendations

https://apnews.com/article/kennedy-acip-vaccines-cdc-fc758951019f41d2f5e81e4e2faa22d3
6•petethomas•47m ago•0 comments

Once: Easy self-hosting for Docker-based web apps

https://github.com/basecamp/once
1•lwhsiao•51m ago•0 comments

Tech entrepreneur used AI to help create a cancer vaccine to treat his dog

https://fortune.com/2026/03/15/australian-tech-entrepreneur-ai-cancer-vaccine-dog-rosie-unsw-mr
3•zaikunzhang•51m ago•0 comments

Apache Doris Up to 34× Faster Than ClickHouse for Real-Time Updates

https://www.velodb.io/blog/apache-doris-34x-faster-clickhouse-realtime-updates
3•xiaoqiangnk•51m ago•0 comments

App Store for GitHub Releases!

https://github-store.org/
1•rainxchzed•52m ago•0 comments

I told AI to pick the NCAA brackets for me

https://www.aincaabrackets.com/
1•mattmerrick•53m ago•0 comments

The Functional Programming Hiring Problem

https://blog.janissary.xyz/posts/hiring-functional-programming
2•RustSupremacist•55m ago•0 comments

Monetize AI Agents and APIs with Lightning L402 (HTTP 402)

https://github.com/Mike-io-hash/satsgate
1•Mike-io•55m ago•1 comments

Open-artisan: OpenCode plugin for structured AI workflow orchestration

https://github.com/yehudacohen/open-artisan/
1•ManWith2Plans•56m ago•0 comments

Local-first IP protection system

https://sovereign-ip-protection.com
1•wyngdn•59m ago•0 comments

Every layer of review makes you 10x slower

https://apenwarr.ca/log/20260316
3•greyface-•1h ago•0 comments

Indieweb Business Models

https://indieweb.org/business-models
1•colinprince•1h ago•0 comments

Quo vadis, humanitas?

https://www.vatican.va/roman_curia/congregations/cfaith/cti_documents/rc_cti_doc_20260304_quo-vad...
3•michaelsbradley•1h ago•1 comments

OpenAI to Cut Back on Side Projects in Push to 'Nail' Core Business

https://www.wsj.com/tech/ai/openai-chatgpt-side-projects-16b3a825
2•JumpCrisscross•1h ago•1 comments