frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•11mo ago

Comments

tocs3•11mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

VBM – a VTT that feels like Saturday night at the game table

https://vbm.games/
1•burtonmiller•1m ago•0 comments

Endo: JavaScript plugin framework with built-in supply chain attack resistance

https://github.com/endojs/endo
1•ignoramous•3m ago•0 comments

In CRHQ, agents don't just reply with text. They ship live HTML artifacts

https://andrej.crhq.ai/artifact/Us0QJs48JGh3dK3uoEDQLw
1•taubek•4m ago•0 comments

Swift bricks to be installed on all new buildings in Scotland

https://www.theguardian.com/environment/2026/jan/28/swift-bricks-to-be-installed-in-all-new-build...
1•bookofjoe•8m ago•0 comments

React-AI-stream – back end-agnostic SSE streaming hook for React

https://github.com/trimooo/react-ai-stream
1•devleoo•19m ago•0 comments

Reducing TTFT by CPUMaxxing Tokenization

https://www.crusoe.ai/resources/blog/reducing-ttft-by-cpumaxxing-tokenization
1•intrepidsoldier•23m ago•0 comments

Kit that converts film to digital

https://www.digitalcameraworld.com/cameras/film-cameras/the-trending-kit-that-converts-film-to-di...
2•Alupis•24m ago•0 comments

Implementation Details of Codex /Goal

https://gist.github.com/patleeman/b1b5768393f9bf2f60865b1defeeb819
1•dnw•24m ago•0 comments

Nocturne Is the Latest Music Player for Gnome to Hit v1.0

https://www.phoronix.com/news/Nocturne-1.0-GNOME-Music
1•Bender•25m ago•0 comments

Canvas Breach Disrupts Schools and Colleges Nationwide

https://krebsonsecurity.com/2026/05/canvas-breach-disrupts-schools-colleges-nationwide/
2•Bender•26m ago•1 comments

Build native desktop and mobile apps with web UI and Zig

https://github.com/vercel-labs/zero-native
1•kindkang2024•29m ago•0 comments

The Las Vegas Sphere Looked Like a Disaster. It's Become a Huge Hit Instead.

https://www.wsj.com/business/media/sphere-vegas-dolan-disaster-hit-fa0e6b17
1•bookofjoe•30m ago•1 comments

Ask HN: Before Open Source took over the server, what was the discourse like?

1•mbgerring•31m ago•0 comments

Trump reportedly plans to fire FDA Commissioner Marty Makary

https://arstechnica.com/health/2026/05/trump-reportedly-plans-to-fire-fda-commissioner-marty-makary/
1•Bender•32m ago•0 comments

Show HN: Countries where you can leave your MacBook at a random coffee shop

https://vouchatlas.com
1•canergl•32m ago•0 comments

Koide Formula for the Down Quarks

https://doi.org/10.1016/j.physletb.2026.140510
1•arivero•34m ago•1 comments

I built an AI to stress-test my thinking

https://mindmilker.com
1•afxuh•40m ago•0 comments

Ryan Cohen, the rebel CEO who disdains corporate America

https://www.ft.com/content/c0023a3e-08ec-44e8-80a6-f9abb343c52e
2•petethomas•44m ago•0 comments

Data center drains 30M gals of water — until residents complained of pressure

https://www.politico.com/news/2026/05/08/georgia-data-centers-water-00909988
6•thehoff•45m ago•4 comments

Hasan Piker Media Tracker

https://hasanabi.m44.cl/
1•abdelhousni•47m ago•1 comments

ToolOps: One Decorator Away from Production-Ready AI Agents

https://github.com/hedimanai-pro/toolops
2•hedimanai•52m ago•1 comments

Instagram DMs Lose End-to-End Encryption Starting Today

https://www.macrumors.com/2026/05/08/instagram-end-to-end-encryption/
4•0in•52m ago•1 comments

After USDA request, Indiana plant biologist locked out of lab by school

https://www.science.org/content/article/after-usda-request-indiana-plant-biologist-locked-out-lab...
2•petethomas•58m ago•0 comments

macOS 26 adoption rate lower than prior macOS versions

https://forums.macrumors.com/threads/tahoe-adoption-rate.2474641/page-2
2•seam_carver•59m ago•0 comments

The Real Story of Troy

https://storica.club/blog/troy-was-real/
1•cemsakarya•1h ago•0 comments

McDonald's is taking away your fountain machine. Burger King not so much.

https://finance.yahoo.com/markets/article/mcdonalds-is-taking-away-your-fountain-machine-burger-k...
2•phyzix5761•1h ago•2 comments

Revival of Blackberry nostalgia and keyboard fuels smartphone startups

https://www.cnbc.com/2026/05/09/blackberry-nostalgia-keyboard-smartphone-comeback-startups-intent...
1•0in•1h ago•0 comments

User just tricked Grok and Bankrbot to send tokens with Morse code

https://www.cryptopolitan.com/user-tricked-grok-bankrbot-to-send-tokens/
10•wglb•1h ago•0 comments

Stale Gov.uk pages are feeding AI overviews old data and Brits are believing it

https://www.theregister.com/software/2026/04/23/govuk-says-ai-gaslighting-brits-with-stale-govuk-...
1•gnabgib•1h ago•0 comments

Upending assumptions about learning, inspired by an AI phenomenon

https://www.santafe.edu/news-center/news/upending-assumptions-about-learning-inspired-by-an-ai-ph...
1•hhs•1h ago•0 comments