frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•7mo ago

Comments

tocs3•7mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

TON Vanity: 286,000x faster vanity addresses

https://gusarich.com/blog/ton-vanity/
1•Gusarich•24s ago•0 comments

Observing growth of metallic crystals inside liquid metal solvents

https://www.nature.com/articles/s41467-025-66249-y
1•PaulHoule•1m ago•0 comments

WebGPU in P5.js

https://www.davepagurek.com/blog/p5-webgpu/
1•todsacerdoti•3m ago•0 comments

Pushing K8s Env Config from Terraform to GitHub Actions

https://drornir.dev/blog/github-actions-dynamic-envs/
1•drorn•3m ago•0 comments

Nettool: Bash utility for network diagnostics, interface information

https://github.com/geduard0098/Nettool
1•thunderbong•5m ago•0 comments

Making Hard Decisions

https://thetortoiseandhare.substack.com/p/on-making-hard-decisions
1•kevinslin•5m ago•0 comments

Show HN: Blohem – Social media without the social. Built with Next.js on Azure

https://blohem.misya.me
1•mekod•7m ago•1 comments

iOS: Apps retain info after being deleted

2•WorldDev•9m ago•0 comments

Reading is a vice: US student reading abilities and habits are declining

https://www.msn.com/en-us/news/us/reading-is-a-vice/ar-AA1Tsp7w
3•smurda•9m ago•0 comments

Where Does Cloudflare Think I Am?

https://wheredoescloudflarethinkiam.com/
1•tomlemon•9m ago•0 comments

Year of Reading

https://kg.dev/thoughts/year-of-reading
2•kashnote•10m ago•0 comments

Show HN: Orange Music – An AI Music Generator for Your Private Music Space

https://oaimusicgen.com
1•jokera•10m ago•0 comments

Booze Elroy

https://pinback.itch.io/booze-elroy
4•IceCreamJonsey•14m ago•0 comments

Ask HN: How do Ops teams use ChatGPT across many internal tools?

2•stosssik•16m ago•0 comments

Fred Espenak Jr. (January 19, 1952 – June 1, 2025)

https://en.wikipedia.org/wiki/Fred_Espenak
3•zeristor•17m ago•1 comments

Portabase: Agent-Based Database Operations Platform (Backup/Restoration)

2•rambokdev•18m ago•0 comments

Modelling a Spring System in Hamiltonian Mechanics

https://ritog.github.io/posts/implicit_euler/
2•__rito__•19m ago•0 comments

Show HN: Fluxer – open-source Discord-like chat

https://fluxer.app
2•hampus•20m ago•0 comments

Mobile Development in the Age of AI

https://www.jpsim.com/mobile-development-in-the-age-of-ai/
2•jpsim•20m ago•0 comments

Linux Addressing Out-of-Memory Killer Inaccuracy on Large Core Count Systems

https://www.phoronix.com/news/Linux-Inaccuracy-OOM-High-CPUs
3•speckx•20m ago•0 comments

Why I am Starting a Blog in 2026

https://www.zias.be/blog/why-i-am-starting-a-blog-in-2026
3•ziasvannes•21m ago•1 comments

Flock Exposes Its AI-Enabled Surveillance Cameras

https://www.schneier.com/blog/archives/2026/01/flock-exposes-its-ai-enabled-surveillance-cameras....
1•walterbell•22m ago•0 comments

Trump Almost Has a Point About the Federal Reserve

https://www.theatlantic.com/economy/2026/01/federal-reserve-independence-lending/685444/
3•JumpCrisscross•23m ago•0 comments

The cost function of an "AI CEO"

https://carette.xyz/posts/automated_ceo/
3•LucidLynx•24m ago•0 comments

Publish (On Your) Own Site, Syndicate Elsewhere

https://indieweb.org/POSSE#
28•47thpresident•32m ago•4 comments

Everyone's Watching Stocks. The Real Bubble Is AI Debt

https://www.bloomberg.com/news/newsletters/2025-12-31/everyone-s-watching-stocks-the-real-bubble-...
4•zerosizedweasle•34m ago•2 comments

Show HN: Square Face Generator – A Flash-free tool to generate avatars

https://squarefacegenerator.work/en
3•lion__93332•37m ago•0 comments

Black bear living under house shows no sign of budging; owner mulls legal action

https://www.cbsnews.com/news/bear-under-house-altadena-california/
3•zzzeek•37m ago•0 comments

Afham – Arabic dialect translator iOS app

https://apps.apple.com/us/app/afham/id6755209468
2•argam•38m ago•0 comments

Xbox's 2025 was pure chaos

https://www.polygon.com/microsoft-gaming-2025-xbox-series-x-year-in-review/
4•ViktorRay•40m ago•0 comments