The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•7mo ago

Comments

tocs3•7mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
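
For anyone unfamiliar with the setup, here is a minimal sketch of what "checking the reasoning steps" against A* search could look like. It assumes a small grid maze and a made-up trace format of ("create" / "close", node, cost so far, heuristic) tuples; the function names, the mazes, and the trace format are illustrative assumptions, not the paper's actual data or code. The idea is only that a trace produced for one problem can be checked mechanically, and that a trace taken from an unrelated problem fails that check even when the final path is still correct.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """Run A* on a 4-connected grid (0 = free cell, 1 = wall).

    Returns (trace, path). The trace logs a "create" entry when a node is
    pushed to the frontier and a "close" entry when it is expanded.
    This trace format is a hypothetical one chosen for illustration.
    """
    def h(p):
        # Manhattan distance: admissible and consistent for unit-cost moves.
        return abs(p[0] - goal[0]) + abs(p[1] - goal[1])

    rows, cols = len(grid), len(grid[0])
    trace = [("create", start, 0, h(start))]
    frontier = [(h(start), 0, start)]
    parent = {start: None}
    best_g = {start: 0}
    closed = set()

    while frontier:
        _, g, node = heapq.heappop(frontier)
        if node in closed:
            continue  # stale frontier entry
        closed.add(node)
        trace.append(("close", node, g, h(node)))
        if node == goal:
            path = []
            while node is not None:
                path.append(node)
                node = parent[node]
            return trace, path[::-1]
        r, c = node
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if 0 <= nr < rows and 0 <= nc < cols and grid[nr][nc] == 0:
                ng = g + 1
                if ng < best_g.get((nr, nc), float("inf")):
                    best_g[(nr, nc)] = ng
                    parent[(nr, nc)] = node
                    heapq.heappush(frontier, (ng + h((nr, nc)), ng, (nr, nc)))
                    trace.append(("create", (nr, nc), ng, h((nr, nc))))
    return trace, None


def trace_is_valid(trace, grid, start):
    """Check a trace against a maze: it must begin at the start node,
    touch only free cells, and close only nodes that were created earlier."""
    if not trace or trace[0][1] != start:
        return False
    created = set()
    for op, node, _g, _h in trace:
        r, c = node
        in_bounds = 0 <= r < len(grid) and 0 <= c < len(grid[0])
        if not in_bounds or grid[r][c] == 1:
            return False
        if op == "create":
            created.add(node)
        elif node not in created:
            return False
    return True


# Two hypothetical mazes sharing the same start cell.
maze_a = [[0, 0, 0],
          [1, 1, 0],
          [0, 0, 0]]
maze_b = [[0, 1, 0],
          [0, 1, 0],
          [0, 0, 0]]

trace_a, path_a = astar_with_trace(maze_a, (0, 0), (2, 0))
trace_b, path_b = astar_with_trace(maze_b, (0, 0), (0, 2))

print(trace_is_valid(trace_a, maze_a, (0, 0)))  # trace matches its own maze
print(trace_is_valid(trace_b, maze_a, (0, 0)))  # trace swapped in from maze_b
```

Pairing `path_a` with `trace_b` gives a training example of the "correct answer, unrelated reasoning" kind the summary describes: the answer still solves `maze_a`, but the trace does not describe a search on it.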
