frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•9mo ago

Comments

tocs3•9mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Show HN: SalaryScript – The FAANG Negotiation Playbook

1•corefiredrill•25s ago•0 comments

GPLv2 and Installation Requirements

https://lwn.net/Articles/1052842/
1•beckford•1m ago•0 comments

Brain file format for AI agents – one file, any LLM, sub-millisecond queries

1•morshola•1m ago•0 comments

Google's Lyria 3: make 30-second audio tracks using text or images (in beta)

https://blog.google/innovation-and-ai/products/gemini-app/lyria-3/
1•tummler•2m ago•0 comments

Spec driven development – new workflows and spec types

https://kiro.dev/blog/specs-bugfix-and-design-first/
1•t2f2•4m ago•0 comments

Ask HN: Do you build your own X?

2•henryhale•6m ago•0 comments

Trump has prepared speech on extraterrestrial life, Lara Trump says

https://thehill.com/homenews/administration/5744218-trump-holds-alien-speech/
2•zzzeek•8m ago•0 comments

Querying OSM objects by their shapes

https://www.openstreetmap.org/user/rphyrin/diary/408263
1•altilunium•8m ago•0 comments

The History of Sushi

https://en.wikipedia.org/wiki/History_of_sushi
1•BiraIgnacio•9m ago•1 comments

The Worst-Case Future for White-Collar Workers

https://www.theatlantic.com/ideas/2026/02/ai-white-collar-jobs/686031/
2•petethomas•11m ago•1 comments

Do the people building Claude understand what they've created?

https://www.npr.org/2026/02/18/nx-s1-5717561/do-the-people-building-the-ai-chatbot-claude-underst...
2•geox•12m ago•0 comments

Show HN: What We See. An AI generated art exhibition

https://www.whatwesee.space/
1•sarreph•14m ago•0 comments

Model collapse – how LLMs become worse when trained on their own output

https://www.ibm.com/think/topics/model-collapse
2•daymos•14m ago•0 comments

Conversations with an AI That Argues Back

https://luisfernandoyt.makestudio.app/blog/878-conversations-with-ai
1•lout332•15m ago•0 comments

Zuckerberg testimony: Company consulted stakeholders about beauty filters

https://www.cnbc.com/2026/02/18/meta-mark-zuckerberg-social-media-safety-trial.html
1•samaysharma•16m ago•0 comments

The Only "Good" Cloud: Is a Google Cloud

https://blog.dijit.sh/gcp-the-only-good-cloud/
2•dijit•16m ago•0 comments

I made $15K/month at 13. Built a YC startup at 20. Still looking for my person

3•HNMaxHN•17m ago•1 comments

Hacking conference Def Con bans three people linked to Epstein

https://techcrunch.com/2026/02/18/hacking-conference-def-con-bans-three-people-linked-to-epstein/
3•donutshop•18m ago•0 comments

S3lite – A SQLite-like database engine with S3-compatible storage back end

https://github.com/sjcotto/s3lite
2•sjcotto•30m ago•0 comments

A Thick-Skulled Troodontid Theropod from the Late Cretaceous of Mexico

https://www.mdpi.com/1424-2818/18/1/38
1•PaulHoule•30m ago•0 comments

Cloud and AWS cost consultant Duckbill expands to software, raises $7.75M

https://www.geekwire.com/2026/cloud-and-aws-cost-consultant-duckbill-expands-to-software-raises-7...
2•mooreds•32m ago•0 comments

DBML: DSL for easily creating ER diagrams

https://dbml.dbdiagram.io/home/
1•todsacerdoti•32m ago•0 comments

How AI is affecting productivity and jobs in Europe

https://cepr.org/voxeu/columns/how-ai-affecting-productivity-and-jobs-europe
2•pseudolus•33m ago•0 comments

8086 Agentic AI Assembler Tool

https://github.com/cookertron/agent86
1•cookertron•34m ago•0 comments

Apollo Seeks to Reassure Clients About Rowan's Epstein Ties

https://www.bloomberg.com/news/articles/2026-02-18/apollo-seeks-to-reassure-clients-about-executi...
2•petethomas•40m ago•1 comments

China Is Killing the Fish

https://www.noahpinion.blog/p/china-is-killing-the-fish
2•paulpauper•40m ago•0 comments

Gemini JiTOR Jailbreak: Unredacted Methodology

https://recursion.wtf/posts/jitor_unredacted/
1•tomjakubowski•40m ago•0 comments

Dwarkesh Patel's 2026 Podcast with Elon Musk and Other Recent Elon Musk

https://thezvi.substack.com/p/on-dwarkesh-patels-2026-podcast-with-850
1•paulpauper•41m ago•0 comments

Things you should never do (Part 1)

https://www.joelonsoftware.com/2000/04/06/things-you-should-never-do-part-i/
3•nedwin•43m ago•3 comments

Chief: Delightfully Simple Agentic Loops

https://www.geocod.io/code-and-coordinates/2026-02-18-introducing-chief/
1•mooreds•44m ago•0 comments