The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised by the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps, and surprisingly, they still perform about as well as, and sometimes even better than, those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
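For anyone wondering what a checkable reasoning path from A* search might look like, here is a rough sketch. It is not the paper's code: the grid encoding, the `("create" | "close", cell, cost)` trace format, and the names `astar_with_trace` and `trace_is_valid` are all assumptions made up for illustration. The point is only that a trace from a known solver can be validated separately from the final answer.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """A* on a 4-connected grid ('#' marks a wall); returns (path, trace).

    The trace format is hypothetical: one tuple per search operation.
    """
    def h(cell):
        # Manhattan distance, admissible and consistent for unit-cost moves.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    rows, cols = len(grid), len(grid[0])
    frontier = [(h(start), 0, start)]  # entries are (f, g, cell)
    best_cost = {start: 0}
    parent = {start: None}
    closed = set()
    trace = []

    while frontier:
        _, g, cell = heapq.heappop(frontier)
        if cell in closed:
            continue  # stale queue entry
        closed.add(cell)
        trace.append(("close", cell, g))
        if cell == goal:
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1], trace
        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if not (0 <= nr < rows and 0 <= nc < cols) or grid[nr][nc] == "#":
                continue
            ng = g + 1
            if ng < best_cost.get(nxt, float("inf")):
                best_cost[nxt] = ng
                parent[nxt] = cell
                heapq.heappush(frontier, (ng + h(nxt), ng, nxt))
                trace.append(("create", nxt, ng))
    return None, trace


def trace_is_valid(trace, start):
    """A cell may only be closed after it was created (start is implicit)."""
    created = {start}
    for op, cell, _ in trace:
        if op == "create":
            created.add(cell)
        elif cell not in created:
            return False
    return True


grid = ["...",
        ".#.",
        "..."]
path, trace = astar_with_trace(grid, (0, 0), (2, 2))
scrambled = list(reversed(trace))  # same tokens, order no longer meaningful
```

Here `trace_is_valid(trace, (0, 0))` holds while `trace_is_valid(scrambled, (0, 0))` does not, even though both accompany the same correct `path`. That is the kind of mismatch between answer correctness and trace correctness the summary above describes.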
