frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•7mo ago

Comments

tocs3•7mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

How do you map runtime text back to source code without source maps?

1•ryan_rudd•1m ago•0 comments

Best Converting Page Has the Worst SEO Metrics

https://growtika.com/blog/best-converting-page-worst-seo
1•Growtika•1m ago•0 comments

Ask HN: 20 months post-Seed, no Series A, safe to assume <3 months runway?

1•jennalk•4m ago•0 comments

To Save a Keyboard

https://newsletter.shifthappens.site/archive/to-save-a-keyboard-pt-3/
1•admp•14m ago•0 comments

People Not People

https://robinrendle.com/notes/people-not-people/
2•tobr•16m ago•0 comments

Sylvester and Clifford on Curved Space

https://johncarlosbaez.wordpress.com/2026/01/10/sylvester-and-clifford-on-curved-space/
1•chmaynard•21m ago•0 comments

Extracting books from production language models

https://arxiv.org/abs/2601.02671
1•cleandreams•23m ago•2 comments

IRC Networks

https://netsplit.de/networks/
1•sans_souse•26m ago•0 comments

Gen Z, millennials more likely to cut down on screen time than older generations

https://nypost.com/2026/01/08/lifestyle/gen-z-millennials-are-more-likely-to-digitally-unplug-tha...
2•1vuio0pswjnm7•27m ago•0 comments

Ask HN: I built a local-first encrypted secrets manager – feedback?

1•shahnoor•28m ago•0 comments

PostgreSQL Recovery Internals

https://www.cybertec-postgresql.com/en/postgresql-recovery-internals/
1•0x54MUR41•29m ago•0 comments

Postgres Scan Types in Explain Plans

https://www.crunchydata.com/blog/postgres-scan-types-in-explain-plans
1•0x54MUR41•29m ago•0 comments

Agent-native Architectures – A Technical Guide

https://every.to/guides/agent-native
2•rocho•37m ago•0 comments

A red pixel in the snow: How AI solved the mystery of a missing mountaineer

https://www.bbc.com/future/article/20260108-how-ai-solved-the-mystery-of-a-missing-mountaineer
3•1659447091•38m ago•0 comments

Slowest Labor Market in Years Leaves Job Seekers Stuck

https://www.wsj.com/economy/jobs/job-market-cooling-labor-department-6d4204ed
2•JumpCrisscross•41m ago•0 comments

Sprites: Stateful Sandbox Environments (from fly.io)

https://sprites.dev/
2•jimmcslim•41m ago•1 comments

What's happening in the ocean's "dark zones" [video]

https://www.youtube.com/watch?v=2tuS1LLOcsI
2•dataflow•41m ago•0 comments

Show HN: Visual Email Builder – Mygs

https://mygs.int.yt/mail/
2•MopAmine•43m ago•0 comments

The Darkest Timeline of American Imperialism [video]

https://www.youtube.com/watch?v=QwnJx2g0Okw
3•WinDoctor•44m ago•1 comments

Show HN: SpeedyEDA – One-line exploratory data analysis

2•dawitworku•44m ago•0 comments

YC application page erroring out

1•h_samani•46m ago•0 comments

From GraphQL to Pydantic-Resolve: How I Improved Architecture of API Integration

https://github.com/allmonday/rapid-development-pattern/blob/master/why.en.md
1•tank-34•48m ago•0 comments

Nginx Visualizer

https://codercat.xyz/nginx-visualizer/
1•snayss•51m ago•1 comments

Digging into the LLM-as-a-Judge Results

https://www.gilesthomas.com/2026/01/llm-from-scratch-30-digging-into-llm-as-a-judge
1•ibobev•51m ago•0 comments

Type inference of all constructs in Elixir

https://elixir-lang.org/blog/2026/01/09/type-inference-of-all-and-next-15/
1•aeonfox•52m ago•0 comments

Understanding the Types of Data in Data

https://ischool.syracuse.edu/types-of-data/
2•mahirsaid•53m ago•0 comments

US oil giant ExxonMobil says Venezuela is 'uninvestable'

https://www.ft.com/content/4c21c031-443e-4834-a7a6-3dd59672b54e
5•petethomas•59m ago•1 comments

Landlords are using automated services to monitor tenant promotions

https://old.reddit.com/r/shitrentals/comments/1q38sh4/if_you_get_promoted_at_work_keep_it_a_secre...
4•xyzal•1h ago•0 comments

HackLikeMe – AI DevSecOps CLI with 6 specialized agents that think before acting

1•abrarnasirj•1h ago•0 comments

Feedly Is Down

https://x.com/i/trending/2009815486377214014
2•ksec•1h ago•1 comments