The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps, and surprisingly, they still perform about the same as, and sometimes even better than, those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
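
For anyone wondering what "reasoning steps based on A* search" could look like as training data, here is a minimal sketch. It assumes a small 4-connected grid maze with unit step costs and a Manhattan-distance heuristic. The `create`/`close` trace entries and the `mismatch_traces` helper are illustrative assumptions, not the paper's actual data format or corruption procedure.

```python
import heapq


def astar_with_trace(grid, start, goal):
    """Run A* on a grid (1 = wall, 0 = free); return (path, trace).

    The trace is a list of (op, cell, g, h) tuples, a hypothetical
    stand-in for the "intermediate tokens" a model could be trained on.
    """
    def h(cell):
        # Manhattan distance: admissible and consistent for unit-cost moves.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    trace = [("create", start, 0, h(start))]
    frontier = [(h(start), 0, start)]  # entries are (f, g, cell)
    parent = {start: None}
    best_g = {start: 0}
    closed = set()

    while frontier:
        _, g, cell = heapq.heappop(frontier)
        if cell in closed:
            continue  # stale queue entry for an already expanded cell
        closed.add(cell)
        trace.append(("close", cell, g, h(cell)))
        if cell == goal:
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1], trace
        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue
            if grid[nr][nc] == 1:
                continue
            ng = g + 1
            if ng < best_g.get(nxt, float("inf")):
                best_g[nxt] = ng
                parent[nxt] = cell
                heapq.heappush(frontier, (ng + h(nxt), ng, nxt))
                trace.append(("create", nxt, ng, h(nxt)))
    return None, trace  # goal unreachable


def mismatch_traces(examples):
    """Pair each (problem, trace, answer) with the trace of the next problem.

    A hypothetical way to build "incorrect reasoning" training data:
    the answers stay correct, but the traces no longer match the problems.
    """
    n = len(examples)
    return [(p, examples[(i + 1) % n][1], a) for i, (p, _, a) in enumerate(examples)]
```

A correct-trace training example would pair a maze with the `trace` and `path` that `astar_with_trace` returns for it. The corrupted condition keeps the same maze and path but swaps in a trace from a different maze.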
