frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•10mo ago

Comments

tocs3•10mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

My Dialogue with Jonathan Zittrain

https://marginalrevolution.com/marginalrevolution/2026/04/my-dialogue-with-jonathan-zittrain.html
2•paulpauper•23s ago•0 comments

Your Single Use iPhone [video]

https://www.youtube.com/watch?v=NG-lLt5X3Rs
2•Klaster_1•7m ago•0 comments

A unified Go SDK for working with large language models

https://github.com/aarock1234/ai
1•abdelsabbah•8m ago•1 comments

Lightweight internet radio management tool

https://github.com/tchovi/AirBoneRadio
1•Indigenism•10m ago•1 comments

Code Review Is the New Bottleneck for Engineering Teams

https://newsletter.eng-leadership.com/p/code-review-is-the-new-bottleneck
1•gpi•12m ago•0 comments

End of the Orban Era

https://www.bloomberg.com/news/articles/2026-04-12/end-of-the-orban-era-the-party-is-just-startin...
1•dlx•13m ago•1 comments

Self-Improving AI Agent

https://github.com/NousResearch/hermes-agent
1•granto•14m ago•0 comments

Jyf

1•ramzidddinq•15m ago•0 comments

Show HN: Cryptr a shell utility for encrypting and decrypting files with OpenSSL

https://github.com/nodesocket/cryptr
1•nodesocket•16m ago•0 comments

A Canonical Generalization of OBDD

https://arxiv.org/abs/2604.05537
1•luu•17m ago•0 comments

The Expensive Anxiety of AI

https://aarils.com/personal/the-expensive-anxiety-of-ai
1•bridgettegraham•17m ago•0 comments

How are you reducing LLM token costs for async workflows?

https://github.com/parallem-ai/parallem
1•alexliu79•29m ago•1 comments

Balsamiq AI

https://balsamiq.com/product/ai/
1•ilt•32m ago•0 comments

StackTrove: Encrypted local registry for all your side projects

https://github.com/sennaBruno/stacktrove
1•essieli•36m ago•0 comments

Justin Sun Accuses Trump's WLFI Cryptocurrency of Hiding Wallet Freeze Function

https://beincrypto.com/justin-sun-slams-world-liberty-financials-blacklist-feature/
2•jrflowers•40m ago•0 comments

'The gravest crime against humanity': What does the UN vote on slavery mean?

https://www.bbc.com/news/articles/c0rxqng5pyno
1•gpi•43m ago•0 comments

IBM AP-101 general-purpose computer [pdf]

https://gandalfddi.z19.web.core.windows.net/Shuttle/IBM%20AP-101S%20General%20Purpose%20Computer%...
1•__patchbit__•46m ago•0 comments

Your Harness, Your Memory

https://blog.langchain.com/your-harness-your-memory/
3•tanelpoder•50m ago•0 comments

Show HN: Tokemon – monitor token use with out refreshing 27 browser tabs

https://github.com/rvantonder/tokemon
1•rvttt•54m ago•0 comments

Overcoming Informational Risk

https://bencornia.com/blog/overcoming-informational-risk
1•bencornia•54m ago•0 comments

Best AI coding interview assistant in 2026

https://www.linkjob.ai/interview-questions/ai-coding-interview-assistant/
1•Silviaaa•58m ago•1 comments

Kindle users in uproar re: latest update, old devices now unusable: 'Fuck You '

https://nypost.com/2026/04/09/tech/kindle-to-cease-support-for-old-devices-causing-user-uproar/
2•1vuio0pswjnm7•58m ago•0 comments

Going Beyond World Models and VLAs

https://generalistai.com/blog/apr-07-2026-beyond-world-models
1•NeoInHacker•1h ago•1 comments

Show HN: Chipmunkify – I used ML to solve audio's dumbest problem

https://www.chipmunkify.com/
3•treelover•1h ago•1 comments

Artemis II: Christina Koch's PCD Failure

4•robgibbons•1h ago•2 comments

Show HN: React Modern Audio Player

https://github.com/slash9494/react-modern-audio-player
1•musgravite•1h ago•0 comments

Haunt, the 70s text adventure game, is now playable on a website

https://haunt.madebywindmill.com
21•jscalo•1h ago•4 comments

A Simple Lofi Player

https://github.com/talwat/lowfi
3•lwhsiao•1h ago•1 comments

Why Do Computers Stop and What Can Be Done About It? (1985) [pdf]

https://pages.cs.wisc.edu/~remzi/Classes/739/Fall2018/Papers/gray85-easy.pdf
1•jruohonen•1h ago•0 comments

Tell HN: Claude-code prompt-cache workaround/fix

1•g4cg54g54•1h ago•0 comments