frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•9mo ago

Comments

tocs3•9mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Benchmarking 5 concurrent map implementations in Go (incl. sync.Map)

https://github.com/puzpuzpuz/go-concurrent-map-bench
1•puzpuzpuz-hn•34s ago•1 comments

Show HN: TinyPAN – Zero-allocation Bluetooth tethering for microcontrollers

https://github.com/Akhil-Chaturvedi/TinyPAN
1•akhilchaturvedi•52s ago•0 comments

Show HN: CLI Image Generation Agent Using (OpenRouter and Free Models)

1•Kiran7893•1m ago•0 comments

Show HN: Eliezer – Tiny (~7K LOC) Self-Hosted AI Agent (PWA, Self-Editing)

https://www.eliezer.app/
1•dvictor•1m ago•0 comments

Why Do LLMs Hallucinate?

https://sicheng.dev/writing/why-does-LLMs-hallucinate
1•sichengo•5m ago•0 comments

Catherine of Braganza, the Queen Who Brought Tea to England

https://www.thecollector.com/catherine-braganza-queen-tea-england/
1•Tomte•6m ago•0 comments

Lichess Puzzle Timer: A browser extension to help you do chess puzzles slower

https://catswhisker.xyz/log/2026/2/21/lichess_puzzle_timer/
1•cristoperb•8m ago•1 comments

USB-Only Fans on Linux: Control Path Lian Li SL Wireless, Corsair Commander Core

https://twitter.com/its_aksh_/status/2025249992815018313
1•alexzeitler•8m ago•0 comments

Months in a Day with Claude Code: Immich on Cloudflare Workers

https://gpeake.com/blog/immich
1•gepeake•9m ago•0 comments

IPv6 Address Assignment

https://lpar.ATH0.com/posts/2026/02/ipv6-address-assignment/
2•todsacerdoti•14m ago•0 comments

Take Off

https://benn.substack.com/p/take-off
1•MindGods•18m ago•0 comments

Show HN: Late – A subagent orchestrator TUI for local LLMs (Go/Linux)

https://github.com/mlhher/late
1•mhher•21m ago•1 comments

My Life as a GitLab instance: How I use GitLab to manage almost everything

https://www.iduoad.com/posts/life-as-gitlab/
1•iduoad•24m ago•1 comments

The Reason Robotics DevOps Is Failing to Scale

1•ajime•24m ago•0 comments

Grandson of Reese's PB Cup inventor accuses Hershey of replacing ingredients

https://www.cbsnews.com/news/hershey-reeses-peanut-butter-cup-ingredients-grandson-brad-reese/
1•randycupertino•24m ago•0 comments

The Easiest Price Drop Alert Engine -No Signup. No Browser Extensions. No Apps

https://www.pricedropnotifications.com/
2•HNCATCH•24m ago•0 comments

JSON library might be your most expensive dependency

https://kmaliszewski9.github.io/scala/2026/02/20/jsoniter.html
1•kmaliszewski•24m ago•0 comments

The Flawed Paper Behind Trump's $100k H-1B Fee

https://eig.org/the-flawed-paper-behind-trumps-100000-h-1b-fee/
3•johntfella•25m ago•0 comments

EFF's Policy on LLM-Assisted Contributions to Our Open-Source Projects

https://www.eff.org/deeplinks/2026/02/effs-policy-llm-assisted-contributions-our-open-source-proj...
2•leephillips•25m ago•0 comments

New Android App: Weel – GPS and Dashcam

https://play.google.com/store/apps/details?id=live.weel&hl=en_US
1•OczyCzarne•27m ago•0 comments

Do You Back into a Parking Spot or Back Out?

https://www.nytimes.com/2026/02/21/style/parking-backing-in-headfirst.html
2•bookofjoe•28m ago•2 comments

The Nekonomicon – Nekochan.net Archive, Updated

http://nekonomicon.irixnet.org/
3•ThatGuyRaion•35m ago•1 comments

Extinct Code Grew Leopard Spots: AI-assisted evolution of a 90s screensaver

https://psychodeli.com/inside_the_math/
2•andyed•36m ago•2 comments

Trump raises tariffs to 15% day after Supreme Court ruling

https://www.bbc.co.uk/news/articles/cn8z48xwqn3o
14•rwmj•37m ago•4 comments

Build an LLM from Scratch in Max

https://llm.modular.com/
1•nojito•38m ago•0 comments

Slide rule simulator teaches you how to calculate the old-fashioned way

https://hackaday.com/2026/02/18/sliderule-simulator-teaches-you-how-to-do-calculations-the-old-fa...
1•iamwil•44m ago•0 comments

Show HN: AI Dev Hub. 100 free dev tools (all client-side, no signup, no ads)

https://aidevhub.io/
1•orbydx•45m ago•0 comments

Speaking of OpenClaw – OpenClaw news feed with RSS

https://deadstack.net/tag/openclaw
1•dreadsword•45m ago•0 comments

The "Enshittification" of Consumer Products

https://littlegreensteps.substack.com/p/the-enshittification-of-consumer
6•n2parko•46m ago•1 comments

How far back in time can you understand English?

https://www.deadlanguagesociety.com/p/how-far-back-in-time-understand-english
4•jger15•47m ago•0 comments