frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•12mo ago

Comments

tocs3•12mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

PyPI packages are increasing rapidly

https://rushter.com/blog/pypi-packages/
1•lumpa•44s ago•0 comments

Baylibre Partners with SpacemiT to Bring Android 16 to RISC-V

https://baylibre.com/blog/baylibre-partners-with-spacemit-to-bring-android-16-to-risc-v/
1•fork-bomber•1m ago•0 comments

Learning-focused CTFs are Facing a Restructure

https://exploiting.systems/posts/2026-05-17-learning-focused-ctfs-are-facing-a-restructure
1•ropbear•1m ago•0 comments

HearHam Live Repeater listing app

https://hearham.com/repeaters
1•fodmap•3m ago•0 comments

The new class of AI jobs

https://www.businessinsider.com/new-ai-jobs-2026-5
3•giuliomagnifico•4m ago•0 comments

Qubes OS: A reasonably secure operating system

https://www.qubes-os.org/
1•throwoutway•8m ago•0 comments

Utah lawmakers form united front in push to ban prediction markets

https://www.theguardian.com/us-news/2026/may/18/you-can-bet-on-it-utah-lawmakers-form-united-fron...
1•thm•8m ago•0 comments

Busting performance issues, AI edition

https://p403n1x87.github.io/busting-performance-issues-ai-edition.html
1•p403n1x87•9m ago•0 comments

What A.I. Did to My College Class

https://www.nytimes.com/2026/05/17/opinion/chatgpt-ai-college-school-graduation.html
1•thm•11m ago•0 comments

How to Write Something Wise (Maria Popova Interview) [video]

https://www.youtube.com/watch?v=yb9Tz-RQFN4
1•freediver•12m ago•0 comments

I automated opt-outs for 500 data broker sites (open source)

https://github.com/stephenlthorn/auto-identity-remove
2•stephenlthorn•14m ago•0 comments

AI agent harnesses like OpenClaw are changing LLMs, inference, and CPUs

https://www.theregister.com/ai-ml/2026/05/17/how-ai-agent-harnesses-like-openclaw-are-changing-ll...
1•abdelhousni•17m ago•0 comments

The Global Fertility Crisis Is Worse Than You Probably Think

https://www.derekthompson.org/p/why-the-whole-world-stopped-having
8•momentmaker•20m ago•1 comments

How Trump's crypto venture and Iran's top exchange tapped into the same networks

https://www.reuters.com/investigations/how-trumps-crypto-venture-irans-top-exchange-tapped-into-s...
2•notagoodidea•22m ago•1 comments

Show HN: Chrome extension that hides YouTube shorts and other distractions

https://chromewebstore.google.com/detail/distraction-free-youtube/ckkcdcieljicflmkokdekbfpkclmmibp
2•mikax•23m ago•1 comments

Now that code is cheap, personal and open software is next

https://blog.stromflix.com/personal-software-is-next
1•StromFLIX•25m ago•0 comments

How to Create Your Own Bespoke, Artisanal, Hand-Drawn PCBs

https://www.hackster.io/news/how-to-create-your-own-bespoke-artisanal-hand-drawn-pcbs-d96d6978a4fb
2•CTOSian•28m ago•0 comments

Japanese-style free pdf editor

https://katanapdf.com/
1•samuraiduckling•29m ago•1 comments

The Backward Logic of Chickenpox Parties

https://www.wired.com/story/chickenpox-parties-and-the-pre-vaccine-internet/
1•joozio•29m ago•0 comments

Indexing code by behavior not imports – tested on large repos, seeking feedback

1•afxuh•30m ago•0 comments

Ask HN: Which books do you wish you'd read earlier in life?

1•jimsojim•33m ago•0 comments

I made a machine that burns money to prove it doesnt exists [video]

https://www.youtube.com/watch?v=2UM4j1_xEs0
3•tzvc•33m ago•1 comments

Spec-Driven Development with math-glyph compression

https://github.com/kborovik/pilot-skills/
1•kborovik•34m ago•0 comments

Zero a Language for Humans and Robots

https://zero-lang.com/
1•dcu•35m ago•0 comments

Show HN: Alder: Dynamic Code Execution Without Roslyn

1•MartiSilvio•37m ago•0 comments

A Danish Couple's Maverick African Research Finds Its Moment in RFK Jr.'S Vacci

https://www.wired.com/story/a-danish-couples-maverick-african-research-finds-its-moment-in-rfk-jr...
2•joozio•37m ago•0 comments

Tim – A High-Performance Template Engine and Markup Language

https://github.com/openpeeps/tim
1•TheWiggles•38m ago•0 comments

Show HN: I built an easy to manage, sharable personal memory for my AI agents

https://ai.actingweb.io/
1•gregertw•44m ago•1 comments

Show HN: Shiftpaper – native parallax wallpaper engine for Wayland

https://github.com/CPritch/shiftpaper
3•PxldLtd•44m ago•0 comments

An ICE Firearms Trainer Was Involved in at Least 4 Deadly Shootings

https://www.wired.com/story/an-ice-firearms-trainer-was-involved-in-at-least-4-deadly-shootings/
4•joozio•45m ago•0 comments