frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•11mo ago

Comments

tocs3•11mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

There are only four sensible ways to build a website

https://www.jonoalderson.com/conjecture/four-ways-to-build-a-website/
1•eustoria•23s ago•0 comments

Speculation Rules for Evil

https://www.jonoalderson.com/performance/speculation-rules-for-evil/
1•eustoria•39s ago•0 comments

The metallurgy and artisan secrets of making GOES for large power transformers

https://frontiermap.substack.com/p/the-us-imports-82-of-its-large-power
1•rob_lh•43s ago•0 comments

AI Agents for Business Analysis: A Working BA's Honest Take

https://bettersoftware.uk/2026/01/17/ai-agents-for-business-analysis/
1•lifeisstillgood•1m ago•0 comments

View Transitions Toolkit

https://chrome.dev/view-transitions-toolkit/
1•eustoria•1m ago•0 comments

A beginners guide to identifying propaganda

https://covertactionmagazine.com/2026/04/24/living-in-the-age-of-hyperbole-or-a-guide-to-identify...
1•thinkingemote•2m ago•0 comments

Ask HN: Do you read differently now that anything could be AI generated?

1•dwa3592•2m ago•0 comments

Ask HN: Do you waste AI assisted time looking for answers?

1•Haeuserschlucht•3m ago•0 comments

Ask HN: Is anyone working on Gov Digital IDs or have implementation docs / FOSS

1•lifeisstillgood•5m ago•0 comments

Show HN: Space 4 Links

https://space4links.com/
1•skyfantom•6m ago•0 comments

Wanted: A New Finance Writer

https://www.economist.com/finance-and-economics/2026/04/23/wanted-a-new-finance-writer
1•bookofjoe•6m ago•0 comments

SpaceX: Test Like You Fly [video]

https://www.spacex.com/content/starship/test-like-you-fly
1•w8vY7ER•8m ago•1 comments

Buddhist monk builds irreverent classifieds for lonely human mortals

https://chickenlist.com
1•ascottaggart•10m ago•0 comments

Show HN: Mux0 – Open-source macOS terminal with workspace tabs and agent hooks

https://mux0.com/
1•Justin3go•11m ago•0 comments

Andromeda – Making local AI accessible to non-technical users

https://store.steampowered.com/app/4056090/SmarterWaysProductions_Andromeda/
1•klueglscheisser•11m ago•1 comments

Niri 26.04 was just released (scrollable-tiling Wayland compositor)

https://github.com/niri-wm/niri/releases/tag/v26.04
1•nickjj•12m ago•0 comments

The physics slop that YouTube wants me to make [video]

https://www.youtube.com/watch?v=Cd5EHfRerGI
1•thorum•13m ago•0 comments

NATO eyes Saab GlobalEye to replace AWACS planes in historic shift from the U.S.

https://www.armyrecognition.com/news/aerospace-news/2026/nato-selects-swedish-saab-globaleye-to-r...
2•vrganj•13m ago•0 comments

The Beautiful Barbell Effect

https://camerasearch.substack.com/p/the-beautiful-barbell-effect
1•Aeroi•14m ago•0 comments

Show HN: I gave Claude and Cursor a seat on my Kanban board [video]

https://www.youtube.com/watch?v=CD2-NGtshrY
1•spotlayn•14m ago•0 comments

Graphite open source hybrid image editor

https://www.graphite.art/
2•tomcam•16m ago•0 comments

Writing a book is a labor of love

https://usefulfictions.substack.com/p/writing-a-book-is-a-labor-of-love
2•eatitraw•17m ago•0 comments

Why Silicon Valley Is Turning to the Catholic Church

https://www.theatlantic.com/ideas/2026/04/silicon-valley-catholicism-ai-leo/686948/
2•jonah•18m ago•0 comments

Show HN: Odozi – open-source iOS journaling app

https://odozi.app
2•jlarks32•19m ago•0 comments

What's Missing in the 'Agentic' Story

https://www.mnot.net/blog/2026/04/24/agents_as_collective_bargains
7•ingve•20m ago•0 comments

Baldmaxxing Confidence Mobile App – No BS

https://baldandwinning.com/en
2•thisissidhant•23m ago•1 comments

GLP-1 receptor agonist effects on Alzheimer's pathophysiology: Systematic review

https://www.sciencedirect.com/science/article/pii/S1044743126000217
2•bookofjoe•24m ago•0 comments

A validation-gated execution system (VYRDON)

https://github.com/teee79A/vyrdon
2•vyrdon•25m ago•0 comments

Your Job Isn't Programming

https://codeandcake.dev/posts/2025-12-12-your-job-isnt-programming
2•dgroshev•27m ago•0 comments

Trump alum helps Israel mount AI influence campaign

https://www.axios.com/2026/04/25/israel-ai-influence-parscale
2•sosomoxie•28m ago•0 comments