frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•10mo ago

Comments

tocs3•10mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Why and How to Avoid Hamburger Menus (2014)

https://lmjabreu.com/post/why-and-how-to-avoid-hamburger-menus/
1•chistev•1m ago•1 comments

You Do Not, in Fact, Have to Hand It to Them

https://2ndbreakfast.audreywatters.com/you-do-not-in-fact-have-to-hand-it-to-them/
1•MindGods•3m ago•0 comments

The FlightAware global FlightFeeder network

https://blog.flightaware.com/inside-the-flightaware-global-flightfeeder-network
1•eu•4m ago•0 comments

Show HN: Python Dispatching Without Limitations

https://ulrikchristensengit.github.io/argbox/dispatching/
1•Upitor•5m ago•0 comments

Why use living cells? Researchers are making chemicals with enzymes alone

https://phys.org/news/2026-03-cells-chemicals-enzymes.html
1•Brajeshwar•6m ago•0 comments

Filter news with a single prompt

https://distill.cstein.xyz
1•cstein2•7m ago•1 comments

EU agrees to fine online platforms importing unsafe products

https://www.reuters.com/sustainability/eu-reaches-deal-fine-online-platforms-importing-products-d...
1•geox•8m ago•0 comments

Tar: A slop-free alternative to rsync

https://drewdevault.com/2026/03/28/2026-03-28-rsync-without-rsync.html
3•shscs911•9m ago•0 comments

Show HN: I built a tool that checks if your council tax band is wrong

https://counciltaxchallenger.co.uk
1•giwa_abdul•9m ago•0 comments

Show HN: Octopus, Open-source alternative to CodeRabbit and Greptile

https://octopus-review.ai
1•redoh•12m ago•0 comments

How to implement the Outbox pattern in Go and Postgres

https://www.youtube.com/watch?v=hJ4S-5MirvU
1•der_gopher•12m ago•0 comments

I am selling my AI image app

https://trustmrr.com/startup/picx-studio
2•Yash16•12m ago•0 comments

We got a heat pump (at last)

https://www.positech.co.uk/cliffsblog/2026/03/28/we-got-a-heat-pump-at-last/
1•alin23•12m ago•0 comments

The Decadelong Feud Shaping the Future of AI

https://www.wsj.com/tech/ai/the-decadelong-feud-shaping-the-future-of-ai-7075acde
2•pondsider•13m ago•1 comments

Ask HN: HN now hides comments from new users

1•nomoreaccts•14m ago•2 comments

Your Data Never Dies

https://www.mjeggleton.com/blog/your-data-never-dies
1•michaelje•17m ago•0 comments

Data centers aren't breaking the grid. A broken grid is

https://fortune.com/2026/03/28/data-centers-grid-problem-infrastructure-ai/
2•Brajeshwar•17m ago•0 comments

Erythritol linked to brain damage and stroke risk

https://www.sciencedaily.com/releases/2026/03/260328065333.htm
1•beboplifa•18m ago•0 comments

When AI Takes My Job (Whenaitakesmyjob.work)

https://whenaitakesmyjob.work
1•hafiz_•18m ago•0 comments

EPA Plans to Start Diluting Gasoline This May

https://www.thedrive.com/news/the-feds-plan-to-start-diluting-gasoline-this-may-explained
1•timbowhite•23m ago•0 comments

Show HN: ACP – An open protocol for agents to operate live UIs natively

https://acp-protocol.org/
1•cezarvil•26m ago•1 comments

Why does cannabis give people 'the munchies'?

https://www.livescience.com/health/why-does-cannabis-give-people-the-munchies
1•Brajeshwar•27m ago•0 comments

Show HN: A shared emergency number for families

https://statphone.com/
2•popupeyecare•28m ago•1 comments

I Built the C/C++ Visualizer I Always Wanted [video]

https://www.youtube.com/watch?v=wQkXyK--xHI
1•richardboegli•28m ago•0 comments

Social media is populist and polarising; AI may be the opposite

https://www.ft.com/content/3880176e-d3ac-4311-9052-fdfeaed56a0e
2•artninja1988•29m ago•1 comments

How I accidentally made the fastest C# CSV parser

https://bepis.io/blog/turbo-csv-parser/
1•PretzelFisch•29m ago•0 comments

SourceDive

https://marketplace.visualstudio.com/items?itemName=tariq10x.sourcedive
1•richardboegli•30m ago•0 comments

Feeding a Family of Four on $200 a Month

https://www.amazon.com/dp/B0GSP1NNKL
1•OldDennis•34m ago•0 comments

AI Can Fuel Overconfidence in Bad Relationship Decisions

https://www.psychologytoday.com/us/blog/urban-survival/202512/ai-can-fuel-overconfidence-in-bad-r...
1•oldfrenchfries•34m ago•0 comments

Show HN: Why I built another GitHub star tracker

https://www.startrail.dev/
1•mvansegbroeck•34m ago•0 comments