The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•11mo ago

Comments

tocs3•11mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
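
For anyone who wants to see what "check both the final answers and the reasoning steps" could look like in practice, here is a rough sketch. It is not the paper's code or data format: the grid, the `create`/`close` trace vocabulary, and the function names (`astar_with_trace`, `plan_is_valid`, `trace_is_valid`) are all assumptions made up for illustration. The point is only that the final plan and the intermediate trace can be validated independently, so one can be right while the other is garbage.

```python
import heapq

def astar_with_trace(grid, start, goal):
    """A* on a 4-connected grid (1 = wall). Returns (plan, trace).

    The trace is a list of (op, cell, g, h) tuples, a made-up stand-in
    for the step-by-step tokens a model would be trained on.
    """
    def h(cell):
        # Manhattan distance heuristic
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    open_heap = [(h(start), 0, start)]
    came_from = {start: None}
    best_g = {start: 0}
    closed = set()
    trace = [("create", start, 0, h(start))]

    while open_heap:
        _, g, cell = heapq.heappop(open_heap)
        if cell in closed:
            continue  # stale heap entry
        closed.add(cell)
        trace.append(("close", cell, g, h(cell)))
        if cell == goal:
            plan = []
            while cell is not None:
                plan.append(cell)
                cell = came_from[cell]
            return plan[::-1], trace
        r, c = cell
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue
            if grid[nr][nc] == 1:
                continue
            ng = g + 1
            if ng < best_g.get((nr, nc), float("inf")):
                best_g[(nr, nc)] = ng
                came_from[(nr, nc)] = cell
                heapq.heappush(open_heap, (ng + h((nr, nc)), ng, (nr, nc)))
                trace.append(("create", (nr, nc), ng, h((nr, nc))))
    return None, trace

def plan_is_valid(grid, start, goal, plan):
    """Check the final answer only: adjacent, wall-free steps from start to goal."""
    if not plan or plan[0] != start or plan[-1] != goal:
        return False
    for (r1, c1), (r2, c2) in zip(plan, plan[1:]):
        if abs(r1 - r2) + abs(c1 - c2) != 1 or grid[r2][c2] == 1:
            return False
    return True

def trace_is_valid(trace):
    """Check the trace only: every 'close' must match an earlier 'create'."""
    created = set()
    for op, cell, g, _ in trace:
        if op == "create":
            created.add((cell, g))
        elif (cell, g) not in created:
            return False
    return True

GRID = [
    [0, 0, 0],
    [1, 1, 0],
    [0, 0, 0],
]
plan, trace = astar_with_trace(GRID, (0, 0), (2, 0))
# Prepend a 'close' for a node that was never created: the trace is now
# invalid, but the plan it is paired with is untouched and still correct.
corrupted = [("close", (9, 9), 0, 0)] + trace
print(plan_is_valid(GRID, (0, 0), (2, 0), plan), trace_is_valid(trace), trace_is_valid(corrupted))
```

The two validators never look at each other's input, which is the distinction the summary above is drawing: a correct final plan says nothing about whether the steps shown alongside it were valid.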
