frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Adwetysu6tryth

https://glot.io/snippets/hfmww91ujn
1•faresfa•42s ago•0 comments

A Language for Agents

https://lucumr.pocoo.org/2026/2/9/a-language-for-agents/
1•doppp•1m ago•0 comments

Open Source: How Middle Powers Can Build Influence in the Age of AI

https://institute.global/insights/tech-and-digitalisation/open-source-influence-age-of-ai
2•romes•2m ago•0 comments

Show HN: Voice-to-voice translation for meetings (macOS, alpha)

https://voiceleap.ai/
1•kamban•2m ago•1 comments

RLM Explained

https://twitter.com/zby/status/2020802687659348196
1•zby•3m ago•0 comments

The pitch deck is dead. Write a pitch.md instead

https://www.joanwestenberg.com/the-pitch-deck-is-dead-write-a-pitch-md-instead/
1•flobosg•5m ago•0 comments

Show HN: GW – manage Git worktrees when you're babysitting multiple AI agents

https://github.com/nikhilshinday/gw
1•chaos_emergent•6m ago•0 comments

A handy method for hazards detection in an IS of a pipelined processor [pdf]

https://arxiv.org/abs/1203.0787
1•liungrin•10m ago•0 comments

Show HN: Algorithmically Finding the Longest Line of Sight on Earth

https://alltheviews.world
2•tombh•10m ago•1 comments

Ask HN: Why do you use AI for coding?

1•MrSandingMan•11m ago•0 comments

Show HN: Blink – Build custom AI agents in TypeScript for your team

https://github.com/coder/blink
1•hugodutka•12m ago•0 comments

The original vi is a product of its time (and its time has passed)

https://utcc.utoronto.ca/~cks/space/blog/unix/ViIsAProductOfItsTime
2•adunk•13m ago•1 comments

Xbox cancel French localizations as voice actors refuse AI training clauses

https://www.jeuxonline.info/actualite/65797/doublage-francais-absent-plusieurs-jeux-microsoft-ea-...
3•WhereIsTheTruth•13m ago•0 comments

Half of CO2 emissions come from just 32 fossil fuel firms, study shows

https://www.theguardian.com/environment/2026/jan/21/carbon-dioxide-co2-emissions-fossil-fuel-firm...
3•JeanKage•14m ago•0 comments

Show HN: Turn DeFi whitepapers into executable flows for quick validation

https://eigenarc.com
1•sridhar87•16m ago•0 comments

Screenshots from developers and Unix people (2002) (2015)

https://anders.unix.se/2015/10/28/screenshots-from-developers--unix-people-2002/
1•SerCe•16m ago•0 comments

GitHub Status – Degraded Performance in Webhooks API and UI, Pull Requests

https://www.githubstatus.com/incidents/ffz2k716tlhx
1•jackwilsdon•16m ago•0 comments

Seedance 2.0

https://seedance2.studio
1•sarkory•18m ago•1 comments

I went through the OpenClaw Source code. And here are my observations

https://pai.dev/i-went-through-every-line-of-code-of-openclaw-so-you-dont-have-to-bec04bfe3be0
1•dheerajmp•19m ago•0 comments

Newer Faster Amiga Internet Access from Your BlueSCSI

https://www.youtube.com/watch?v=awwRFWpfL-4
2•doener•20m ago•0 comments

Stack Overflow for AI Coding Agents

https://shareful.ai
13•schappim•21m ago•0 comments

Show HN: Fifu – Ultra-Fast Terminal YouTube Downloader

https://fifu-docs.vercel.app
2•dawitworku•22m ago•0 comments

GitHub Is Down in EU

https://statusgator.com/services/github
1•bilekas•24m ago•1 comments

Is India about to make Ozempic-like weight-loss drugs a whole lot cheaper?

https://www.cnn.com/2026/02/07/india/india-semaglutide-patent-expiry-intl-hnk-dst
2•sandGorgon•25m ago•0 comments

The Steamdeck is my guitar rig (video)

https://www.youtube.com/watch?v=yL1DM0QWGSE
1•viraptor•26m ago•0 comments

Mutmut: A Python mutation testing system (2016)

https://kodare.net/2016/12/01/mutmut-a-python-mutation-testing-system.html
2•shirian•28m ago•1 comments

From churches to chatbots: How AI is fusing with religion

https://www.reuters.com/technology/ai-and-us/pulpits-chatbots-how-ai-is-fusing-with-religion-2026...
1•u1hcw9nx•32m ago•0 comments

We Give AI Agents Long-Term Memory Without Blowing the Budget

https://metaduck.com/how-we-give-ai-agents-long-term-memory-without-blowing-the-budget/
2•pgte•33m ago•0 comments

Science parks can transform Australian universities into innovation hubs

https://360info.org/how-science-parks-can-transform-australian-universities-into-innovation-hubs/
1•JeanKage•33m ago•0 comments

Show HN: AST parsing + LLM to generate live architecture docs and diagrams

https://demo.maimap.dev/
1•ev_dev3•33m ago•0 comments