frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•11mo ago

Comments

tocs3•11mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Meditations on Moloch (2014)

https://slatestarcodex.com/2014/07/30/meditations-on-moloch/
1•simonebrunozzi•1m ago•0 comments

iOS [$59.99/Yr 1 Year Free] SwiftBillInvoice and Small Business CRM 1 Year Free

https://apps.apple.com/redeem/?ctx=offercodes&id=6760855924&code=SWIFTBILL1STYEAR
1•sheboftek•2m ago•0 comments

Running official Arch Linux on Arm (not to be confused ArchLinuxARM)

https://charon.konekopi.com/posts/archlinux_on_arm/
1•Charon77•3m ago•0 comments

Dermatology is wrong about the sun

https://twitter.com/MattZirwas/status/2050586857868591306
1•bilsbie•3m ago•0 comments

'Feed a cold': eating primes immune cells for action

https://www.nature.com/articles/d41586-026-01362-6
1•XzetaU8•4m ago•1 comments

New post: The Markdown Link no. 27

https://md-handbook.com/markdown-link-no-27
1•wordius•6m ago•1 comments

A minimal TUI for reading Git diffs

https://github.com/amiralies/gitty
1•amiralies•7m ago•0 comments

Will EU bring in a windfall tax on oil companies?

https://www.dw.com/en/will-eu-bring-in-a-windfall-tax-on-oil-companies/a-76920556
1•rustoo•8m ago•0 comments

An attempt at explaining bipolar disorder and psychosis

https://osf.io/w28g9_v1
1•anon1253•8m ago•0 comments

Show HN: Bhatti – Self-hostable Firecracker orchestrator with auto pause/wake

https://bhatti.sh/
2•sahil-shubham•10m ago•0 comments

Amazon says restoring damaged Middle East cloud operations to take months

https://www.reuters.com/world/middle-east/amazon-says-damaged-uae-cloud-region-recovery-take-seve...
4•abdelhousni•11m ago•1 comments

Learning Kubernetes Security second edition book

https://www.packtpub.com/en-us/product/learning-kubernetes-security-9781835886397
1•bernardoortega•16m ago•0 comments

Jaron Lanier – You Are Not a Gadget (2011) [video]

https://www.youtube.com/watch?v=IwbGumZ-FYg
1•simonebrunozzi•17m ago•0 comments

Show HN: Valkyr LM Inference with Realtime Guarantees

https://github.com/Foundation42/valkyr
1•quatonion•20m ago•0 comments

Show HN: Sourcery – Open Deep-Research, Grounded in Evidence

https://sourcery-deep-research.pagey.site/
1•freakynit•20m ago•0 comments

In Defense of AI Slop

https://reidhoffman.substack.com/p/in-defense-of-ai-slop
2•aworks•22m ago•1 comments

The extended predicative Mahlo universe in Martin-Löf type theory

https://academic.oup.com/logcom/article/34/6/1032/7158523
1•danny00•24m ago•0 comments

A Neuro-Symbolic engine that autonomously verified the GCT barrier in Lean 4

https://github.com/MyceliaCognition/chaos-prover
1•michaelpreid1•24m ago•0 comments

The Last Days of Butter Ridge

https://www.nytimes.com/2026/05/03/us/dairy-farm-butter-ridge-pennsylvania.html
1•JumpCrisscross•25m ago•0 comments

Meta abandons open-source Llama for proprietary Muse Spark

https://thenewstack.io/meta-abandons-llama-spark/
4•Nars088•27m ago•0 comments

Where's the New AI Infrastructure?

https://blog.viewfromtheweb.com/where-s-the-new-ai-infrastructure-6f3e2320/
1•rickdg•28m ago•0 comments

Show HN: Diom – Backend primitives (queue, rate limit, etc.) in one Rust binary

https://github.com/svix/diom
1•tasn•28m ago•0 comments

Show HN: Parrot – a fun, skeuomorphic audio recorder to hear yourself

https://www.zkhrv.com/parrot
2•zkhrv•32m ago•0 comments

Show HN: Glucera Local-first iPhone glucose app, seeking fingerstick beta tester

https://glucera.app/
2•kv0•33m ago•0 comments

Newborns come into the world with LOW vitamin K on purpose

https://twitter.com/ValerieAnne1970/status/2050757238860452132
2•bilsbie•33m ago•0 comments

Show HN: ShadowBrokers – AI trade signals for retail traders

https://www.shadowbrokers.app/start
3•devlsx•34m ago•1 comments

The problem with 'S-curves'

https://energynetworks.substack.com/p/the-problem-with-s-curves
2•scrlk•35m ago•0 comments

The `boring` SSH tunnel manager

https://alebeck.github.io/boring/
1•0x12A•36m ago•0 comments

Show HN: I built my site as a Windows 95 experience (2025)

https://wes.dev/
3•WesSouza•36m ago•0 comments

Show HN: I built my product to be right to repair friendly

https://stefan.schueller.net/posts/how-to-not-end-up-in-a-louis-rossmann-video/
2•sschueller•39m ago•0 comments