frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•1y ago

Comments

tocs3•1y ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Anduril, General Atomics get Air Force contracts to build first drone wingmen

https://www.defenseone.com/defense-systems/2026/06/anduril-general-atomics-get-air-force-contract...
1•geox•2m ago•0 comments

Jemalloc cut our production memory by 47%

https://www.refine.ink/blog/jemalloc-fragmentation
1•pedromsantos•2m ago•0 comments

Claude Fable 5 suspension: Anthropic exec says it may return in the coming days

https://www.koreajoongangdaily.com/business/anthropic-confident-of-reenabling-mythos-fable-5-acce...
1•vfc1•3m ago•0 comments

TongFlow, a free open-source multi-modal AI workflow studio

https://github.com/user-attachments/assets/407a7e7b-2d44-4c90-8016-33d0a9f5e7d5
1•tong-io•10m ago•0 comments

Tools for checking traffic of website subpages

1•blacker145•10m ago•0 comments

Seven Perfect Shuffles Randomize a Deck of Cards. But How Many Sloppy Ones?

https://www.quantamagazine.org/seven-perfect-shuffles-randomize-a-deck-of-cards-but-how-many-slop...
1•layer8•11m ago•1 comments

SpaceX will officially acquire Cursor for $60B

https://fortune.com/2026/06/17/spacex-acquire-cursor-60-billion/
2•aledevv•12m ago•1 comments

Colours Can Compute – CoM Apr 2021 [video]

https://www.youtube.com/watch?v=f3dTmO1JfmY
1•ColinWright•12m ago•0 comments

Aerioq Heating Cooling Portable AC

https://www.facebook.com/AERIOQHeatingCoolingPortableAC.Get
1•rimshesyu•18m ago•0 comments

Space Telescopes Drown in Satellite Light Pollution

https://www.universetoday.com/articles/satellites-have-brightened-the-skies-by-about-10-across-th...
2•JeanKage•18m ago•0 comments

Show HN: Openfusion - enhanced results from a panel of models

https://github.com/shahar-dagan/openfusion
2•shadag•19m ago•0 comments

World leaders want American AI. They just don't want America to turn it off

https://www.techsentiments.com/article/2026/06/17/world-leaders-want-american-ai-they-just-dont-w...
1•rajsuper123•21m ago•0 comments

Matter 1.6

https://csa-iot.org/newsroom/matter-1-6-enables-more-intuitive-setup-multi-ecosystem-experiences-...
1•tosh•23m ago•0 comments

Castles in the Air

https://articles.pragdave.me/p/castles-in-the-air
1•ingve•25m ago•0 comments

Ask HN: Is anyone using the A2A protocol yet?

1•asim•27m ago•0 comments

K (1993)

https://web.archive.org/web/20160330020952/http://archive.vector.org.uk/art10010830
1•tosh•28m ago•0 comments

Integration Testing on JVM

https://kpavlov.me/blog/integration-testing-on-jvm/
1•karimtr•30m ago•0 comments

Midjourney Medical goes from AI image generation to full-body ultrasounds

https://www.theverge.com/ai-artificial-intelligence/952011/midjourney-medical-ai-ultrasound-scan
1•JeanKage•31m ago•0 comments

Djevops: Self-Host Django Easily

https://github.com/mherrmann/djevops
1•mherrmann•32m ago•0 comments

Summation and Ordering of Logs Regarding Recent Temporal Incident

https://medium.com/luminasticity/summation-and-ordering-of-logs-regarding-recent-temporal-inciden...
1•bryanrasmussen•33m ago•0 comments

CLI That Enforces Spec-Driven Development with Claude Code, OpenCode, and Codex

https://github.com/davidpv/opsx-spec-driven-development-toolkit
1•davidpv•36m ago•0 comments

Cognitive Surrender

https://addyosmani.com/blog/cognitive-surrender/
1•marksully•36m ago•1 comments

Cannabis commercialisation not decriminalisation drives up usage, study finds

https://www.theguardian.com/society/2026/jun/17/cannabis-commercialisation-not-decriminalisation-...
1•n1b0m•36m ago•0 comments

Smalltalk Blocks

https://donraab.medium.com/smalltalk-blocks-cbe508c2e472
1•ingve•37m ago•0 comments

AI Dungeons: How Caching and Optimized Context Works

https://old.reddit.com/r/AIDungeon/comments/1u6xn1n/how_caching_and_optimized_context_works/
1•doener•40m ago•0 comments

Dear A.I. Companies: The Doom Trolling Needs to Stop

https://www.nytimes.com/2026/06/17/opinion/ai-dangerous-openai-anthropic.html
3•frb•41m ago•0 comments

Automatic Data Processing: A Programming Language (Draft) (1960) [pdf]

https://softwarepreservation.computerhistory.org/apl/book/Iverson-AutomaticDataProcessing-color.pdf
1•tosh•41m ago•0 comments

I built a 2.3MB Markdown-to-PDF app because Chromium felt absurd

https://github.com/Butterski/rayomd
2•buttterski•44m ago•3 comments

Show HN: Meeting Notes Sync – import transcripts and AI summaries into Obsidian

https://community.obsidian.md/plugins/meeting-notes-sync
1•andreagrandi•45m ago•1 comments

A Short Guide to Minimal Web Development (2018)

https://meiert.com/blog/minimal-web-development/
1•downbad_•45m ago•0 comments