frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•11mo ago

Comments

tocs3•11mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Canvas hack shuts down operations at UW-Madison, worldwide

https://www.dailycardinal.com/article/2026/05/canvas-hack-shuts-down-operations-at-uw-madison-wor...
1•isaacdl•25s ago•0 comments

Why Don't Lowercase Letters Come Right After Uppercase Letters in ASCII?

https://tylerhillery.com/blog/why-dont-lowercase-chars-come-after-upper/
1•alpaylan•2m ago•0 comments

How a Congressional Primary Became a Proxy Battle over A.I.

https://www.newyorker.com/news/our-local-correspondents/how-a-congressional-primary-became-a-prox...
2•mitchbob•4m ago•1 comments

Aztec Codex

https://en.wikipedia.org/wiki/Aztec_codex
2•soupspaces•6m ago•0 comments

RNA-triggered cell killing with CRISPR-Cas12a2

https://www.nature.com/articles/s41586-026-10466-y
2•manuelr-t•6m ago•0 comments

Show HN: Disputron – AI small claims court for petty disputes

https://disputron.ai
3•etaheri•8m ago•0 comments

Show HN: Built a Public Journal for Builders

https://joinserendipity.co/
2•ldang•12m ago•1 comments

US launches new strikes on Iran

https://www.ft.com/content/21131ff4-35e1-4e5c-b827-1e07c9153e8e
3•JumpCrisscross•12m ago•0 comments

French prosecutors seek charges against Musk/X over child sexual abuse images

https://apnews.com/article/france-x-grok-deepfakes-child-sexual-abuse-charges-cac04b1869201bb4c9d...
3•afavour•14m ago•0 comments

Cybercrime group crashes Penn's Canvas system

https://www.thedp.com/article/2026/05/penn-canvas-shinythunters-data-breach-hack-second
2•bgschulman31•18m ago•0 comments

Family Deserves a Lasting Legacy

https://kleinlegacywealth.pro/
2•misterthp•18m ago•0 comments

Not fight, flight or freeze, but fawn

https://psyche.co/notes-to-self/not-fight-flight-or-freeze-this-is-what-fawning-looks-like
3•herbertl•18m ago•0 comments

blink-dev: Intent to Ship: Prompt API

https://groups.google.com/a/chromium.org/g/blink-dev/c/iR6R7-nQeHI?pli=1
2•xg15•19m ago•0 comments

We built autoresearch for browser agents

https://www.browserbase.com/blog/autobrowse
2•Kylejeong21•19m ago•0 comments

Show HN: No More Deepfakes – A Ramanujan 1/π and Nvidia B200 Architecture

https://zenodo.org/records/20065581
2•Prakash_1•20m ago•0 comments

Left-Right Handedness Asymmetry in Snail Shells (2004)

https://www.sciencedirect.com/science/article/pii/S0960982204005901
2•bookofjoe•21m ago•0 comments

My Claude dreams at night and remembers everything. Better than mempalace

https://github.com/CodeAbra/iai-mcp
2•CodeAbra•21m ago•0 comments

Fire at Dutch NorthC data center, all personnel evacuated in time

https://www.techzine.eu/news/infrastructure/141131/fire-at-northc-data-center-all-personnel-evacu...
2•notorandit•22m ago•1 comments

What do AI based layoffs say about their ability to scale?

https://www.elliotcsmith.com/what-do-ai-based-layoffs-say-about-tam/
2•smitec•23m ago•0 comments

How many of us are evaling our skills?

https://github.com/BintzGavin/apastra
2•GavinBintz•24m ago•0 comments

Attacking your competitors online is dumb

https://posthog.com/blog/why-attacking-competitors-is-dumb
3•herbertl•28m ago•0 comments

Reality emerges: What is the Universe made of?

https://aeon.co/essays/why-reality-is-more-than-the-sum-of-its-particles
2•herbertl•31m ago•0 comments

Ask HN: Are we observing the death of social networks?

4•fullstacking•32m ago•3 comments

thoughts on Gen AI's frontier of individuality

3•audreyfei•33m ago•0 comments

US trade court rules against Trump's 10% global tariffs

https://www.reuters.com/world/us-trade-court-rules-against-trumps-10-global-tariff-2026-05-07/
6•JumpCrisscross•34m ago•0 comments

Marc Andreessen Egg Game

https://marc-egg.eieio.games/
3•HotGarbage•34m ago•0 comments

Structured Procrastination

https://www.structuredprocrastination.com/
3•biscuits1•35m ago•1 comments

Ask HN: What will happen as AI costs increase?

3•MetaWhirledPeas•38m ago•0 comments

David Attenborough's 100 Years on Planet Earth

https://www.royalalberthall.com/tickets/events/2026/david-attenboroughs-100-years-on-planet-earth
2•smusamashah•38m ago•0 comments

cuda-oxide: a custom rustc backend for compiling GPU kernels in pure Rust

https://github.com/NVlabs/cuda-oxide
2•matt_d•39m ago•0 comments