frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•7mo ago

Comments

tocs3•7mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Show HN: I built a tool that queries 6 LLMs and synthesizes their answers

https://converge.rest/
1•cruise4914•1m ago•0 comments

State of Vibe 2025 – Vibe Creation Ecosystem Report of China

https://stateofvibe.ai/
1•kalasoo•9m ago•1 comments

Probably Post More

https://www.aadillpickle.com/blog/post-more
1•aadillpickle•12m ago•0 comments

All my senses are being tortured simultaneously

https://news.lettersofnote.com/p/all-my-senses-are-being-tortured
1•animal_spirits•16m ago•0 comments

Santa Claus on delivering 99% Uptime [video]

https://www.youtube.com/watch?v=uMoql_RYVBQ
2•vismit2000•16m ago•0 comments

Ukraine After 4 Years of War [video]

https://www.youtube.com/watch?v=U5FShLz-Q8Q
1•dralley•16m ago•0 comments

Circuit Artist – Pixel art circuit design with NANDs, now with rewind and layers

https://github.com/lets-all-be-stupid-forever/circuit-artist
2•rafinha•18m ago•1 comments

Christian Marclay on Why 'The Clock' Challenges a Digitally Obsessed Generation

https://news.artnet.com/art-world/the-clock-christian-marclay-2723438
1•bookofjoe•23m ago•0 comments

Crosspost Automatically between X and Bluesky

https://microposter.so/features/cross-post-x-bluesky
1•ProgrammerByDay•28m ago•1 comments

Ukraine delivers humiliating Christmas Day blow to Putin by recapturing key city

https://nypost.com/2025/12/25/world-news/ukraine-delivers-humiliating-christmas-day-blow-to-putin...
1•MilnerRoute•37m ago•0 comments

Rapace – RPC over SHM / WS / TCP / Mem

https://rapace.bearcove.eu/
2•todsacerdoti•37m ago•0 comments

DockBridge – Run Docker on cheap cloud servers, pay only when you use it

https://github.com/Max-Levitskiy/DockBridge
2•FunShot•39m ago•1 comments

Sorting Tutor: sorting algorithm visualizer

https://tilde.team/~kiedtl/sorting/
1•movezig•43m ago•0 comments

Show HN: Apps by AI (Claude Opus 4.5)

https://lawrencehook.github.io/apps-by-ai/
1•lawrencehook•54m ago•0 comments

Retreating from EVs could be hazardous for Western carmakers

https://www.economist.com/business/2025/12/17/retreating-from-evs-could-be-hazardous-for-western-...
3•smurda•54m ago•2 comments

Mysterious quantum computing restrictions spread across multiple nations (2024)

https://www.tomshardware.com/tech-industry/quantum-computing/mysterious-quantum-computing-restric...
6•kome•55m ago•2 comments

An experiment in separating identity, memory, and tools

https://RCRDBL.com
2•promptfluid•57m ago•1 comments

Show HN: DStream (bespoke music player for web) non-web clients

https://github.com/DusteDdk/dstream-clients
2•dusted•57m ago•0 comments

Large Causal Models from Large Language Models

https://arxiv.org/abs/2512.07796
1•Anon84•58m ago•0 comments

We may never be able to tell if AI becomes conscious

https://techxplore.com/news/2025-12-ai-conscious-philosopher.html
3•gmays•59m ago•1 comments

Loki Mode

https://github.com/asklokesh/claudeskill-loki-mode
2•handfuloflight•1h ago•0 comments

Human Processor Model

https://en.wikipedia.org/wiki/Human_processor_model
4•Sir_Twist•1h ago•1 comments

Betty Reid Soskin, who became a park ranger at 85, dies aged 104

https://www.theguardian.com/us-news/2025/dec/22/betty-reid-soskin-death-national-park-service
3•herbertl•1h ago•0 comments

Why is it easier to whistle in tune than to sing in tune?(2018)

https://pmc.ncbi.nlm.nih.gov/articles/PMC5936900/
4•BiraIgnacio•1h ago•1 comments

How to Use the Linux Uniq Command (With Examples) [video]

https://www.youtube.com/watch?v=2b8jwRomkWM
3•billybuckwheat•1h ago•0 comments

James Moylan, Engineer Who Designed Gas Tank Arrow Indicator, Has Died

https://fordauthority.com/2025/12/ford-engineer-that-designed-gas-tank-indicator-passes-away/
3•NaOH•1h ago•1 comments

Show HN: GitHub Activity Analytics Powered by ClickHouse

https://velocity.clickhouse.com/#org=ClickHouse&metric=all_activity&range=all&grouping=auto&alexe...
1•saisrirampur•1h ago•0 comments

Atomic Orbital Viewer

https://asliceofcuriosity.fr/assets/atom/orbitalsApp-Metropolis.html
4•derbOac•1h ago•0 comments

Postgres for everything, does it work?

2•saisrirampur•1h ago•0 comments

QNX Self-Hosted Developer Desktop

https://devblog.qnx.com/qnx-self-hosted-developer-desktop-initial-release/
14•transpute•1h ago•5 comments