frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•6mo ago

Comments

tocs3•6mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Creating a Tab completion model from scratch

https://docs.getpochi.com/developer-updates/how-we-created-nes-model/
1•wsxiaoys•1m ago•1 comments

Cosmic Paradox Reveals the Awful Consequence of an Observer-Free Universe

https://www.quantamagazine.org/cosmic-paradox-reveals-the-awful-consequence-of-an-observer-free-u...
1•pseudolus•6m ago•0 comments

Myanmar's military detains foreigners in raid on second major online scam center

https://apnews.com/article/myanmar-thailand-scam-cybercrime-china-4f218e449b5993b37f20705061766bdf
1•bikenaga•7m ago•0 comments

VoIP Brings Back Old-Fashioned Pay Phones to Rural Vermont

https://spectrum.ieee.org/payphone-voip
1•pseudolus•8m ago•0 comments

Crypto got everything it wanted. Now it's sinking

https://www.economist.com/finance-and-economics/2025/11/18/crypto-got-everything-it-wanted-now-it...
1•pseudolus•10m ago•2 comments

How to Get High on Math

https://www.justinmath.com/how-to-get-high-on-math/
1•gmays•13m ago•0 comments

Nanoscale Mirrorless Superradiant Lasing

https://journals.aps.org/prl/abstract/10.1103/rbs2-2pd5
1•PaulHoule•17m ago•0 comments

Histone variants and chromatin structure, update of advances(2022)

https://pmc.ncbi.nlm.nih.gov/articles/PMC9764139/
2•rolph•19m ago•0 comments

Show HN: Sourcewizard – AI installs SDKs in your codebase

https://sourcewizard.ai
11•mifydev•19m ago•11 comments

Histone acetylation and CpG methylation on nucleosomes(2012)

https://www.sciencedirect.com/science/article/abs/pii/S1570963912000957
1•rolph•21m ago•0 comments

Microtubules as Fractal Time Crystals: implications for life and consciousness [video]

https://www.youtube.com/watch?v=YusrOYGAhqM
1•jakeogh•21m ago•0 comments

QRL: Future‑Proof Blockchain

1•slakernode•21m ago•0 comments

Ask HN: Struggling founders, pls share your startup struggle

2•vieews•24m ago•0 comments

Poland closes last Russian consulate after 'act of state terrorism' on railway

https://www.theguardian.com/world/2025/nov/19/poland-closes-last-russian-consulate-after-act-of-s...
2•wslh•25m ago•0 comments

I Hate Journalism's Culture of Casual Calumny

https://jessesingal.substack.com/p/i-hate-journalisms-culture-of-casual
1•aidenn0•28m ago•0 comments

NATO on alert as Poland accuses Russia of 'state terrorism' in rail blast

https://www.washingtonpost.com/world/2025/11/19/ukraine-poland-russia-rail-explosion-consulate/
1•softwaredoug•31m ago•1 comments

Truth Window

https://en.wikipedia.org/wiki/Truth_window
1•lukas099•34m ago•0 comments

"We're in an LLM bubble," Hugging Face CEO says–but not an AI one

https://arstechnica.com/ai/2025/11/were-in-an-llm-bubble-hugging-face-ceo-says-but-not-an-ai-one/
1•leemailll•34m ago•0 comments

Perennial Technical Reading List

https://parallelprogrammer.substack.com/p/a-reading-list-for-metalheads
1•Karrot_Kream•35m ago•0 comments

A fast and powerful log viewer that turns JSON/logfmt into human-readable form

https://github.com/pamburus/hl
2•lwhsiao•37m ago•0 comments

Reproducing UMI with a UR5 Robot Arm and a 3D-Printed Gripper

https://twitter.com/raulgarreta/status/1987679358409203921
1•rgarreta•38m ago•1 comments

NASA wants you to know that 3I/ATLAS is an interstellar comet

https://arstechnica.com/science/2025/11/nasa-really-wants-you-to-know-that-3i-atlas-is-an-interst...
1•bikenaga•39m ago•0 comments

Ask: What's the SSL gateway alternative to CF?

1•winstonwinston•40m ago•0 comments

Cargo-pgo: Subcommand for optimizing Rust binaries/libraries with PGO and BOLT

https://github.com/Kobzol/cargo-pgo
1•klaussilveira•41m ago•0 comments

Building an AI-powered health app (think Noom meets symptom tracking)

https://docs.google.com/document/d/11dZJUkC0fuowUCurUv8yHaJofKv3UeF9EBjL6ToUnK8/edit?usp=sharing
1•DietAppAi•45m ago•0 comments

Wirecutter buys 450 lb box of Amazon returns

https://www.nytimes.com/wirecutter/reviews/mystery-amazon-pallet-unboxing/
3•0xWTF•46m ago•1 comments

LLM chat interfaces will kill curiosity

https://harsehaj.substack.com/p/llms-curiosity-loss-frictionless-learning
2•harsehaj•56m ago•0 comments

Companies Predict 2026 Will Be the Worst College Grad Job Market in Five Years

https://www.wsj.com/lifestyle/careers/2026-graduates-job-market-7928bcd7
1•sarimkx•58m ago•1 comments

From early‑stage shortcuts to a ledger of record

https://www.parafin.com/blog/from-early-stage-shortcuts-to-a-ledger-of-record-our-journey-to-reli...
1•mattmarcus•59m ago•0 comments

Can weed help you drink less? Scientists study how well 'California sober' works

https://www.npr.org/sections/shots-health-news/2025/11/19/nx-s1-5604813/marijuana-drinking-califo...
2•Stratoscope•1h ago•0 comments