The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in layman's terms (posted below), and I am not too surprised by the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps, and surprisingly, they still perform about the same as, and sometimes even better than, those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."
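
For anyone wondering what a "reasoning path based on A* search" might look like in practice, here is a rough sketch. It is not the paper's code or data format: the grid encoding, the `create`/`close` record names, and the `scrambled_trace` helper are all assumptions made up for illustration. It runs A* on a small grid, logs each search step as a trace record, and then builds a shuffled copy of that trace as a stand-in for the "random and incorrect reasoning steps" mentioned in the summary.

```python
import heapq
import random


def astar_with_trace(grid, start, goal):
    """Run A* on a 4-connected grid and log every search step.

    grid is a list of strings where '#' marks a wall. Returns (path, trace).
    The trace records are a hypothetical stand-in for intermediate tokens.
    """
    def h(cell):
        # Manhattan distance: admissible and consistent on a unit-cost grid.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    trace = []
    frontier = [(h(start), 0, start)]  # entries are (f, g, cell)
    came_from = {start: None}
    best_g = {start: 0}
    closed = set()

    while frontier:
        _, g, cell = heapq.heappop(frontier)
        if cell in closed:
            continue  # stale queue entry for an already expanded cell
        closed.add(cell)
        trace.append(("close", cell, g, h(cell)))
        if cell == goal:
            # Walk the parent links back to the start to rebuild the path.
            path = []
            while cell is not None:
                path.append(cell)
                cell = came_from[cell]
            return path[::-1], trace
        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if not (0 <= nr < len(grid) and 0 <= nc < len(grid[0])):
                continue
            if grid[nr][nc] == "#":
                continue
            ng = g + 1
            if ng < best_g.get(nxt, float("inf")):
                best_g[nxt] = ng
                came_from[nxt] = cell
                heapq.heappush(frontier, (ng + h(nxt), ng, nxt))
                trace.append(("create", nxt, ng, h(nxt)))
    return None, trace  # goal unreachable


def scrambled_trace(trace, seed=0):
    """Return a shuffled copy of the trace (illustrative corruption only)."""
    shuffled = list(trace)
    random.Random(seed).shuffle(shuffled)
    return shuffled


if __name__ == "__main__":
    grid = ["..#",
            "..#",
            "..."]
    path, trace = astar_with_trace(grid, (0, 0), (2, 2))
    print("path:", path)
    print("correct trace:", trace)
    print("scrambled trace:", scrambled_trace(trace))
```

A training example would then pair the problem with either the correct trace or the scrambled one, followed by the same final path. As I read the summary, the surprising claim is that which of the two you train on makes little difference to final-answer accuracy.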
