frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•11mo ago

Comments

tocs3•11mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

The Mystery of Rennes-Le-Château, Part 5: The Man Behind the Curtain

https://www.filfre.net/2026/05/the-mystery-of-rennes-le-chateau-part-5-the-man-behind-the-curtain/
1•ibobev•24s ago•0 comments

Using autocommands with the new Neovim API

https://xnacly.me/posts/2023/autocommand-nvim/
1•ibobev•1m ago•0 comments

Jury Convicts Isis-K Terrorist for Role in the Abbey Gate Bombing&Other Attacks

https://www.justice.gov/opa/pr/federal-jury-convicts-isis-k-terrorist-role-abbey-gate-bombing-and...
1•737min•1m ago•0 comments

Ukiyo-E Online Database Holds 220k Japanese Woodblock Prints (2017)

https://mymodernmet.com/japanese-woodblock-ukiyo-e-online-database/
1•speckx•2m ago•0 comments

U.S. Fed policymakers stance on interest rate hikes

https://www.reuters.com/graphics/USA-ECONOMY/FED/gdpzajoegvw/
1•gmays•3m ago•0 comments

Category Theory for Tiny ML in Rust

https://hghalebi.github.io/category_theory_transformer_rs/
1•pajop•5m ago•0 comments

Show HN: The Cat Is Under Mayonnaise – Modifying LLM Behavior Without Retraining

https://github.com/andycufari/the-cat-is-under-mayonnaise-experiment
1•andycufari•6m ago•0 comments

Sex matters: European urban birds flee approaching women sooner than men

https://wiley.scienceconnect.io/error?msg=ewogICJpZCIgOiAiYzhmN2JlYjItYTllOC00OWQ0LTkyNTgtM2ZmNWY...
2•w4lker•6m ago•0 comments

The AI Governance Gap

1•Qoris_AI2026•7m ago•1 comments

Show HN: Safety layer between AI agents and databases

https://github.com/fazhq/faz
1•burhanultayyab•7m ago•0 comments

A.I.-Themed High School Is Put on Hold After Parental Backlash

https://www.nytimes.com/2026/04/27/nyregion/nyc-ai-high-school-halted.html
2•bookofjoe•8m ago•1 comments

Rural America is resisting the surge in data center construction

https://arstechnica.com/ai/2026/04/rural-america-is-resisting-the-surge-in-data-center-construction/
2•speckx•8m ago•0 comments

Trump administration cites national security in stalling 165 wind farms

https://arstechnica.com/science/2026/05/trump-administration-cites-national-security-in-stalling-...
4•ndr42•8m ago•0 comments

From RSS to Atom

https://susam.net/from-rss-to-atom.html
1•susam•8m ago•0 comments

Audion – Music Sequencing Language

https://github.com/audion-lang/audion
1•skor•9m ago•0 comments

The Cartoon That Shut Down Boston

https://nowiknow.com/the-cartoon-that-shut-down-boston/
2•cainxinth•11m ago•0 comments

EuroClojure 2027 in Prague

https://2027.euroclojure.org/
2•kaliszad•12m ago•0 comments

Meta Solved Problem with Kenyan Contractors Seeing Footage of AI Glasses Wearers

https://daringfireball.net/linked/2026/05/01/meta-solved-their-problem
3•mooreds•12m ago•0 comments

OpenClaw Got Safer in Public

https://openclaw.ai/blog/openclaw-security-in-public
2•zvikomborero•13m ago•0 comments

ChatGippety: Enterprise-Grade Conversational Compliance

https://chatgippety.com/
2•mooreds•13m ago•0 comments

Denmark faces data center reckoning as power grid overwhelmed

https://www.cnbc.com/2026/05/04/denmark-data-centers-moratorium-grid-pause-power-demand.html
3•tcp_handshaker•15m ago•0 comments

Flight data bolsters claim China Eastern plane was deliberately crashed in 2022

https://www.cnn.com/2026/05/04/china/china-eastern-crash-ntsb-report-intl-hnk
3•tcp_handshaker•16m ago•0 comments

AI Worries Have Returned to Wall Street. Now Come Earnings

https://www.wsj.com/tech/ai-worries-have-returned-to-wall-street-now-come-earnings-d680e19c
4•gmays•16m ago•0 comments

My 2026 goal is to be bored more often

https://cdevroe.com/2025/12/16/2026-goal-boredom/
2•speckx•18m ago•0 comments

PR: Restoring Privacy and Freedom

https://github.com/eu-digital-identity-wallet/av-doc-technical-specification/pull/23
3•monneyboi•20m ago•0 comments

Utah Reduced Chronic Homelessness by 91 Percent (2015)

https://www.npr.org/2015/12/10/459100751/utah-reduced-chronic-homelessness-by-91-percent-heres-how
2•downbad_•21m ago•1 comments

Why this tribe is buying up acres of farmland – and flooding it

https://www.npr.org/2026/05/03/nx-s1-5806062/washington-tribe-restore-wetlands-fish
2•Brajeshwar•23m ago•0 comments

Poverty on the rise as RI families struggle to meet basic living expenses

https://rhodeislandcurrent.com/2026/05/04/federal-data-gaps-aside-poverty-on-the-rise-as-ri-famil...
2•chmaynard•23m ago•0 comments

Easiest way to create agents with local LLMs

https://github.com/iBz-04/quaynor
2•Ibz04•28m ago•0 comments

Back into Plato's Cave

https://akoepke.github.io/cave_umwelten/
2•Murfalo•28m ago•0 comments