frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•9mo ago

Comments

tocs3•9mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

UK PM: "No platform gets a free pass"

https://www.gov.uk/government/news/pm-no-platform-gets-a-free-pass-government-takes-action-to-kee...
1•kaelyx•1m ago•0 comments

Thinking Machines Lab Will Hire Me. They Just Don't Know It Yet.

https://medium.com/@redjonzaci/thinking-machines-lab-will-hire-me-they-just-dont-know-it-yet-0a59...
1•redjonzaci•5m ago•0 comments

Show HN: ACDC – A non-agentic AI coding tool with L0-L3 context cache tiering

https://github.com/flatmax/AI-Coder-DeCoder
1•flatmax•6m ago•1 comments

Declarative, Inquisitive, then Imperative (2017) [pdf]

https://www.forth.org/svfig/kk/11-2017-Falvo.pdf
1•tosh•6m ago•0 comments

The New Way to Build a Startup (YC Video, YouTube)

https://www.youtube.com/watch?v=rWUWfj_PqmM
1•wuschel•6m ago•0 comments

Agent Skills Hub – Security first directory for AI agent skills and MCP

https://agentskillshub.dev/
1•cana2026•10m ago•0 comments

Show HN: Chrome extension that turns your keyboard into a piano while you type

https://chromewebstore.google.com/detail/qwerty-jam/adaegmlplifnnokcjafmfakheoingnfp
1•anshika_vijay•11m ago•0 comments

Test

1•iamgrootali•12m ago•0 comments

Comparing how 3 AI assistants implement memory

https://www.maximem.ai/blog/ai-apps-memory
1•gdad•13m ago•1 comments

Socio – A WebSocket Real-Time Communication (RTC) API Full-Stack Framework

https://github.com/Rolands-Laucis/Socio
1•Rolands_Laucis•14m ago•1 comments

SwiftForth IDE for Windows, Linux, macOS

https://www.forth.com/swiftforth/
1•tosh•14m ago•0 comments

Calculating the shortest path using just CSS

https://css-tip.com/graph-theory/
1•samwho•14m ago•0 comments

Ask HN: How's Business These Days for Upwork Freelancers?

1•burnerToBetOut•15m ago•0 comments

Gitas – A tool for Git account switching

https://github.com/letmutex/gitas
1•letmutex•18m ago•0 comments

DAG PROTOCOL with 3 dimensions, graph to mash

https://github.com/navigatorbuilds/elara-protocol
1•NenadVasic•18m ago•1 comments

AI is slowly munching away my passion

https://whynot.fail/human/ai-is-slowly-munching-away-my-passion/
1•birdculture•20m ago•0 comments

Show HN: I've build a self hosted convex/Firebase/Supabase alternative

https://linkedrecords.com/
1•WolfOliver•22m ago•0 comments

Slopware AI: Ship Garbage Even Faster

https://slopware.ai/
1•hassenaa•25m ago•1 comments

Breach / Stealer-Log / Identity Exposure Services Comparison (With Scoring)

https://github.com/infostealers-stats/Credential-and-breach-monitoring
6•webzio•25m ago•0 comments

For Warmth: Thich Nhat Hanh's Poetic Antidote to Anger

https://www.themarginalian.org/2026/02/16/for-warmth-thich-nhat-hanh/
1•robtherobber•27m ago•0 comments

The Menu, production draft script (2022) [pdf]

https://deadline.com/wp-content/uploads/2023/01/The-Menu-Read-The-Screenplay.pdf
1•nxobject•27m ago•0 comments

UK banks plan Visa and Mastercard alternative amid Trump fears

https://www.theguardian.com/business/2026/feb/16/uk-bank-bosses-plan-visa-mastercard-alternative
7•tosh•28m ago•1 comments

KDE Plasma 6.6 Released

https://kde.org/announcements/plasma/6/6.6.0/
6•mkurz•29m ago•0 comments

A Comparative Security Analysis of Three Cloud-Based Password Managers

https://eprint.iacr.org/2026/058
1•u1hcw9nx•30m ago•1 comments

Show HN: [Keyboard Navigator] Navigate any page without a mouse

https://chromewebstore.google.com/detail/keyboard-navigator/ofncobnikhkdaodjckpehpicnjomjjgm
1•kkimdev•31m ago•0 comments

Show HN: Hunter Logo API free replacement of Clearbit

https://hunter.io/api/logo
1•jeanro_hunter•31m ago•1 comments

- -dangerously-skip-reading-code

https://olano.dev/blog/dangerously-skip/
1•ingve•31m ago•0 comments

OpenClaw Partners with VirusTotal for Skill Security

https://openclaw.ai/blog/virustotal-partnership
1•eternalaeonic•32m ago•0 comments

The Servo project and its impact on the web platform ecosystem

https://servo.org/slides/2026-02-fosdem-servo-web-platform/
2•todsacerdoti•33m ago•0 comments

Lessons learned from rebuilding a 19-year-old platform in one week with Claude

https://gist.github.com/janit/6559e97ceb00e444b9aecb3c00dfdf16
1•velmu•34m ago•0 comments