frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•6mo ago

Comments

tocs3•6mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

Leave the Em Dash Alone

https://danielmiessler.com/blog/leave-the-em-dash-alone
1•speckx•3m ago•0 comments

Freelance Marketplace for African Software Developers and Virtual Assistants

https://devdey.com
1•captainXYZ•3m ago•0 comments

A brain implant that could rival Neuralink's enters clinical trials

https://www.nature.com/articles/d41586-025-03849-0
1•geox•5m ago•0 comments

Tuxedo Computers Cancels Snapdragon X1 Linux Laptop

https://www.tuxedocomputers.com/en/Discontinuation-of-ARM-notebooks-with-Snapdragon-X-Elite-SoC.t...
1•Venn1•6m ago•0 comments

GraphLite: An Embeddable Graph Database with ISO Graph Query Language Support

https://github.com/GraphLite-AI/GraphLite
3•cpard•9m ago•0 comments

Point-of-Care Transesophageal Echocardiography in Emergency and Intensive Care

https://www.mdpi.com/2227-9059/13/11/2680
2•PaulHoule•9m ago•0 comments

AgentxSuite – Open-Source Control Plane for AI Agents Using MCP

1•aliparnan•10m ago•0 comments

Next general training environment for superintelligence?

https://shash42.substack.com/p/automated-scientific-discovery-as
1•shash42•11m ago•1 comments

Block Lamp

https://arslan.io/2025/01/25/my-first-lighting-design-block-lamp/
2•wonger_•12m ago•0 comments

The Harvard Endowment's Biggest Public Investment Is Now Bitcoin

https://gizmodo.com/the-harvard-endowments-biggest-public-investment-is-now-bitcoin-2000686439
3•paulpauper•13m ago•1 comments

ChatGPT in Systematic Investing – Enhancing Risk-Adjusted Returns with LLMs

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5680782
1•paulpauper•14m ago•0 comments

What Now? Handling Errors in Large Systems

https://brooker.co.za/blog/2025/11/20/what-now.html
2•SchwKatze•15m ago•0 comments

Show HN: Even Turns, track your families turns

https://eventurns.com
1•gdesplin•23m ago•0 comments

IVs, manicures and no bathroom breaks: 24 hours selling cards with Joe Hollywood

https://www.cllct.com/sports-collectibles/memorabilia/ivs-manicures-and-no-bathroom-breaks-24-hou...
1•starkparker•24m ago•0 comments

FCC rolls back cybersecurity rules for telcos despite state hacking risks

https://www.bleepingcomputer.com/news/security/fcc-rolls-back-cybersecurity-rules-for-telcos-desp...
1•sans_souse•24m ago•0 comments

Why an AI 'godfather' is quitting Meta after 12 years

https://www.bbc.com/news/articles/cdx4x47w8p1o
1•iamtech•26m ago•1 comments

Is Write­Process­Memory faster than shared memory for transferring data

https://devblogs.microsoft.com/oldnewthing/20251119-00/?p=111800
1•ibobev•27m ago•0 comments

GE to Acquire InteleRad for $2.3B

https://www.reuters.com/legal/transactional/ge-healthcare-acquire-intelerad-23-billion-2025-11-20/
2•lostlogin•28m ago•0 comments

Show HN: Ìkọkúkọ 0.1.0 – Reactive Form Validation for Compose Multiplatform

https://github.com/quantipixels/ikokuko
1•theblackngel•28m ago•0 comments

Show HN: 3400 remote tech jobs from companies with a 3.5 rating on Greenhouse

https://www.remoteweek.io
1•TonySyrup•29m ago•0 comments

Foundry Local comes to Android–plus on-device speech, and on-prem support

https://devblogs.microsoft.com/foundry/foundry-local-comes-to-android/
1•ibobev•30m ago•0 comments

The Bilbao Effect

https://en.wikipedia.org/wiki/Starchitect
1•valzevul•31m ago•0 comments

Reinventing How .NET Builds and Ships (Again)

https://devblogs.microsoft.com/dotnet/reinventing-how-dotnet-builds-and-ships-again/
2•ibobev•32m ago•0 comments

Earl Grey Tea Intoxication

https://www.thelancet.com/journals/lancet/article/PIIS0140-6736(02)08436-2/abstract
1•valzevul•32m ago•0 comments

Rails update: per-adapter migration, hash-format support, MemoryStore caching

https://rubyonrails.org/2025/11/21/this-week-in-rails
2•andrewstetsenko•32m ago•0 comments

Cloudflare Dashboard and Cloudflare API service issues

https://www.cloudflarestatus.com/incidents/9ts6gx8q69xl
7•testplzignore•33m ago•1 comments

Critical Thinking during the age of AI

https://addyo.substack.com/p/critical-thinking-during-the-age
1•waprin•34m ago•0 comments

Architecting Uncertainty: Designing Reliable Systems on Top of LLMs

https://medium.com/data-science-collective/architecting-uncertainty-a-modern-guide-to-llm-based-s...
1•oddish-tv•34m ago•1 comments

Spring Boot 4

https://spring.io/blog/2025/11/20/spring-boot-4-0-0-available-now/
3•sh_tomer•34m ago•0 comments

Coding Trance Music [video]

https://www.youtube.com/watch?v=GWXCCBsOMSg
2•alxjsn•35m ago•0 comments