frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."

LinkedIn Causing 7000 Requests/Second? CPU Spike Across Fresh Systems

https://www.youtube.com/watch?v=24wNHf9JyMY
1•praveenscience•2m ago•0 comments

Is the Constitution Broken

https://www.harvardmagazine.com/social-sciences/is-the-constitution-broken
2•KnuthIsGod•3m ago•1 comments

How Much Is Eight Dollars?

https://defector.com/how-much-is-eight-dollars
1•MaysonL•6m ago•0 comments

The creator of Node.js says the era of writing code is over

https://jpcaparas.medium.com/the-creator-of-node-js-says-the-era-of-writing-code-is-over-8320c868...
2•CharlesW•11m ago•2 comments

Uber, often sued over car crashes, pushes for law to limit lawyer fees

https://www.latimes.com/california/story/2026-01-17/uber-personal-injury-lawsuits-california-law
3•sizzle•17m ago•1 comments

F5 tackles AI security with new platform extensions

https://www.networkworld.com/article/4118696/f5-tackles-ai-security-with-new-platform-extensions....
1•ohjeez•18m ago•0 comments

Uber Pushes to Cap Personal Injury Lawyer Payouts A.G. 25-0022 [pdf]

https://oag.ca.gov/system/files/initiatives/pdfs/25-0022A1%20%28Self%20Dealing%20Attorneys%29.pdf
4•sizzle•20m ago•0 comments

Austrian cow shows first case of flexible, multi-purpose tool use in cattle

https://www.amazon.com/IASEAHK-Cushions-Dining-Chairs-Kitchen/dp/B0DJCT6H9D/ref=sr_1_9?dib=eyJ2Ij...
2•bookmtn•21m ago•0 comments

SearchGuard: How Google detects bots and what the SerpAPI lawsuit reveals

https://searchengineland.com/inside-google-searchguard-467676
3•sans_souse•22m ago•0 comments

Show HN: AlgoSync – A social space for builders to share the "real" tech journey

https://www.algosyncverse.com/
1•lyquochao84•24m ago•1 comments

Netflix Ruined Korean Dramas Forever [video]

https://www.youtube.com/watch?v=p1_j6izmEX4
2•danhite•24m ago•1 comments

Opensync

https://github.com/waynesutton/opensync
1•handfuloflight•24m ago•0 comments

Bank of England 'must plan for a financial crisis triggered by aliens'

https://www.msn.com/en-gb/news/uknews/bank-of-england-must-plan-for-a-financial-crisis-triggered-...
5•matthewsinclair•34m ago•6 comments

EnergyNet Explained: Internetification of Energy Distribution

https://arxiv.org/abs/2509.08152
1•zekrioca•36m ago•0 comments

Scaling long-running autonomous coding

https://simonwillison.net/2026/Jan/19/scaling-long-running-autonomous-coding/
2•srameshc•36m ago•0 comments

Ask HN: Do Hackathons Still Matter in 2026?

4•rafaepta•38m ago•0 comments

React Native Windows v0.81

https://devblogs.microsoft.com/react-native/%f0%9f%9a%80react-native-windows-v0-81-is-here/
1•soheilpro•38m ago•0 comments

AI Is a Horse (2024)

https://kconner.com/2024/08/02/ai-is-a-horse.html
1•zdw•38m ago•0 comments

SolarPunk: Autonomous redistribution system with anti-corporate code

https://github.com/MeekoThaRaccoon/SolarPunk
1•RealSolarPunk•39m ago•1 comments

Ygrep: Fast, local, indexed code search tool optimized for AI coding assistants

https://github.com/yetidevworks/ygrep
1•kristianp•43m ago•0 comments

How to Kill a Fish

https://www.newyorker.com/magazine/2026/01/26/how-to-kill-a-fish
1•mitchbob•45m ago•1 comments

Show HN: NPM install a WASM based Linux VM for your agents

https://github.com/deepclause/agentvm
1•schmuhblaster•47m ago•1 comments

We've Turned Off AI‑Assisted Answers

https://noai.duckduckgo.com/
10•doener•49m ago•2 comments

Volvo EX60: First Gemini-Powered EV vs. BMW iX3 Alexa+

https://www.techradar.com/vehicle-tech/dash-cams/the-worlds-first-gemini-powered-ev-lands-this-we...
2•gfortaine•50m ago•2 comments

London Eye architect proposes 14-mile tidal power station off Somerset coast

https://www.theguardian.com/environment/2025/dec/27/london-eye-architect-proposes-14-mile-tidal-p...
2•PaulHoule•54m ago•0 comments

Prisma-sql – Direct SQL generation from Prisma queries

https://www.npmjs.com/package/prisma-sql
1•multipliedtwice•55m ago•1 comments

Why Walmart still doesn't support Apple Pay

https://9to5mac.com/2026/01/18/heres-why-walmart-still-doesnt-support-apple-pay/
9•CharlesW•55m ago•1 comments

ClovaLink: Enterprise file management without the enterprise price tag

https://github.com/ClovaLink/ClovaLink
1•thunderbong•58m ago•0 comments

Reticulum, a secure and anonymous mesh networking stack

https://github.com/markqvist/Reticulum
18•brogu•59m ago•4 comments

Intelligent Wearable for Dysarthria Recovery Post-Stroke

https://www.nature.com/articles/s41467-025-68228-9
2•gnabgib•59m ago•0 comments