frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Go Client for GitHub Actions Runner Scale Set APIs

https://github.com/actions/scaleset
1•crohr•54s ago•0 comments

Show HN: Simple, Fast, Accessible Fine-Tuning

https://www.commissioned.tech/
1•rbshamsu•1m ago•0 comments

"The appearance of XXX's name in the files does not imply wrongdoing"

1•chrisjj•1m ago•0 comments

'Ripping' Clips for YouTube Reaction Videos Can Violate the DMCA, Court Rules

https://torrentfreak.com/ripping-clips-for-youtube-reaction-videos-can-violate-the-dmca-court-rules/
1•Signez•1m ago•0 comments

ClawHavoc: Malicious Clawed Skills Found by the Bot They Were Targeting

https://www.koi.ai/blog/clawhavoc-341-malicious-clawedbot-skills-found-by-the-bot-they-were-targe...
1•Santas•2m ago•0 comments

Is almost everyone wrong about America's AI power problem?

https://epoch.ai/gradient-updates/is-almost-everyone-wrong-about-americas-ai-power-problem
1•toomuchtodo•2m ago•0 comments

Show HN: MS Paint for Android

https://play.google.com/store/apps/details?id=com.sketch.paint&hl=en_US
1•Codegres•2m ago•0 comments

RCade: Building a Community Arcade Cabinet

https://www.frankchiarulli.com/blog/building-the-rcade/
1•fcjr•3m ago•1 comments

AI is just the latest Monoculture

https://www.deusinmachina.net/p/ai-is-just-the-latest-monoculture
1•Decabytes•3m ago•0 comments

Large-Scale Research for Repurposing and Supplements

https://goodscience.substack.com/p/proposing-an-nih-high-leverage-trials
1•toomuchtodo•5m ago•1 comments

Smashing the Stack in the 21st Century

https://thesquareplanet.com/blog/smashing-the-stack-21st-century/
1•barishnamazov•5m ago•0 comments

Attack of the Clones: B210 AD9361 RF 70MHz-6GHz Software Defined Radio SDR

https://opensourcesdrlab.com/products/b210-ad9361
1•teleforce•6m ago•0 comments

Vibes Are All You Need

https://tiangewu.com/#/vibes-are-all-you-need
1•tiangewu•7m ago•0 comments

We're Launching Our Second App!

https://www.nullboard.xyz/
1•thximpulse•8m ago•0 comments

The 'elite' couples breeding to save mankind

https://www.telegraph.co.uk/family/life/pronatalists-save-mankind-by-having-babies-silicon-valley/
1•Anon84•9m ago•0 comments

The GitButler CLI

https://blog.gitbutler.com/but-cli
1•dahjelle•9m ago•0 comments

Show HN: DeepBrainz-R1 – Reasoning-First Small Models for Agentic Systems

https://huggingface.co/DeepBrainz
1•DeepBrainz•10m ago•0 comments

Gambit

http://gambitscheme.org/
2•tosh•10m ago•0 comments

Small LLMs vs. Fine-Tuned Bert for Classification: 32 Experiments

https://alex-jacobs.com/posts/beatingbert/
1•tacoooooooo•12m ago•0 comments

Munich makes digital sovereignty measurable with its own score

https://www.heise.de/en/news/Munich-makes-digital-sovereignty-measurable-with-its-own-score-11164...
1•smurda•14m ago•0 comments

Set up a Bazel build that targets an MCU

https://pigweed.dev/build/bazel/mcu-setup.html
1•kaycebasques•15m ago•0 comments

Font Playground

https://www.abishekvenkat.com/fontplay
1•archb•15m ago•0 comments

Competence as Tragedy

https://crowprose.com/blog/competence-as-tragedy/
1•birdculture•15m ago•1 comments

The most misunderstood graph in AI

https://www.technologyreview.com/2026/02/05/1132254/this-is-the-most-misunderstood-graph-in-ai/
2•Brajeshwar•15m ago•0 comments

Google set to double AI spending to $185B after strong earnings

https://www.ft.com/content/22d97d8e-1101-4b1b-8a28-66054dfa363a
2•Brajeshwar•16m ago•0 comments

Why our ancestors had straight teeth without braces

https://www.popsci.com/science/why-need-braces/
1•Brajeshwar•16m ago•0 comments

Show HN: A package manager for agent skills with built-in evals

https://tessl.io/
2•guypod•16m ago•0 comments

Jujutsu v0.38.0 Released

https://github.com/jj-vcs/jj/releases/tag/v0.38.0
1•todsacerdoti•17m ago•0 comments

Show HN: OpenWatch – Open-source alternative to YouTube

https://github.com/openwatch-app/openwatch
1•ge0rg3e•17m ago•0 comments

CoreWeave walks a debt tightrope, counting on key customers to be its safety net

https://deepquarry.substack.com/p/coreweave-walks-a-debt-tightrope
2•zerosizedweasle•19m ago•0 comments
Open in hackernews

The Unreasonable Effectiveness of Reasonless Intermediate Tokens

https://arxiv.org/abs/2505.13775
4•YeGoblynQueenne•8mo ago

Comments

tocs3•8mo ago
I asked ChatGPT to restate this in more laymen's terms (posted below) and I am not to surprised at the answer.

"Lately, some AI models have shown impressive abilities to solve complex problems, and many people credit this to a method called Chain of Thought (CoT), where the model is trained to think through steps like a human might. In this paper, we take a closer look at that idea to see if it's really what's driving better performance.

We focus on the model’s step-by-step thinking (the words it generates along the way) — often treated like human "thoughts" — and examine whether these actually help the model solve problems more accurately. To test this, we train AI models using clean, correct step-by-step reasoning paths and final answers, all based on a known solving method (A* search). This lets us check both the final answers and the reasoning steps to see how they relate.

Interestingly, we find that even when a model gives the right answer, its reasoning steps can still be wrong or messy. To go further, we even train models using completely random and incorrect reasoning steps — and surprisingly, they still perform about the same, and sometimes even better, than those trained on correct steps.

This suggests that the step-by-step "thoughts" the model shows aren’t as meaningful or reliable as many assume. In short, just because a model looks like it’s reasoning through a problem doesn’t mean it actually is — and we should be careful not to treat its outputs as if it thinks like a human or follows strict logic."