frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Claude Sonnet 5

https://www.anthropic.com/news/claude-sonnet-5
982•marinesebastian•10h ago•555 comments

Claude Code is steganographically marking requests

https://thereallo.dev/blog/claude-code-prompt-steganography
1587•kirushik•13h ago•457 comments

Supersonic flight returning to US after half-century ban

https://www.forbes.com/sites/suzannerowankelleher/2026/06/30/faa-supersonic-flight-no-boom/
51•lobbly•2h ago•36 comments

Google copybara: moving code between repositories

https://github.com/google/copybara
122•reconnecting•5h ago•17 comments

Department of Commerce has lifted export controls on Claude Fable 5 and Mythos 5

https://twitter.com/AnthropicAI/status/2072106151890809341
441•Pragmata•5h ago•216 comments

Forestiere Underground Gardens

https://en.wikipedia.org/wiki/Forestiere_Underground_Gardens
40•onemoresoop•3h ago•4 comments

Claude Science

https://claude.com/product/claude-science
405•lebovic•11h ago•129 comments

Nano Banana 2 Lite

https://deepmind.google/models/gemini-image/flash-lite/
332•minimaxir•12h ago•135 comments

How does a pull-back car work? Illustrated teardown

https://mechanical-pencil.com/products/car
138•Muhammad523•2d ago•28 comments

From brain waves to words: a new path to communication without surgery

https://ai.meta.com/blog/brain2qwerty-brain-ai-human-communication/?_fb_noscript=1
130•alok-g•7h ago•70 comments

Leanstral 1.5

https://docs.mistral.ai/models/model-cards/leanstral-1-5-26-06
133•vetronauta•8h ago•36 comments

Segmenting Robot Video into Actionable Subtasks

https://macrodata.co/blog/annotating-robot-video-subtasks
6•tomaspduarte•1d ago•0 comments

CERN bids farewell to the LHC and enters Long Shutdown 3

https://home.cern/cern-bids-farewell-to-the-lhc-and-enters-long-shutdown-3/
166•HelloUsername•1d ago•37 comments

I ported Kubernetes to the browser

https://ngrok.com/blog/i-ported-kubernetes-to-the-browser
206•peterdemin•8h ago•66 comments

Ante: A new way to blend borrow checking and reference counting

https://verdagon.dev/blog/ante-blending-borrowing-rc
67•g0xA52A2A•2d ago•15 comments

Deriving the SVD (Single Value Decomposition) from scratch

https://stillthinking.net/posts/connections-in-math-svd/
23•pcael•2d ago•4 comments

Hatari – Online Atari ST/STE/TT/Falcon Emulator

https://hatari.frama.io/hatari/online/hatari.html
45•gregsadetsky•6h ago•4 comments

Scaling Laws, Carefully

https://lilianweng.github.io/posts/2026-06-24-scaling-laws/
40•tehnub•4d ago•11 comments

I built a mmWave material classification radar (2025)

https://gauthier-lechevalier.com/radar
159•GL26•11h ago•39 comments

Tokyo has only two barley tea makers, we visited one to see how mugicha is made

https://soranews24.com/2026/06/30/tokyo-has-only-two-barley-tea-makers-and-we-visited-one-to-see-...
101•zdw•9h ago•20 comments

Stroustrup's Rule (2024)

https://buttondown.com/hillelwayne/archive/stroustrups-rule/
72•bmacho•3d ago•10 comments

Building a custom octocopter from scratch with no prior hardware experience

https://karolina.mgdubiel.com/drone/
338•noleary•3d ago•71 comments

TabFM: A zero-shot foundation model for tabular data

https://research.google/blog/introducing-tabfm-a-zero-shot-foundation-model-for-tabular-data/
59•brandonb•6h ago•8 comments

Redeploying Fable 5

https://www.anthropic.com/news/redeploying-fable-5
53•meetpateltech•1h ago•9 comments

Long Island's decommissioned nuclear power plant

https://nickcarr.com/scouting-a-decommissioned-nuclear-power-plant/
104•mkmk•6d ago•36 comments

Structural Correctness

https://blog.sao.dev/structural-correctness/
15•stuartaxelowen•1d ago•0 comments

Have you restarted your computer this week?

https://taonaw.com/2026/06/27/have-you-restarted-your-computer.html
135•surprisetalk•14h ago•241 comments

Knoppix

https://www.knopper.net/knoppix/index-en.html
276•hoangvmpc•16h ago•108 comments

Reading the internals of Postgres: Database cluster, databases, and tables

https://www.buraksen.dev/articles/internals-of-postgresql-db-cluster-and-tables
67•buraksen•1d ago•1 comments

Hengefinder

https://hengefinder.com/
30•bookofjoe•2d ago•10 comments
Open in hackernews

Scaling Laws, Carefully

https://lilianweng.github.io/posts/2026-06-24-scaling-laws/
40•tehnub•4d ago

Comments

aspenmartin•4d ago
I really wish more people skeptical of AI capabilities would read about scaling laws -- Lilian is always so marvelous at giving a deep overview of the technical side but the whole point of this is: there are scaling laws, and they hold and continue to hold. This is such a huge basis for the predictions about AI capabilities for the past like 5 years.
FromTheFirstIn•4d ago
And sitting right next to the data and compute factors in every cross entropy loss equation is the entropy of the language, which is just a fixed constant. There’s such a hard cap on cross entropy loss training and I never hear it come up!
aspenmartin•4d ago
Right but that is context dependent; it drops with context length, depends on tokenizer, etc. It doesn't end up being super relevant, despite the fact that if you look at the loss for real models it's relatively large in absolute terms. But that doesn't really matter -- all of the interesting stuff happens once you start getting closer and closer to it. You've gotten past all of the easy tokens that dominate the entropy and now you get to the really challenging ones that we care about (like e.g. very difficult reasoning about a next step).
FromTheFirstIn•4d ago
My understanding is that the true entropy floor of a language is intractable- regardless of context length there will be “unpredictable” tokens where cross entropy loss is bound to happen. Even with infinite parameters and data you’ll still have a chance at failing to predict the next token correctly a decent chunk of the time.

Also, linear gains in context length scale quadratically with compute because of attention, so depending on context growth means taking a bath on GPUs for as long as you can, right?

graboy•3d ago
Yeah I mean, if you and I were to play the word-guessing game where you needed to guess what next word I'm thinking of, there's always uncertainty in your guess because it's a game of partial information - you can't fully observe my inner state. But that doesn't mean you couldn't evolve a strategy that spends a really long time thinking and analyzing to get asymptotically close to the best guess. There's no limit on that intelligence.
FromTheFirstIn•3d ago
Isn’t the limit exactly what you’re describing? There’s always uncertainty, and your asymptote can approach its limit but it does have a limit. That’s the limit to the intelligence. And this is just for cross entropy loss- even if you could get loss to 0, I’m still not convinced at all that an enormous semantic map and its convoluted geometries amounts to intelligence.
aspenmartin•3d ago
If you get to E you have generated a Bayes-optimal model of the conditional distribution (as in, next token conditional on context). This is something I thought too, but even if you're a fraction of a nat above the floor, you could have enormous headroom in performance left because there are still rare tokens amongst the irreducible noise that require so much capability to predict. It's not to suggest there truly is no cap on capability, but just that this constant isn't really saying what that is.
an0malous•2h ago
Why should the skeptics be reading it? The scaling laws show diminishing returns on more training data and larger models.

From the Kaplan scaling laws paper:

> We have observed consistent scalings of language model log-likelihood loss with non-embedding parameter count N, dataset size D, and optimized training computation Cmin, as encapsulated in Equations (1.5) and (1.6). Conversely, we find very weak dependence on many architectural and optimization hyperparameters. Since scalings with N,D,Cmin are power-laws, there are diminishing returns with increasing scale.

So the skeptics are right to be skeptical of LLMs being all you need for continued advancement in this space. It seems like the believers are the ones who need to learn about the scaling laws.

gdiamos•1h ago
When I first saw scaling laws in that deep speech experiment notebook, I didn’t believe it could be real. I was worried for months that we made a mistake, or that it only worked for that one dataset.

I started to believe it after we (Joel Hestness in particular) reproduced it in so many experiments in “scaling is predictable empirically”.

The OpenAI work replicated it in a completely different environment, and at that point I was sure it was real.

Sometimes people ask me why I was so surprised by it. Prior work like Banko and Brill and the unreasonable effectiveness of data argued for more data. ML theory had similar models for toy problems, eg coin flips.

At the time I thought deep learning was supposed to be complex. Speech and language datasets seemed much more complex than toy problems. Optimization of deep transformers was complex.

The idea that it was possible for the whole thing to be governed by a 3 term equation seemed too simple. The implication was that it was simple to manufacture intelligence.

Ten years later, I still think it is still the most interesting observation I have seen. We are still learning what it looks like to live in a world where it is possible to manufacture intelligence.

nok22kon•45m ago
the scaling laws work within a "generation". but what about across them?

GPT-3 was 175B, models like Gemma4 with 31B vastly outperform it, so there is more to it

as Karpathy noted, the initial GPTs were trained on complete garbage (literally, the average document from the Common Crawl is random nonsense), yet they worked. now we can use present LLMs to curate the data for the next generation

FromTheFirstIn•3d ago
Yeah, it not a linear cap (x% entropy doesn’t mean x% wrong) but it does seem like a hard cap. To be honest, the more I’ve understood scaling laws the more I think that the elephant in the LLM room is the entropy of the language. It explains why coding languages are so much more tractable (they’ve got WAY less entropy) and it explains why we haven’t seen a step function in capabilities for LLMs since GPT-4 outside of making specific toolings for particular contexts. I think E is coming to dominate and there isn’t a workaround for it.