frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Puzzling Success of Overparameterization: Lottery Tickets or Escape Dimensions?

https://infoscience.epfl.ch/entities/publication/9a49779b-f9f8-448d-b3d1-737c78455309
28•rbanffy•1d ago

Comments

Scene_Cast2•1h ago
IIRC the original author of the Lottery Ticket Hypothesis now disavows that idea.

One intuitive way of looking at it is like so - let's say that you have a gaussian-looking plot. You want to fit a gaussian. You have a stupid simple model where you can slide your gaussian left and right.

If your initial starting point happens to be roughly within range, great, your optimizer will take care of it for you and slide it into the correct place. If you're too far, too bad, no meaningful gradient.

Instead, neural nets give you the option to spawn a gaussian anywhere you please. In this case, no sliding is necessary, but it comes at a heavy parametrization cost.

WithinReason•43m ago
How is this view inconsistent with the lottery ticket hypothesis?
getnormality•27m ago
A while ago a lot of the discussion about overparameterization was about explaining "double descent", the observation that test error doesn't descend monotonically and actually hits a local maximum around the point where the model has just enough parameters to interpolate the data. My favorite article about double descent looks at this in terms of splines [1]. If I can try to summarize that article: when you are designing a parametrized model to fit to data, you have a choice. You can either:

1. Avoid overparameterization by design. Manually create or choose a space of functions that has limited degrees of freedom by construction.

2. Accept overparameterization and regularize.

The latter tends to be more robust, because of the bitter lesson. It's not practical to manually design an ideal, on-demand, just-right limited-parameter model for every dataset we are presented with. The best way to approach that ideal, it turns out, is really to just let the computer figure it out via regularized optimization over an overparameterized space.

Statisticians started moving in favor of overparameterization long before deep learning got off the ground. This trend dates back at least to the machine learning bible, Elements of Statistical Learning (2001).

[1] https://mlu-explain.github.io/double-descent/

You can't unit test for taste

https://dev.karltryggvason.com/you-cant-unit-test-for-taste/
100•kalli•1d ago•38 comments

Half-Life 2 in a Browser

https://hl2.slqnt.dev/
454•panza•8h ago•185 comments

The Disappearance of Japan's Animators

https://economist.com/interactive/1843/2026/06/19/the-strange-disappearance-of-japans-animators
32•andsoitis•3d ago•21 comments

Anthropic says Alibaba illicitly extracted Claude AI model capabilities

https://www.reuters.com/world/china/anthropic-says-alibaba-illicitly-extracted-claude-ai-model-ca...
605•htrp•18h ago•975 comments

Show HN: Turn native language audio into flashcards and shadowing practice

https://lingochunk.com/try
21•alder•3h ago•8 comments

LastPass notifies users of yet another data breach

https://9to5mac.com/2026/06/23/lastpass-notifies-users-of-yet-another-data-breach/
218•mooreds•4h ago•102 comments

Puzzling Success of Overparameterization: Lottery Tickets or Escape Dimensions?

https://infoscience.epfl.ch/entities/publication/9a49779b-f9f8-448d-b3d1-737c78455309
28•rbanffy•1d ago•3 comments

OpenAI unveils its first custom chip, built by Broadcom

https://techcrunch.com/2026/06/24/openai-unveils-its-first-custom-chip-built-by-broadcom/
762•jamdesk•20h ago•437 comments

Cloudflare launched self-managed OAuth for all

https://blog.cloudflare.com/oauth-for-all/
261•terryds•12h ago•113 comments

Wikipedia Workers in Britain set global first by seeking union recognition

https://utaw.tech/news/wikipedia-recognition
158•chobeat•7h ago•155 comments

Tell HN: OpenAI has started putting ads on paid programs

23•shantnutiwari•1h ago•11 comments

Bohemia Interactive: Cold War Assault Remastered Source Code on GitHub

https://github.com/BohemiaInteractive/CWR
151•dewey•2d ago•30 comments

Blogging can just be stating the obvious

https://blog.jim-nielsen.com/2026/blogging-stating-the-obvious/
340•Curiositry•14h ago•109 comments

LuaJIT 3.0 proposed syntax extensions

https://github.com/LuaJIT/LuaJIT/issues/1475
193•phreddypharkus•14h ago•112 comments

45°C cooling design cuts data center water use to near zero

https://blogs.nvidia.com/blog/liquid-cooling-ai-factories/
410•nitin_flanker•1d ago•329 comments

Medical students are using popular research tool to pump out misleading studies

https://www.science.org/content/article/medical-students-are-using-popular-research-tool-pump-out...
106•rndsignals•12h ago•62 comments

Show HN: Secs-man, a secrets manager you can (not) rely on

https://github.com/Fran314/secrets-manager-rs
12•Fran314•2h ago•5 comments

GLM-5.2 is a step change for open agents

https://www.interconnects.ai/p/glm-52-is-the-step-change-for-open
301•vantareed•2d ago•180 comments

Lies, Damn Lies and Database Benchmarks

https://questdb.com/blog/lies-damn-lies-and-database-benchmarks/
38•eigenBasis•2d ago•15 comments

Dostoyevsky isn't difficult

https://www.autodidacts.io/dostoyevsky-isnt-difficult/
190•surprisetalk•2d ago•232 comments

Show HN: StartupsBR – A map of Brazilian startups

https://www.startupsbr.com/sao-paulo
43•leonagano•5d ago•21 comments

Countries are competing to see which can carry out mass surveillance the best

https://mullvad.net/en/why-privacy-matters/state-mass-surveillance
192•Cider9986•1h ago•62 comments

RubyLLM: A Ruby framework for all major AI providers

https://rubyllm.com/
417•doener•1d ago•70 comments

Dolphin Emulator Progress Release 2606

https://dolphin-emu.org/blog/2026/06/25/dolphin-progress-report-release-2606/
176•exploraz•4h ago•23 comments

Words, Words, Words

https://aeon.co/essays/literature-fans-should-welcome-ai-as-a-fellow-wordsmith
23•benbreen•2d ago•8 comments

Ask HN: What surprised you about Estonia e-Residency and running an Estonian OÜ?

5•jvilalta•27m ago•1 comments

Qualcomm to Acquire Modular

https://www.reuters.com/business/qualcomm-buy-ai-startup-modular-2026-06-24/
222•timmyd•1d ago•83 comments

PR spam today looks like email spam in the early 2000s

https://www.greptile.com/blog/prs-on-openclaw
248•dakshgupta•1d ago•143 comments

The Xteink X4 E-Ink Reader

https://blog.omgmog.net/post/xteink-x4-e-ink-reader/
289•felixdoerp•22h ago•171 comments

Federal agents track down woman, demand she remove Instagram post about ICE

https://www.syracuse.com/news/2026/06/federal-agents-track-down-syracuse-woman-demand-she-remove-...
50•coloneltcb•28m ago•14 comments