Upshot: Gaussian sampling of node parameters rather than fixed values (rough sketch of my reading just after this list). This might offer one of the following:
* Better inference time accuracy on average
* Faster convergence during training
It probably costs additional inference and training compute.
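For concreteness, here's a minimal sketch of what I'm assuming the idea is: each weight gets drawn from its own learned Gaussian on every forward pass instead of being one fixed scalar. The class name and all details are mine, not the paper's.

```python
import torch
import torch.nn as nn

# Minimal sketch of my reading (not the paper's code): each weight is drawn
# from its own learned N(mu, sigma^2) on every forward pass via the
# reparameterization trick, rather than being one fixed scalar.
class GaussianSampledLinear(nn.Module):
    def __init__(self, in_features: int, out_features: int):
        super().__init__()
        self.mu = nn.Parameter(0.01 * torch.randn(out_features, in_features))
        self.log_sigma = nn.Parameter(torch.full((out_features, in_features), -3.0))
        self.bias = nn.Parameter(torch.zeros(out_features))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        eps = torch.randn_like(self.mu)                # fresh noise every call
        w = self.mu + torch.exp(self.log_sigma) * eps  # w ~ N(mu, sigma^2)
        return x @ w.t() + self.bias
```

If that reading is right, the extra cost is built in: every forward pass pays for the noise draw and the extra multiply-add per weight.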
The paper demonstrates worse results on MNIST, and shows the architecture is more than capable of handling the Iris test (which I hadn’t heard of; categorizing types of irises, I presume the flower, but maybe the eye?).
The paper claims to keep the number of parameters and depth the same, but it doesn’t report on:
* training time/FLOPs (probably more, I’d guess?)
* inference time/FLOPs (almost certainly more)
Intuitively, if each parameter carries a mean, a variance, and a mixing coefficient, then you have triple the data space per parameter. There’s no word on whether the networks were normalized by the total data taken up by the NN or just by the number of “parameters”.
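Back-of-the-envelope on that point, assuming each weight is replaced by a K-component Gaussian mixture with a mean, variance, and mixing coefficient per component (my framing, not the paper's):

```python
# Toy arithmetic, assuming each weight becomes a K-component Gaussian mixture
# with (mean, variance, mixing coefficient) per component. Even K=1 triples
# the storage relative to a plain fixed weight.
def storage_floats(n_weights: int, k_components: int = 1) -> int:
    return n_weights * 3 * k_components  # mu, sigma, pi per component

n = 784 * 128                             # one MNIST-sized dense layer
print(n)                                  # 100352 floats if weights are fixed
print(storage_floats(n, k_components=1))  # 301056 floats with the mixture
```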
Upshot: I don’t think this paper demonstrates any sort of benefit here or elucidates the tradeoffs.
Quick reminder: negative results are good, too. I’d almost rather see the paper framed that way.