frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Show HN: I made a VRAM Calculator in Hugging Face

https://chromewebstore.google.com/detail/hugging-face-vram-calcula/bioohacjdieeliinbpocpdhpdapfkhal
8•PieterBecking•1d ago
It's a chrome extension that automatically loads the specs from the Hugging Face model card into the calculation.

> To test it, install the extension (no registration/key needed) and navigate to a HF model page. Then click the "VRAM" icon on the top right to open the sidepanel.

You can specify quantization, batch size, sequence length, etc.

Works for inference & fine-tuning.

If it does not fit on the specified GPUs, it gives you an advise on how to still run it (e.g. lowering precision).

It is inspired at my work, where we were constantly exporting metrics from HF to estimate required hardware. Now, it saves us in the dev team quite some time and clients can use it, too.

Let me know what you think.

Comments

clemnt•1d ago
pretty cool!
PieterBecking•12h ago
thx!
reach-vb•1d ago
Nice! that's very cool!
PieterBecking•12h ago
Thanks!
rokizero•1d ago
i'm honestly surprised HF doesn't have this feature yet, very useful! will you publish the code on github?

any plans on adding more consumer-grade gpus?

PieterBecking•12h ago
Hey I published the code here: https://github.com/NEBUL-AI/HF-VRAM-Extension

I've added the 4090 and 5090 as well now, make sure to get version 0.5 of the extension

Google co-founder Sergey Brin suggests threatening AI for better results

https://www.theregister.com/2025/05/28/google_brin_suggests_threatening_ai/
2•mdp2021•28s ago•0 comments

New eco-hotel at Everglades national park built for age of super hurricanes

https://www.theguardian.com/us-news/2025/may/28/everglades-national-park-eco-hotel-resilient
1•howard941•2m ago•0 comments

Trump orders US chip designers to stop selling to China

https://www.ft.com/content/2c0db765-03ac-4820-8a02-806469848bee
1•doener•5m ago•0 comments

Show HN: Replace Twitter with AI that reads 10k+ daily sources for you

https://goldenscoop.live
1•last_dunyain•6m ago•0 comments

Telegram announces partnership with Musk's xAI

https://www.bbc.com/news/articles/cdxvr3n7wlxo
1•quantified•10m ago•0 comments

Reverse Engineering Linear's Sync Engine

https://github.com/wzhudev/reverse-linear-sync-engine
1•plondon514•12m ago•0 comments

Curating My Internet Experience

https://uscne.blogspot.com/
1•uscneps•13m ago•0 comments

VeraCrypt

https://veracrypt.jp/en/Home.html
1•smartmic•14m ago•1 comments

What does "Undecidable" mean, anyway

https://buttondown.com/hillelwayne/archive/what-does-undecidable-mean-anyway/
8•BerislavLopac•17m ago•0 comments

Learn to Use Email with Git

https://git-send-email.io/
3•stefankuehnel•17m ago•0 comments

Californian batteries set new output record

https://www.ess-news.com/2025/05/28/californian-batteries-set-new-output-record/
3•philipkglass•18m ago•1 comments

Two large Saharan dust clouds are headed west from Africa to the United States

https://www.accuweather.com/en/hurricane/saharan-dust-clouds-to-approach-florida-gulf/1779232
1•speckx•18m ago•0 comments

What Building an AI Product Taught Me About Human Bias

https://hackernoon.com/what-building-an-ai-product-taught-me-about-human-bias
1•smooke•19m ago•0 comments

MSEP.one – Molecular Systems and Engineering Platform

https://msep.one/
1•Zweihander•22m ago•0 comments

Singapore's fight to save its green spaces from development

https://www.nature.com/articles/d41586-025-01578-y
1•gnabgib•23m ago•0 comments

How Damaging Is Shouting "Fire" in a Crowded Theatre?

https://www.nber.org/papers/w33852
1•jaredwiener•23m ago•0 comments

Show HN: Deidentify – Go library for removing PII before sending data to LLMs

https://github.com/aliengiraffe/deidentify
1•nicolasbistolfi•23m ago•0 comments

China is now the biggest debt collector in the developing world

https://www.npr.org/2025/05/28/nx-s1-5413239/china-loans-developing-world-belt-road
2•geox•24m ago•0 comments

Differentiating IBM 3101, 3270 and 5250 terminal keyboards

https://sharktastica.co.uk/topics/3101-3270-5250_diffs
2•rbanffy•26m ago•0 comments

Al-LLM powered eBPF based security platform

1•gaurav1086•26m ago•0 comments

Europe threatens Apple with additional fines

https://www.computerworld.com/article/3997079/europe-threatens-apple-with-additional-fines.html
1•speckx•26m ago•0 comments

Spann and SPFresh vector indexing in Chroma

https://twitter.com/trychroma/status/1925262162567864620
1•HammadB•26m ago•0 comments

Coding Assistants Threaten the Software Supply Chain

https://martinfowler.com/articles/exploring-gen-ai/software-supply-chain-attack-surface.html
5•BerislavLopac•28m ago•0 comments

Google-DeepMind/formal-conjectures: collection of formalized conjectures in lean

https://github.com/google-deepmind/formal-conjectures
1•diginova•32m ago•0 comments

Unnecessariat (2016)

https://morecrows.wordpress.com/2016/05/10/unnecessariat/
1•kunzhi•34m ago•0 comments

Finding Love Optimally

https://mat.tepper.cmu.edu/blog/index.php/2011/02/27/finding-love-optimally/
2•TMWNN•34m ago•1 comments

Chinese rocket Zhuque-2E creates light streak in US night sky

https://www.digitalcameraworld.com/photography/astrophotography/did-the-chinese-government-ruin-your-astrophotography
1•astroimagery•35m ago•1 comments

The Hitchhiker's Guide to Dark Pools in DeFi

https://research.2077.xyz/the-hitchhikers-guide-to-dark-pools-in-defi-part-three
1•rapawel•36m ago•0 comments

HAProxy 3.2 Is Released

https://www.haproxy.com/blog/announcing-haproxy-3-2
2•causenad•37m ago•1 comments

Apache Iceberg in Modern Data Architectures: A Comprehensive Report

https://blog.twingdata.com/p/apache-iceberg-in-modern-data-architectures
1•dangoldin•37m ago•0 comments