Show HN: A GPU/VRAM filter for finding LLMs that will run on your hardware

https://www.whichllmmodel.com/app/text?local=true

2•mzubairtahir•1h ago

I kept seeing people ask "Which model i can run on my gpu", "will model X fit on my GPU". Thats why I built a filter on whichllmmodel that lets you search models by what will actually fit on your hardware (8GB, 16GB, 24GB, etc.) at a given quantization level.

Comments

necovek•1h ago

Very broken: "live minimums" do not allow me to remove 512 token limit and put a bigger number easily.

No unified or shared memory scenarios (like Apple's M platform or AMD's integrated GPU platform).

johng•18m ago

Was going to mention this. I'm on an M1 Max and wanted to see what the site suggested.

CRSilkworth•44m ago

very nice idea. Would be nice if you could also keep desired context as a free parameter and let the models tell you what maximum context you could have.

Show HN: Turns Any XPost into Carousel

Trump administration asks OpenAI to stagger release of new model

A Survey on Lawvere's Fixed-Point Theorem (2025)

I wrote a 750-page book on self-hosting in production

Supercomplete.ai

Facebook/Astryx

Show HN: Appaca – AI Workspace for Operators

Seeing Radio Waves at 30fps

MIT Open Courseware: Sailing Yacht Design

Fundamental principles of the universe called into question by two physicists

Ludwig Spec Driven Development MCP

The Long-Term Thinker

Show HN: Trophikos – a calm, ad-free recipe and cocktail library for iOS

Architectural Studies #02

Bayer scores landmark victory as Supreme Court overturns Roundup verdict

Echoes of the AI Winter

Researcher got a death threat computing Rosetta Stone for Indus script [video]

Fortune 500 bosses demanding staff RTO share 1 trait: Narcissism, research finds

AI agents are sensitive to nudges

New Business Formation Is Surging–Again

OpenAI will initially only release ChatGPT 5.6 to government-approved customers

The Latent Capability Ceiling: When a Bigger Model Won't Fix Your Problem

Agent Engineering Roadmap – a beginner-friendly guide to building AI agents

Leave Windows 11 Idle for 24 Hours and Watch What Happens [video][18 mins]

You can never replace your understanding

Faster KNN search in Manticore: 2-pass HNSW, batched distances, and AVX-512

Has UC Denver Lost Control of Their Website?

The Art of War

Agent Zero – A full Docker Linux system for your AI agent

Hydrating plants during the heatwave with DIY irrigation