frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: docker/model-runner – an open-source tool for local LLMs

https://github.com/docker/model-runner
17•ericcurtin•13h ago
Hey Hacker News,

We're the maintainers of docker/model-runner and wanted to share some major updates we're excited about.

Link: https://github.com/docker/model-runner

We are rebooting the community:

https://www.docker.com/blog/rebooting-model-runner-community...

At its core, model-runner is a simple, backend-agnostic tool for downloading and running local large language models. Think of it as a consistent interface to interact with different model backends. One of our main backends is llama.cpp, and we make it a point to contribute any improvements we make back upstream to their project. It also allows people to transport models via OCI registries like Docker Hub. Docker Hub hosts our curated local AI model collection, packaged as OCI Artifacts and ready to run. You can easily download, share, and upload models on Docker Hub, making it a central hub for both containerized applications and the next wave of generative AI.

We've been working hard on a few things recently:

- Vulkan and AMD Support: We've just merged support for Vulkan, which opens up local inference to a much wider range of GPUs, especially from AMD.

- Contributor Experience: We refactored the project into a monorepo. The main goal was to make the architecture clearer and dramatically lower the barrier for new contributors to get involved and understand the codebase.

- It's Fully Open Source: We know that a project from Docker might raise questions about its openness. To be clear, this is a 100% open-source, Apache 2.0 licensed project. We want to build a community around it and welcome all contributions, from documentation fixes to new model backends.

- DGX Spark day-0 support, we've got it!

Our goal is to grow the community. We'll be here all day to answer any questions you have. We'd love for you to check it out, give us a star if you like it, and let us know what you think.

Thanks!

Comments

ericcurtin•12h ago
Hi everyone, we're the maintainers.

We're rebooting the model-runner community and wanted to share what we've been up to and where we're headed.

When we first built this, the idea was simple: make running local models as easy as running containers. You get a consistent interface to download and run models from different backends (llama.cpp being a key one) and can even transport them using familiar OCI registries like Docker Hub.

Recently, we've invested a lot of effort into making it a true community project. A few highlights:

- The project is now a monorepo, making it much easier for new contributors to find their way around.

- We've added Vulkan support to open things up for AMD and other non-NVIDIA GPUs.

- We made sure we have day-0 support for the latest NVIDIA DGX hardware.

shelajev•12h ago
Nice, I really like the recent Vulkan support.
ericcurtin•12h ago
Thanks very much. It worked well for you? Which hardware? :) Any other feedback, keep it coming!
jkoenig134•12h ago
Awesome!
ericcurtin•12h ago
What did you like? Anything stand out?
davidnet•12h ago
Docker model run is now part of my demos when deploying ml stack stuff, pretty sure that this is removing the entrypoint of using multiple tools to just do inference, this is great!
ericcurtin•12h ago
Any new features you think we should add to further enhance your usage? Glad you find it useful
juangcarmona•11h ago
Really glad to see DMR getting "new life"... I’ve been experimenting with it for local agentic workloads (MAF, Google's ADK, cagent, Docker MCP, etc...) and it’s such a clean foundation...

A few things that could make it even more powerful (maybe some are out of your scope):

- Persistent model settings (context size, temperature, etc.) across restarts — right now it always resets to 4k, which breaks multi-turn agents. - HTTP/gRPC interface to let tools and frameworks talk to DMR directly, not only through the CLI. (Here the issue is on Docker MCP side, right?) - Simple config management (`docker model set` or `docker model config`) so we can tweak GPU, threads, precision, etc. predictably. (there are at least a couple of issues on this topic already...)

TBH, I love how fast the discussion evolved today.

Congrats and good luck with this. I'll try to help, promised!

ericcurtin•11h ago
Keep opening pull requests and issues, we need these things, you are right!

Show HN: An open source access logs analytics script to block bot attacks

https://github.com/tempesta-tech/webshield
22•krizhanovsky•6h ago•2 comments

Show HN: Metorial (YC F25) – Vercel for MCP

https://github.com/metorial/metorial
43•tobihrbr•11h ago•15 comments

Show HN: Wispbit - Linter for AI coding agents

https://wispbit.com
23•dearilos•6h ago•11 comments

Show HN: CSS Extras

https://github.com/sindresorhus/css-extras
97•mofle•6d ago•60 comments

Show HN: PlayMyMood – Generate YouTube Music playlists based on your mood

https://playmymood.com/
2•speeq•4h ago•0 comments

Show HN: Relaya – Agent calls businesses for you

https://relaya.ai/
5•rishavmukherji•5h ago•0 comments

Show HN: Free API to extract PDF data

6•leftnode•9h ago•0 comments

Show HN: SQLite Online – 11 years of solo development, 11K daily users

https://sqliteonline.com/
448•sqliteonline•1d ago•138 comments

Show HN: Pathwave.io – MCP and mobile app to manually approve AI actions

https://web.pathwave.io/docs
2•felipe-pathwave•5h ago•0 comments

Show HN: Nofan Framework 16 Fan Controller

https://github.com/laktak/nofan
2•laktak•5h ago•0 comments

Show HN: AI toy I worked on is in stores

https://www.walmart.com/ip/SANTA-SMAGICAL-PHONE/16364964771
146•Sean-Der•2d ago•164 comments

Show HN: I built a simple ambient sound app with no ads or subscriptions

https://ambisounds.app/
295•alpaca121•2d ago•117 comments

Show HN:I built a free AI tool that scans and sorts financial news for traders

https://www.fxradar.live/
4•LuckyAleh•8h ago•1 comments

Show HN: Get a PMF score for your website, based on simulated user data

https://semilattice.ai/demos/pmf-report
2•jtewright•9h ago•0 comments

Show HN: I made an esoteric programming language that's read like a spellbook

https://github.com/sirbread/spellscript
171•sirbread•2d ago•55 comments

Show HN: GoHPTS-TCP/UDP Transparent Proxy with ARP Spoofing and Traffic Sniffing

https://github.com/shadowy-pycoder/go-http-proxy-to-socks
2•shadowy-pycoder•11h ago•0 comments

Show HN: Aidlab – Health Data for Devs

55•guzik•3d ago•17 comments

Show HN: Daily install trends of AI coding extensions in VS Code

https://bloomberry.com/coding-tools.html
23•AznHisoka•12h ago•9 comments

Show HN: Baby's first international landline

https://wip.tf/posts/telefonefix-building-babys-first-international-landline/
221•nbr23•6d ago•54 comments

Show HN: A Digital Twin of my coffee roaster that runs in the browser

https://autoroaster.com/
155•jvkoch•1w ago•37 comments

Show HN: docker/model-runner – an open-source tool for local LLMs

https://github.com/docker/model-runner
17•ericcurtin•13h ago•9 comments

Show HN: Wordle-Style Daily Wikipedia Game

https://hyperlinked.wiki
4•Mistri•13h ago•1 comments

Show HN: A Lisp Interpreter for Shell Scripting

https://github.com/gue-ni/redstart
113•quintussss•6d ago•25 comments

Show HN: I extracted BASIC listings for Tim Hartnell's 1986 book

https://github.com/nzduck/hartnell-exploring-ai-book
60•nzduck•4d ago•6 comments

Show HN: I invented a new generative model and got accepted to ICLR

https://discrete-distribution-networks.github.io/
649•diyer22•4d ago•90 comments

Show HN: Lights Out: my 2D Rubik's Cube-like Game

https://raymondtana.github.io/projects/pages/Lights_Out.html
80•raymondtana•4d ago•25 comments

Show HN: AI visuals that feel the music

https://www.trackart.io/
2•feskk•18h ago•0 comments

Show HN: Rift – A tiling window manager for macOS

https://github.com/acsandmann/rift
212•atticus_•3d ago•120 comments

Show HN: Open source, logical multi-master PostgreSQL replication

https://github.com/pgEdge/spock
150•pgedge_postgres•5d ago•60 comments

Show HN: FFTN, faster than FFTW in 700 lines of C

https://gitlab.sac-home.org/sac-group/fftn
7•thomaskoopman•1d ago•0 comments