frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Docker Model Runner Integrates vLLM for High-Throughput Inference

https://github.com/docker/model-runner
5•ericcurtin•1h ago
Hi HN, I’m one of the authors of this post.

We’ve updated Docker Model Runner to support vLLM alongside the existing llama.cpp backend. The goal is to bridge the gap between local prototyping (often done with GGUF/llama.cpp) and high-throughput production (often done with Safetensors/vLLM) using a consistent Docker workflow.

Key technical details:

Auto-routing: The tool detects the model format. If you pull a GGUF model, it routes to llama.cpp. If you pull a Safetensors model, it routes to vLLM.

API: It exposes an OpenAI-compatible API (/v1/chat/completions), so the client code doesn't need to change based on the backend.

Usage: It’s just docker model run ai/smollm2-vllm.

Current Limitations:

Right now, the vLLM backend is optimized for x86_64 with Nvidia GPUs.

We are actively working on WSL2 support for Windows users and DGX Spark compatibility.

Happy to answer any questions about the integration or the roadmap!

https://www.docker.com/blog/docker-model-runner-integrates-v...

Comments

ericcurtin•1h ago
I'm one of the devs, happy to answer any questions

Learn C++ Itanium Symbol Mangling

https://noratrieb.github.io/womangling/
1•g0xA52A2A•15s ago•0 comments

Pareto Principle: 20% of Code Causes 80% of Bugs; 1% Causes 99% of Crashes

https://read.perspectiveship.com/p/pareto
1•birdculture•1m ago•0 comments

My Favorite Software Subreddits

https://notes.eatonphil.com/high-quality-subreddits-you-should-be-following.html
1•ibobev•1m ago•0 comments

Writing Airflow Dags with Excel and Minecraft

https://www.astronomer.io/blog/abstraction-with-dag-factory-from-excel-to-minecraft/
1•jlaneve•1m ago•0 comments

We are replacing OOP with something worse

https://blog.jsbarretto.com/post/actors
1•ibobev•1m ago•0 comments

Ntoh*/hton* is a bad API

https://purplesyringa.moe/blog/ntoh-hton-is-a-bad-api/
1•ibobev•2m ago•0 comments

Nvidia Sonic: Motion Tracking for Natural Humanoid Whole-Body Control

https://nvlabs.github.io/SONIC/
1•oumua_don17•3m ago•0 comments

Turn any link, PDF, or YouTube video into an infographic (with Nano Banana Pro)

https://unrav.io/convert-to-infographic
1•rriley•4m ago•1 comments

Inside Nvidia GPU: Blackwell's Limitations & Future Rubin's Microarchitecture

https://github.com/zartbot/blog/issues/3
2•matt_d•7m ago•0 comments

EU designates 19 tech providers as critical infrastructure

https://www.scworld.com/news/eu-designates-19-tech-providers-as-critical-infrastructure
1•Bender•7m ago•0 comments

Ransomware gangs integrating AWS features into their toolkits

https://www.scworld.com/news/ransomware-gangs-now-integrate-native-aws-features-like-s3-buckets-i...
1•Bender•7m ago•0 comments

Things I learned as an Airbnb host 2021-2022

https://michael-lewis.com/posts/things-i-learned-as-an-airbnb-host/
2•speckx•8m ago•0 comments

Trump revives unpopular Ted Cruz plan to punish states that impose AI laws

https://arstechnica.com/tech-policy/2025/11/trump-revives-unpopular-ted-cruz-plan-to-punish-state...
1•Bender•9m ago•1 comments

Should we eat dinner earlier in winter?

https://theconversation.com/should-we-eat-dinner-earlier-in-winter-why-timing-might-matter-more-t...
1•wjb3•11m ago•0 comments

AI Agents Are the New Web Stack

https://h3manth.com/scribe/blog/ai-agents-web-architecture/
2•init0•12m ago•0 comments

Enzyme-mediated aminoglycoside resistance without target mimicry

https://www.nature.com/articles/s42004-025-01666-0
1•PaulHoule•12m ago•0 comments

Rive – Why Scripting Runs on Luau

https://rive.app/blog/why-scripting-runs-on-luau
1•vyrotek•16m ago•0 comments

Kitsune: Enabling Dataflow Execution on GPUs with Spatial Pipelines

https://dl.acm.org/doi/10.1145/3777466
1•matt_d•18m ago•0 comments

The AI bubble is bigger than you think

https://prospect.org/2025/11/19/ai-bubble-bigger-than-you-think/
4•DarkContinent•19m ago•0 comments

Ask HN: Universal testing machines OEM software needed?

1•DrNuke•19m ago•0 comments

Highlights from Git 2.52

https://github.blog/open-source/git/highlights-from-git-2-52/
1•ossusermivami•20m ago•0 comments

Early science acceleration experiments with GPT-5 [pdf]

https://cdn.openai.com/pdf/4a25f921-e4e0-479a-9b38-5367b47e8fd0/early-science-acceleration-experi...
1•gronky_•22m ago•0 comments

CBP is monitoring US drivers and detaining those with suspicious travel patterns

https://apnews.com/article/immigration-border-patrol-surveillance-drivers-ice-trump-9f5d05469ce8c...
118•jjwiseman•24m ago•29 comments

Google cracked Apple's AirDrop and is adding it to Pixel phones

https://www.theverge.com/news/825228/iphone-airdrop-android-quick-share-pixel-10
6•CharlesW•30m ago•1 comments

Evidence of Fatigue Cracks a 'Major Clue' in Fatal UPS Jet Crash

https://www.reuters.com/business/aerospace-defense/us-ntsb-finds-evidence-fatigue-cracks-fatal-up...
2•sarimkx•30m ago•1 comments

Coding Trance Music from Scratch (Again) [video]

https://www.youtube.com/watch?v=iu5rnQkfO6M
1•raphar•32m ago•1 comments

Lunar Landing Game Related Documents

https://www.cs.brandeis.edu/~storer/LunarLander/LunarLander.html
1•marcodiego•32m ago•0 comments

Application Software Is Dead, Again

https://www.akashbajwa.co/p/application-software-is-dead-again
2•ptrhvns•34m ago•0 comments

Boris Becker:'Whoever says a prison life is easy is lying–it's real punishment'

https://www.theguardian.com/sport/2025/nov/20/boris-becker-prison-tennis-interview
4•bookofjoe•38m ago•0 comments

Fixing kernel updates not applying in Fedora 43

https://nyanpasu64.gitlab.io/blog/fixing-fedora-kernel-updates/
2•speckx•39m ago•0 comments