frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Performance vs. Practicality: A Comparison of VLLM and Ollama

https://robert-mcdermott.medium.com/performance-vs-practicality-a-comparison-of-vllm-and-ollama-104acad250fd
2•mcdermott•6h ago

Comments

mcdermott•6h ago
When raw performance matters, vLLM wins, but Ollama often wins on everything else.

My benchmarks showed vLLM delivering up to 3.2x the requests-per-second of Ollama on identical hardware, with noticeably lower latency at high concurrency.

If you're not looking for the ultimate performance on the latest GPU hardware, then Ollama is still hard to beat. It installs in minutes, runs on laptops, supports CPU fallback, and provides a curated model hub plus on-the-fly model switching. If your typical load is a handful of concurrent users, batch jobs that can wait an extra second, or local exploration during development, Ollama’s “good-enough” performance is exactly that, good enough.

Ollama is the reliable daily driver that gets almost everyone where they need to go; vLLM is the tuned engine you unleash when the freeway opens up and you really need to fly.

Trump administration moves to cut $100M in federal contracts for Harvard

https://apnews.com/article/trump-harvard-federal-contracts-51d2d2618e1f0f5de39cb649644e1dae
3•donsupreme•5m ago•1 comments

Made a Changelog generator for your commits

https://www.gitsaga.io/
1•p_bits•5m ago•1 comments

How a Generation's Struggle Led to a Record Surge in Homelessness

https://www.nytimes.com/2025/05/27/us/politics/homelessness-baby-boomers.html
1•howard941•5m ago•0 comments

Starship's Ninth Flight Test

https://x.com/i/broadcasts/1OwxWXMRAXmKQ
1•seagull_sounds•7m ago•0 comments

Inside the Arnett, OK tornado [video]

https://www.youtube.com/watch?v=NGD2e741Riw
1•layer8•9m ago•0 comments

Squiggle: A simple programming language for intuitive probabilistic estimation

https://www.squiggle-language.com/
1•fanf2•9m ago•0 comments

Fundamental forms for characterizing trapezoid-based origami metamaterials

https://www.nature.com/articles/s41467-025-57089-x
1•PaulHoule•11m ago•0 comments

Show HN: I Vibecoded a Python Class Hierarchy Checker (Needs Your Eyes)

https://github.com/agaz1985/umberto
1•adale•12m ago•0 comments

Show HN: Getting full-text scientific content into LLMs+Agents is stupidly hard

https://www.valyu.network/blogs/deepsearch-v2-updates
2•zk108•14m ago•0 comments

LLM Pricing Calculator

https://www.llm-prices.com/
1•Bluestein•15m ago•0 comments

Claude Voice Mode Beta

https://twitter.com/AnthropicAI/status/1927463559836877214
1•brianjking•15m ago•0 comments

Self-Reflective Uncertainties: Do LLMs Know Their Internal Answer Distribution?

https://arxiv.org/abs/2505.20295
1•badmonster•16m ago•0 comments

Jujutsu from the Trenches

https://mattjhall.co.uk/posts/jujutsu-from-the-trenches.html
2•mattjhall•18m ago•0 comments

Impathy and Emotion Recognition: How Attachment Shapes Emotion Processing

https://www.mdpi.com/2076-3425/15/5/516
1•rendx•19m ago•0 comments

Tokyo startup is turning discarded kimonos into stylish sneakers

https://www.cnn.com/style/japan-upcycle-kimono-tokyo-shoes-hnk-spc
2•Hoasi•20m ago•0 comments

Sodium-air fuel cell for high energy density and low-cost electric power

https://www.cell.com/joule/fulltext/S2542-4351(25)00143-6
2•gnabgib•21m ago•0 comments

FFmate – Automate FFmpeg with Clean APIs and Smart Defaults

https://docs.ffmate.io
2•john-dev•23m ago•1 comments

The Unreliable Nature of Corten Steel for Architectural Applications

https://spenglerindustries.com/the-unreliable-nature-of-corten-steel-for-architectural-applications/
4•Bluestein•24m ago•0 comments

Beyond Compare

https://www.scootersoftware.com/home
1•smartmic•27m ago•0 comments

Dora Research: 2024

https://dora.dev/research/2024/dora-report/
2•cebert•28m ago•0 comments

Harvard’s World-Famous Glass Flowers: Fragile Beauties (2024)

https://fwtmagazine.com/harvards-world-famous-glass-flowers-fragile-beauties/
3•wtp30twice•29m ago•1 comments

MariaDB Acquires Galera Cluster

https://mariadb.com/newsroom/press-releases/mariadb-acquires-galera-cluster/
1•evanelias•30m ago•0 comments

Show HN: AnyClaude – Claude Code with any LLM

https://github.com/coder/anyclaude
3•kylecarbs•31m ago•0 comments

Arc-NCA: Towards Developmental Solutions to the Abstraction and Reasoning Corpus

https://arxiv.org/abs/2505.08778
1•jarmitage•32m ago•0 comments

Concatenative programming and stack-based languages (2023) [video]

https://www.youtube.com/watch?v=umSuLpjFUf8
2•dcreager•32m ago•0 comments

I put 5 years of community writing into NotebookLM. Here's the audio summary [video]

https://www.youtube.com/watch?v=pUKrPXQ6pAE
2•rosiesherry•35m ago•0 comments

All of Paul Graham's essays in 100 words and organized

https://summarygraham.com
1•pentil_kuda•38m ago•0 comments

School Expelled a 12-Year-Old for a Social Media Post

https://www.propublica.org/article/tennessee-school-threat-assessment-expulsion
2•Improvement•39m ago•0 comments

There Are N+1 Hard Things in Computer Science

https://lukebechtel.com/blog/there-are-n-plus-1-hard-things-in-computer-science
2•marviel•39m ago•1 comments

In Vietnam, an unlikely outpost for Chicano culture

https://www.latimes.com/world-nation/story/2025-05-27/chicano-culture-vietnam
11•donnachangstein•41m ago•2 comments