frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Ask HN: What Does Your Self-Hosted LLM Stack Look Like in 2025?

17•anditherobot•1d ago
Back when web development was taking off, there was always a go-to stack — something like Postgres + Django + jQuery, or .NET + Bootstrap, SQLITE. Over the years we had proven tech and proven patterns like : MVC, SPA etc...

Now that local LLMs are gaining traction, I’m wondering what the equivalent stack looks like today.

Models, Runtime, hardware and other tools.

That could rival the Claudes, ChatGPTs or Geminis, etc

Thanks

Comments

fazlerocks•1d ago
Running Llama 3.1 70B on 2x4090s with vLLM. Memory is a pain but works decent for most stuff.

Tbh for coding I just use the smaller ones like CodeQwen 7B. way faster and good enough for autocomplete. Only fire up the big model when I actually need it to think.

The annoying part is keeping everything updated, new model drops every week and half don't work with whatever you're already running.

bluejay2387•22h ago
2x 3090's running Ollama and VLLM... Ollama for most stuff and VLLM for the few models that I need to test that don't run on Ollama. Open Web UI as my primary interface. I just moved to Devstral for coding using the Continue plugin in VSCode. I use Qwen 3 32b for creative stuff and Flux Dev for images. Gemma 3 27b for most everything else (slightly less smart than Qwen, but its faster). Mixed Bread for embeddings (though apparently NV-Embed-v2 is better?). Pydantic as my main utility library. This is all for personal stuff. My stack at work is completely different and driven more by our Legal teams than technical decisions.
gabriel_dev•19h ago
Ollama + mac mini 24gb (inference)
runjake•19h ago
Ollama + M3 Max 36GB Mac. Usually with Python + SQLite3.

The models vary depending on the task. DeepSeek distilled has been a favorite for the past several months.

I use various smaller (~3B) models for simpler tasks.

xyc•18h ago
recurse.chat + M2 max Mac
v5v3•5h ago
Ollama on a M1 MacBook pro but will be moving to a Nvidia GPU setup.

More Federal Workers Are Flooding the Job Market, with Worsening Prospects

https://www.nytimes.com/2025/06/06/us/politics/federal-workers-job-market.html
1•koolba•1m ago•0 comments

Analysis of the Spyware That Helped to Compromise a Syrian Army from Within

https://www.mobile-hacker.com/2025/06/05/analysis-of-spyware-that-helped-to-compromise-a-syrian-army-from-within/
1•blinding-streak•4m ago•0 comments

Someone's fresh Switch 2 arrived with factory firmware on it

https://twitter.com/Nintendeal/status/1930708382794809646
1•bundie•6m ago•0 comments

What Type of [] Are You?

https://www.google.com/search?q=site%3Abuzzfeed.com+(%22which+are+you%3F%22+OR+%22what+are+you%22+OR+%22what+type+of%22)+%22quizzes%22+-personality
1•HocusLocus•6m ago•1 comments

Jared Isaacman reveals how space agency might have looked under his watch

https://www.theregister.com/2025/06/06/how_nasa_might_have_looked/
1•rntn•11m ago•0 comments

McSee: Evaluating Rowhammer Attacks and Defenses via DRAM Traffic Analysis

https://comsec.ethz.ch/research/dram/mcsee/
1•hasheddan•14m ago•0 comments

Lessons from a FAT God: An Introspective on the FAT filesystems

https://elianrieza.dev/posts/lessons-from-a-FAT-god
2•nail_•19m ago•0 comments

Doge Developed Error-Prone AI Tool to "Munch" Veterans Affairs Contracts

https://www.propublica.org/article/trump-doge-veterans-affairs-ai-contracts-health-care
3•afavour•20m ago•0 comments

Chocolate Quake source port preserving original experience even bugs and quirks

https://github.com/Henrique194/chocolate-quake
2•retro_guy•23m ago•1 comments

Displaying Korean Text Efficiently

https://oberg.org/posts/hangul/
2•fanf2•27m ago•0 comments

Napster.com Faced ISP Piracy Blockade for "Massive Copyright Violations"

https://torrentfreak.com/napster-com-faced-isp-piracy-blockade-for-massive-copyright-violations-250606/
1•CoBE10•27m ago•0 comments

Processing Nebula Images with Open Source Tools vs$350 Software

https://astroimagery.com/techniques/post-processing/how-to-edit-astrophotography-with-ease/
2•karlperera•28m ago•1 comments

M&S hackers sent abuse and ransom demand directly to CEO

https://www.bbc.co.uk/news/articles/cr58pqjlnjlo
1•mmarian•28m ago•0 comments

Xiaohongshu(Rednote) released its dots.llm open source AI model

https://github.com/rednote-hilab/dots.llm1
1•error404x•28m ago•0 comments

Crustal to mantle melt storage during the evolution of Hawaiian volcanoes

https://www.science.org/doi/10.1126/sciadv.adu9332
1•PaulHoule•29m ago•0 comments

AI Leaders and Builders Fireside Chat

https://www.supermomos.com/socials/ai-leaders-builders-fireside
1•dubrado•29m ago•1 comments

Natural Intelligence Is Sexy

https://warandpeas.com/2025/06/06/natural-intelligence-is-sexy/
1•askl•31m ago•0 comments

Show HN: Colorcura – Visualize color palettes inside live UI components

https://www.colorcura.site
1•ankitsharma777•38m ago•0 comments

Competitive Coder's Handbook [pdf]

https://cses.fi/book/book.pdf
1•baobabKoodaa•39m ago•0 comments

How to (actually) send DTMF on Android without being the default call app

https://edm115.dev/blog/2025/01/22/how-to-send-dtmf-on-android
2•EDM115•40m ago•0 comments

I Let ChatGPT Make All My Architectural Decisions for a Month: The Surprising R

https://medium.com/@sohail_saifi/i-let-chatgpt-make-all-my-architectural-decisions-for-a-month-the-surprising-results-e21f1254c74c
2•todsacerdoti•43m ago•0 comments

Alpaca's MCP Server for Trading/Quotes

https://github.com/alpacahq/alpaca-mcp-server
2•growbell_social•44m ago•1 comments

AI isn't coming for your job–it's coming for your company

https://www.fastcompany.com/91345570/ai-isnt-coming-for-your-job-its-coming-for-your-company
2•kjhughes•44m ago•1 comments

Three Types of Math Acceleration

https://kidswholovemath.substack.com/p/three-types-of-math-acceleration
1•sebg•47m ago•0 comments

BetterAuth vs. NextAuth

https://www.devtoolsacademy.com/blog/betterauth-vs-nextauth/
2•amalinovic•50m ago•0 comments

Swift and Cute 2D Game Framework: Setting Up a Project with CMake

https://layer22.com/swift-and-cute-framework-setting-up-a-project-with-cmake
12•pusewicz•54m ago•5 comments

How to Use Vheer Text to Image Generator: A Beginner's Guide

https://www.readability.com/how-to-use-vheer-text-to-image-generator-a-beginners-guide
2•vertex_steven•57m ago•0 comments

Show HN: Book to help you build a PostgreSQL-like database server from scratch

https://technicaldeft.com/build-a-database-server
1•zetter•57m ago•0 comments

Conventional commit generator using local LLMs

https://wimpysworld.com/posts/faff-ollama-conventional-commit-generator/
2•popey•57m ago•1 comments

MCP Resources Are for Caching

https://timkellogg.me/blog/2025/06/05/mcp-resources
1•tkellogg•58m ago•0 comments