Ask HN: State of the art with local LLMs and agents

4•prmph•6mo ago

I like Claude Code for what it is, but I want an agentic coding setup that provides much stronger security and privacy guarantees.

What is the state of the art right now regarding running local LLMS and connecting local agents to them?

Which models are suitable and good, what hardware (at a minimum) and software are required, and which open source agents are good to drive those models?

Comments

alganet•6mo ago

It's garbage.

This very own question is problematic. It creates the illusion that local LLM development can compete with huge datacenters, it cannot.

Well-Educated Human Brain > Data center LLMs > Local LLMs

That's how likely things are to be for a while.

incomingpain•6mo ago

>I like Claude Code for what it is, but I want an agentic coding setup that provides much stronger security and privacy guarantees.

Very fair. These big ai arent earning billions off $25/month subs that mostly lose them $.

>What is the state of the art right now regarding running local LLMS and connecting local agents to them?

Best of the best as far as Im aware is Qwen3 coder run in your choice of agentic coder. Up there with cloud strength in coding.

BUT it's 480B. Q4_K_M is 290 GB. You're talking $50,000, rackmount, 30amp electrical going into that beast. 10x 32GB cuda cards is yikes.

Here's what literally just released hours ago:

https://huggingface.co/Qwen/Qwen3-Coder-30B-A3B-Instruct

I'm currently downloading the unsloth IQ4_NL version which should run well.

This will run on a 24GB card with good context size and good speed. If you have 1x 32GB card even better.

What I've been using lately.

https://mistral.ai/news/devstral

On a 24GB card this is great. Just look at the benchmarks and it's absolutely completely usable.

Meant to be used in openhands. This absolutely is completely functioning once you get the settings correct like a low temperature and such.

Ask HN: Opus 4.6 ignoring instructions, how to use 4.5 in Claude Code instead?

Ask HN: Anyone Using a Mac Studio for Local AI/LLM?

Ask HN: Ideas for small ways to make the world a better place

Ask HN: Non AI-obsessed tech forums

Ask HN: 10 months since the Llama-4 release: what happened to Meta AI?

Ask HN: Who wants to be hired? (February 2026)

Ask HN: Who is hiring? (February 2026)

LLMs are powerful, but enterprises are deterministic by nature

AI Regex Scientist: A self-improving regex solver

Tell HN: Another round of Zendesk email spam

Ask HN: Non-profit, volunteers run org needs CRM. Is Odoo Community a good sol.?

Ask HN: Is Connecting via SSH Risky?

Ask HN: Has your whole engineering team gone big into AI coding? How's it going?

Ask HN: Is there anyone here who still uses slide rules?

Ask HN: How does ChatGPT decide which websites to recommend?

Ask HN: Mem0 stores memories, but doesn't learn user patterns

Ask HN: Why LLM providers sell access instead of consulting services?

Kernighan on Programming

Ask HN: Is it just me or are most businesses insane?

Ask HN: What is the most complicated Algorithm you came up with yourself?

Ask HN: Anyone Seeing YT ads related to chats on ChatGPT?

Ask HN: Does global decoupling from the USA signal comeback of the desktop app?

We built a serverless GPU inference platform with predictable latency

Ask HN: Does a good "read it later" app exist?

Ask HN: Have you been fired because of AI?

Ask HN: Anyone have a "sovereign" solution for phone calls?

Ask HN: Cheap laptop for Linux without GUI (for writing)

GitHub Actions Have "Major Outage"

Ask HN: How Did You Validate?

Ask HN: Has anybody moved their local community off of Facebook groups?