frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: State of the art with local LLMs and agents

4•prmph•6mo ago
I like Claude Code for what it is, but I want an agentic coding setup that provides much stronger security and privacy guarantees.

What is the state of the art right now regarding running local LLMS and connecting local agents to them?

Which models are suitable and good, what hardware (at a minimum) and software are required, and which open source agents are good to drive those models?

Comments

alganet•6mo ago
It's garbage.

This very own question is problematic. It creates the illusion that local LLM development can compete with huge datacenters, it cannot.

Well-Educated Human Brain > Data center LLMs > Local LLMs

That's how likely things are to be for a while.

incomingpain•6mo ago
>I like Claude Code for what it is, but I want an agentic coding setup that provides much stronger security and privacy guarantees.

Very fair. These big ai arent earning billions off $25/month subs that mostly lose them $.

>What is the state of the art right now regarding running local LLMS and connecting local agents to them?

Best of the best as far as Im aware is Qwen3 coder run in your choice of agentic coder. Up there with cloud strength in coding.

BUT it's 480B. Q4_K_M is 290 GB. You're talking $50,000, rackmount, 30amp electrical going into that beast. 10x 32GB cuda cards is yikes.

Here's what literally just released hours ago:

https://huggingface.co/Qwen/Qwen3-Coder-30B-A3B-Instruct

I'm currently downloading the unsloth IQ4_NL version which should run well.

This will run on a 24GB card with good context size and good speed. If you have 1x 32GB card even better.

What I've been using lately.

https://mistral.ai/news/devstral

On a 24GB card this is great. Just look at the benchmarks and it's absolutely completely usable.

Meant to be used in openhands. This absolutely is completely functioning once you get the settings correct like a low temperature and such.

Discuss – Do AI agents deserve all the hype they are getting?

2•MicroWagie•1h ago•0 comments

Ask HN: Anyone Using a Mac Studio for Local AI/LLM?

47•UmYeahNo•1d ago•29 comments

LLMs are powerful, but enterprises are deterministic by nature

3•prateekdalal•5h ago•3 comments

Ask HN: Non AI-obsessed tech forums

27•nanocat•16h ago•21 comments

Ask HN: Ideas for small ways to make the world a better place

16•jlmcgraw•18h ago•20 comments

Ask HN: 10 months since the Llama-4 release: what happened to Meta AI?

44•Invictus0•1d ago•11 comments

Ask HN: Who wants to be hired? (February 2026)

139•whoishiring•4d ago•517 comments

Ask HN: Who is hiring? (February 2026)

313•whoishiring•4d ago•512 comments

Ask HN: Non-profit, volunteers run org needs CRM. Is Odoo Community a good sol.?

2•netfortius•13h ago•1 comments

AI Regex Scientist: A self-improving regex solver

7•PranoyP•20h ago•1 comments

Tell HN: Another round of Zendesk email spam

104•Philpax•2d ago•54 comments

Ask HN: Is Connecting via SSH Risky?

19•atrevbot•2d ago•37 comments

Ask HN: Has your whole engineering team gone big into AI coding? How's it going?

18•jchung•2d ago•13 comments

Ask HN: Why LLM providers sell access instead of consulting services?

5•pera•1d ago•13 comments

Ask HN: How does ChatGPT decide which websites to recommend?

5•nworley•1d ago•11 comments

Ask HN: What is the most complicated Algorithm you came up with yourself?

3•meffmadd•1d ago•7 comments

Ask HN: Is it just me or are most businesses insane?

8•justenough•1d ago•7 comments

Ask HN: Mem0 stores memories, but doesn't learn user patterns

9•fliellerjulian•2d ago•6 comments

Ask HN: Is there anyone here who still uses slide rules?

123•blenderob•4d ago•122 comments

Kernighan on Programming

170•chrisjj•4d ago•61 comments

Ask HN: Any International Job Boards for International Workers?

2•15charslong•16h ago•2 comments

Ask HN: Anyone Seeing YT ads related to chats on ChatGPT?

2•guhsnamih•1d ago•4 comments

Ask HN: Does global decoupling from the USA signal comeback of the desktop app?

5•wewewedxfgdf•1d ago•3 comments

We built a serverless GPU inference platform with predictable latency

5•QubridAI•2d ago•1 comments

Ask HN: Does a good "read it later" app exist?

8•buchanae•3d ago•18 comments

Ask HN: Have you been fired because of AI?

17•s-stude•4d ago•15 comments

Ask HN: How Did You Validate?

4•haute_cuisine•1d ago•6 comments

Ask HN: Anyone have a "sovereign" solution for phone calls?

12•kldg•4d ago•1 comments

Ask HN: Cheap laptop for Linux without GUI (for writing)

15•locusofself•3d ago•16 comments

Ask HN: OpenClaw users, what is your token spend?

14•8cvor6j844qw_d6•4d ago•6 comments