frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Ask HN: State of the art with local LLMs and agents

4•prmph•1d ago
I like Claude Code for what it is, but I want an agentic coding setup that provides much stronger security and privacy guarantees.

What is the state of the art right now regarding running local LLMS and connecting local agents to them?

Which models are suitable and good, what hardware (at a minimum) and software are required, and which open source agents are good to drive those models?

Comments

alganet•1d ago
It's garbage.

This very own question is problematic. It creates the illusion that local LLM development can compete with huge datacenters, it cannot.

Well-Educated Human Brain > Data center LLMs > Local LLMs

That's how likely things are to be for a while.

incomingpain•1d ago
>I like Claude Code for what it is, but I want an agentic coding setup that provides much stronger security and privacy guarantees.

Very fair. These big ai arent earning billions off $25/month subs that mostly lose them $.

>What is the state of the art right now regarding running local LLMS and connecting local agents to them?

Best of the best as far as Im aware is Qwen3 coder run in your choice of agentic coder. Up there with cloud strength in coding.

BUT it's 480B. Q4_K_M is 290 GB. You're talking $50,000, rackmount, 30amp electrical going into that beast. 10x 32GB cuda cards is yikes.

Here's what literally just released hours ago:

https://huggingface.co/Qwen/Qwen3-Coder-30B-A3B-Instruct

I'm currently downloading the unsloth IQ4_NL version which should run well.

This will run on a 24GB card with good context size and good speed. If you have 1x 32GB card even better.

What I've been using lately.

https://mistral.ai/news/devstral

On a 24GB card this is great. Just look at the benchmarks and it's absolutely completely usable.

Meant to be used in openhands. This absolutely is completely functioning once you get the settings correct like a low temperature and such.

Ask HN: Who is hiring? (August 2025)

142•whoishiring•9h ago•189 comments

Ask HN: Who wants to be hired? (August 2025)

64•whoishiring•9h ago•163 comments

Ask HN: How do you avoid job hunting burnout?

6•b8•3h ago•6 comments

Ask HN: Is "messaging systems specialist" a real job title or niche?

5•pella_may•1h ago•1 comments

I launched 17 side projects. Result? I'm rich in expired domains

351•cesargstn•2d ago•248 comments

Ask HN: Who Is Looking for a Cofounder?

21•dontoni•7h ago•11 comments

Ask HN: Best AI Automation Platform

2•franze•4h ago•1 comments

Ask HN: AI Chat Agent vs. Traditional Personal Website?

4•JaiRathore•6h ago•1 comments

Ask HN: Anyone know how to reach Cloudflare support?

6•OhMeadhbh•7h ago•5 comments

Nova: A New Web Framework for Erlang

64•taure•1d ago•26 comments

Ask HN: Which software companies hire people in Africa for remote work?

6•DanieleProcida•11h ago•2 comments

Ask HN: Startups, 0 Stability – Is It Time to Move on from Tech?

6•OulaX•13h ago•6 comments

Claude Code weekly rate limits

598•thebestmoshe•4d ago•702 comments

Comparison Between Sync Engines

2•belchiorb•16h ago•1 comments

Ask HN: What are you working on? (July 2025)

258•david927•5d ago•845 comments

Has any YC founder ever gone to jail for startup-related crimes?

8•TeslaK20•22h ago•3 comments

Has AI coding gone too far? I feel like I'm losing control of my own projects

13•Shaun0•1d ago•11 comments

Ask HN: Anyone using llms.txt on blogs? Worth it for AI search?

6•logic_node•7h ago•5 comments

Ask HN: Are developers sad about AI writing more of their code?

13•JFerreol_J•1d ago•21 comments

New budget financial API, based on EDGAR data

7•jgfriedman1999•1d ago•5 comments

Tell HN: Add "NSFW" words in your Google query to avoid AI summary

22•behnamoh•3d ago•19 comments

Ask HN: How will the OSA affect small Mastodon instances?

29•Digit-Al•3d ago•15 comments

Ask HN: Is there a way to see HN without all the posts about AI?

4•dotcoma•13h ago•8 comments

Ask HN: Small Utility App Monetization

5•mywacaday•1d ago•0 comments

Ask HN: Local LLM agents on Jetson/RPi without a heavy runtime

3•takuya_h•1d ago•3 comments

Ask HN: Advise for technical solo founders trying to secure venture capital?

5•siva7•1d ago•4 comments

Ask HN: Catching Up with Current Datacenters

3•Damogran6•1d ago•7 comments

Warp.dev Terminal – Overpriced, Buggy, and AI-Sabotaged My Code

56•MistermanX•5d ago•39 comments

Ask HN: State of the art with local LLMs and agents

4•prmph•1d ago•2 comments

Google Maps Reviews in Germany Are Basically Dead

32•tahaygun•2d ago•18 comments