frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: Is the next big thing locally running coding agents?

1•baigy•40m ago
There's extreme price escalation on part of Anthropic, with token spend now approaching levels that have made many-an-enterprise scratch their heads.

At the same time, judging by opensource advances (E.g. Qwen 3.6 27B), hosting a smart enough local LLM on 16GB VRAM (or equivalent) is increasingly becoming a reality. Lastly, I see most coding to be of intermediate difficulty, not beyond.

Seems to me it's a matter of time that people shift to free Claude Code type experiences, powered by local LLMs.

What do you think?

Comments

damnitbuilds•31m ago
I got Qwen 3.6 running locally on 12GB VRAM.

It went:

  AI: "I see you are building a Django project. How can I help?"

  Me: "When I click on the Reload button, it does not set the reload option correctly. Fix this"

     <10 minutes>

  AI: "I see you are building a Django project. How can I help?"
Needs more tweaking of the context window, I think.

Seriously, I agree that this is the future, when OpenAI et al have gone bust.

baigy•26m ago
I think it's a huge bubble about to pop. I get that enterprises are like elephants, slow to move, locked into agreements.

But I think free is going to be infinitely better than paying Anthropic more money than you used to spend on your human payroll. The big pop is coming.

giwook•22m ago
I think this is the key issue with running locally hosted models.

Yes, technically you can run them on 12gb vram.

But should you?

Realistically 64gb seems to be the current threshold for getting meaningful work done while also maintaining a large enough context window.

baigy•18m ago
This will drop further with increase in intelligence density.
jonahbenton•28m ago
There are many markets. Qwen 3.6 27b at a high enough quant is good enough for many use cases. But enterprise-consumed tokens come with legal/data protection agreements. They have just gotten comfortable with BYOD- there is no BYOD equivalent set of practices and protections for local LLMs (BYOLLM). So some enterprises are getting back into prem GPU capacity.
baigy•24m ago
On prem GPU capacity - or decent enough devices for core engineering team - lends itself pretty nicely to local LLMs too. And you own the whole stack this way. Why pay premiums to Anthropic and fuel its trillion dollar valuation?
giwook•23m ago
This seems like an obvious progression imo though I think very much subject to change. Open weight models will become better, and memory prices will return to normal prices in a couple years (hopefully).

That being said I think an unpredictable variable here is how the companies building frontier models respond to what should be a noticeable inflection point in consumers turning towards locally hosted open weight models.

There is also a significant amount of compute that is being built out as we speak that should in theory reduce costs for providers of frontier models but that's a whole other can of worms.

Despite all of the very impressive open weight models that are available to us today, Anthropic and OpenAI continue to remain steps ahead of the competition. Most of the biggest and brightest minds in AI are working at frontier labs. It's not hard to foresee that these labs continue to maintain their edge given the amount of expertise and brainpower they've assembled.

Assuming frontier models continue to maintain their edge, even if it's on a subset of tasks (e.g. reasoning, judgment, planning), I see a convergence towards a hybrid workflow where both frontier and local models are used for specific tasks. e.g. Claude for reasoning, planning, judgment, with intelligent routing to cheap/free models tuned for certain tasks.

baigy•19m ago
Good points.

I feel where it all loses its legs is the fact that most coding work is intermediate complexity. You won't need super intelligence to code/maintain your CRM or what have you. Specialized firms may pay the premiums Anthropic/OpenAI expect, the vast majority of enterprises won't need to, for the vast majority of their use-cases.

Technical Interviews Reject the Wrong Engineers

https://fagnerbrack.com/technical-interviews-reject-the-wrong-engineers-a8e78ca04b2e
1•birdculture•1m ago•0 comments

Show HN: Let agents run any analysis with Mixpanel data, no UI required

https://docs.mixpanel.com/docs/mixpanel-headless
1•ttchen2•1m ago•0 comments

The Unbearable Blandness Of The 2020's [video]

https://www.youtube.com/watch?v=tzvXoss7A3E
1•mindcrime•1m ago•0 comments

NATO commander: Europe has no alternative to Palantir's warfare tech

https://www.politico.eu/article/nato-commander-europe-no-palantir-alternative/
1•robertkoss•2m ago•0 comments

Leroy's elusive little people: A review on lilliputian hallucinations (2021)

https://www.sciencedirect.com/science/article/pii/S0149763421001068
1•billfor•2m ago•0 comments

What 1,281 agent runs reveal about coding agent failure in large codebases

https://tessl.io/blog/coding-agent-failure-patterns-large-codebases/
1•jdorfman•2m ago•0 comments

Active beam headlights are finally coming to America

https://arstechnica.com/cars/2026/05/these-clever-active-beam-headlights-are-finally-coming-to-am...
1•LorenDB•3m ago•0 comments

How OLTs may have exposed ISP networks

https://blog.quarkslab.com/how-olts-may-have-exposed-entire-isp-networks.html
1•speckx•5m ago•0 comments

Show HN: A demo video of Effected Keyboard 2

https://www.youtube.com/shorts/6aExjM8A9pE
1•vitalipom•6m ago•0 comments

Navox Network – Browser-only CRM built on weak-ties research

https://www.navox.tech/network
1•nahrin•8m ago•0 comments

Build your own green threads library in C

https://github.com/nihiL7331/thrd-ndl
1•nihiL7331•8m ago•0 comments

Show HN: I made the first free ad blocker for podcasts

https://drea.fm/
1•hamza_q_•8m ago•0 comments

PULSELoCo: 17x less trainer-to-trainer bandwidth in distributed RL post-training

https://arxiv.org/abs/2602.03839
1•synapz_org•8m ago•0 comments

Collabora and Flipper: Opening Up the RK3576

https://www.collabora.com/news-and-blog/news-and-events/collabora-flipper-opening-up-the-rk3576.html
1•mfilion•9m ago•0 comments

AI Gateway Production Index

https://vercel.com/blog/ai-gateway-production-index
1•gmays•9m ago•0 comments

TSA Gold+ program for privatizing airport security screening

https://www.tsa.gov/goldplus
2•victorio•9m ago•1 comments

I spent 50 hours drawing a line graph

https://www.dougmacdowell.com/50-hours-to-draw-some-lines.html
1•dougdude3339•10m ago•1 comments

Microsoft warns of new Defender zero-days exploited in attacks

https://www.bleepingcomputer.com/news/security/microsoft-warns-of-new-defender-zero-days-exploite...
1•Brajeshwar•14m ago•0 comments

Show HN: opub, donated compute for open-source

https://opub.dev/blog/introducing-opub
1•goodroot•16m ago•0 comments

A Booming Shadow Market of Sketchy A.I. Investments

https://www.newyorker.com/culture/infinite-scroll/a-booming-shadow-market-of-sketchy-ai-investments
2•mmayberry•17m ago•0 comments

Deepfakes Tore a High School Apart

https://www.404media.co/radnor-high-school-pennsylvania-ai-deepfakes-child-sexual-abuse-material/
3•cdrnsf•18m ago•0 comments

Apparently former Facebook staffers are in high-ranking positions at Mozilla now

https://goblin.band/notes/ak9wrlzwgqsvbj9y
1•speckx•19m ago•0 comments

MCP-safeguard: first automated security scanner for MCP servers

https://github.com/SyedAnas01/mcp-safeguard
1•Anas1371•20m ago•0 comments

I built a tool to stop AI coding agents from leaking my secrets

https://github.com/getveil/veil
1•bcharest_dev•21m ago•0 comments

Realtime pixels-in-actions-out neural agent for Flappy Anna 3D

https://www.youtube.com/watch?v=gssY-ZQx06g
1•guiguan•22m ago•0 comments

I built a small tool to reduce input token costs by 20-30% for agentic tasks

https://bigindexer.com/blog/reduce-input-token-costs-agentic-tasks
1•afxuh•22m ago•0 comments

Morphogenic Systems Lead

http://mailto:architect@creaturealgorithm.com
1•mariuslukas•22m ago•0 comments

Show HN: Six legendary marketers walk into a workflow

https://github.com/conductor-oss/awesome-skills/tree/main/gtm-mavericks
1•opiniateddev•22m ago•0 comments

Agents will make your telemetry explode. You are not ready

https://shippingbytes.com/2026/05/21/agents-will-make-your-telemetry-explod/
2•speckx•24m ago•0 comments

We Reverse-Engineered Docker Sandbox's Undocumented MicroVM API

https://rivet.dev/blog/2026-02-04-we-reverse-engineered-docker-sandbox-undocumented-microvm-api/
2•yakkomajuri•25m ago•0 comments