Ask HN: Is there a local LLM I can run without a GPU for agentic workflows?
6•limondas•13h ago
Agentic workflow tools like Claude Code are too costly for me. Is there an LLM I can run with VSCode on the setup below?
16 GB RAM
Intel Core i7 H-series processor, 13th gen
512 GB NVMe SSD
I want to run the AI as a local agentic workflow with VSCode. I want to use LLAMA agent to build it.
I have tried Gemma3 270M and Qwen3.5 4bit. They are fast but not very coding-focused.
I will be grateful for your response.
Thank you.
Comments
denn-gubsky•13h ago
Try the qwen3-coder or qwen3-coder-next models, which fit into your configuration. These are mixture-of-experts (MoE) models, which may load only the active experts into the GPU.
limondas•11h ago
Thanks for your reply.
But it's too big for my PC.
On my PC, models of around 1.5 GB get about 20 tokens/s, which is too slow for an agentic workflow.
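To put a tokens/s figure in context, here is a rough sketch of how decode speed turns into wait time per agent step. The 500-token step size and the other two rates are illustrative assumptions, not measurements, and prompt processing time is ignored.

```python
def generation_seconds(output_tokens: int, tokens_per_second: float) -> float:
    """Time to generate output_tokens at a steady decode rate.

    Ignores prompt processing, which adds to the real wait.
    """
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return output_tokens / tokens_per_second


# Hypothetical agent step: the model emits 500 tokens
# (for example a tool call plus a small patch).
step_tokens = 500
for rate in (5, 20, 60):
    seconds = generation_seconds(step_tokens, rate)
    print(f"{rate} tok/s -> {seconds:.0f} s per step")
```

An agentic run chains many such steps, so the per-step wait multiplies by the number of steps.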
denn-gubsky•2h ago
Try the latest gemma4:12b. It fits into 16 GB with a 256K context window.
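A back-of-the-envelope way to check whether a model of a given size might fit in RAM is to estimate the size of the weights alone from parameter count and quantization width. This is a minimal sketch: the function name is hypothetical, and it deliberately leaves out the KV cache (which grows with context length), runtime overhead, and whatever the OS and VSCode already use.

```python
def weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights alone, in GB (10^9 bytes)."""
    total_bits = params_billion * 1e9 * bits_per_weight
    return total_bits / 8 / 1e9


# A 12B-parameter model at common precisions (weights only).
for bits in (16, 8, 4):
    print(f"12B at {bits}-bit: ~{weight_memory_gb(12, bits):.0f} GB")
```

Under these assumptions only the lower-bit quantizations leave headroom on a 16 GB machine, and a long context window will consume part of that headroom.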