Ask HN: If I cancel Codex today whats the next best local inference agent?

6•Bulbasaur2015•1h ago

better place to ask over /r/LocalLLaMA

Comments

bigyabai•1h ago

For local inference? It entirely depends on what your hardware is.

verdverm•41m ago

OpenCode + vllm, model will depend on your hardware, but OpenCode also has a killer $10/m plan with quotas for some top tier open weight models.

I'm using qwen3.6 on a DGX spark, llama-cpp has prompt cache bugs for qwen/gemma models (among more being reported). Using my OpenCode-go sub when I want a bigger / more capable model

Ask HN: If I cancel Codex today whats the next best local inference agent?

Ask HN: Any advice on how to learn good software architecture practices?

Ask HK: How are you building AI apps today?

Ask HN: What Is an "AI Engineer"?

Ask HN: Does Claude Code remove the need for so many front-end frameworks?

Ask HN: Are Tech Meetups Dead?

Ask HN: I found out that I'm about to be laid off. How do people find jobs?

A disk-first C++ vector engine

Garnix, the Nix CI, is shutting down

Do not use Cloudflare DNS regsitrar

Ask HN: How do you feel about posts about GenAI taking over the HN front page?

Train 1T parameter LLM with 8 GPUs?

Ask HN: Is anyone working at least 4 hours daily on an Apple Vision Pro?

Ask HN: Why not have an EU browser?

Sqlit – A lazygit-style TUI for SQL databases

Ask HN: How do you model temporarily invalid data structures

Did the Linux memory management maintainer "just quit"?

Ask HN: When and why did you start believing in God?

Ask HN: Why didn't the C64 come with Simons' BASIC in the box from 1983 onward?

Ask HN: If I cancel Codex today whats the next best local inference agent?

Comments

Ask HN: If I cancel Codex today whats the next best local inference agent?

Ask HN: Any advice on how to learn good software architecture practices?

Ask HK: How are you building AI apps today?

Ask HN: What Is an "AI Engineer"?

Ask HN: Does Claude Code remove the need for so many front-end frameworks?

Ask HN: Are Tech Meetups Dead?

Ask HN: I found out that I'm about to be laid off. How do people find jobs?

A disk-first C++ vector engine

Garnix, the Nix CI, is shutting down

Do not use Cloudflare DNS regsitrar

Ask HN: How do you feel about posts about GenAI taking over the HN front page?

Train 1T parameter LLM with 8 GPUs?

Ask HN: Is anyone working at least 4 hours daily on an Apple Vision Pro?

Ask HN: Why not have an EU browser?

Sqlit – A lazygit-style TUI for SQL databases

Ask HN: How do you model temporarily invalid data structures

Did the Linux memory management maintainer "just quit"?

Ask HN: When and why did you start believing in God?

Ask HN: Why didn't the C64 come with Simons' BASIC in the box from 1983 onward?