Ask HN: Has anyone replaced Claude/GPT with a local model for daily coding?

15•cloudking•1h ago

Has anyone here fully swapped Claude/GPT for a local model as their main coding tool, not just for side experiments? If so, please share your setup and performance (e.g tok/s)

Comments

tumetab1•18m ago

Not yet, tried Gemma 4 on an Apple M4 but the tok/s is significant lower than the cloud offering.

Also,the lack of enterprise tooling to help selected an appropriate model and tooling to run a local LLM does not help.

arjie•17m ago

Not “local” and not interactive coding but sharing since it might be helpful. I have 2x RTX Pro 6000 Blackwell running DeepSeek V4 Flash. I get 160 tok/s raw but it’s a reasoning model. For my use case, I have it auto-write code and another system auto-review the code.

I occasionally use it with pi to write some code and it’s blazing fast but it’s mostly habit that keeps me with CC and Codex.

HappySweeney•6m ago

I have an optane and lots of ram, so I tried full-fat models for writing some function overnight, as I get about 0.7 t/s. My current go-to test is to update a scalar function to transpose a bit-matrix to one using avx512. the cloud models all play with that like its nothing. Kimi 2.6 and GLM 5.1 both failed miserably.

How the UK social media ban could affect you

We are living in the dial-up era of AI

Immutability Changes Everything (2016) [pdf]

Show HN: I turned the Lex Fridman podcast archive into a browsable idea map

US says Trump, Vance and Iran's parliament speaker have signed deal to end war

How to Measure WWDC

Marc Andreessen on X: "SpaceX and the Sentient Sun " / X

Is This the End of Political Islam?

Mathematicians use Lean to verify proofs, whats the equivalent for patent claims

Protect an MCP Server with an Authorization Server

Terraform Registry Is Down

Show HN: 0-0.io – Multiplayer browser football with server-authoritative physics

Patched Claude Code, now 2–8× faster

OpenAI wins dismissal of trade secret lawsuit by Musk's xAI

Building an AI skill marketplace for GTM teams

Show HN: I built an open-source financial research terminal (SEC data and SQL)

EA Advertising

Claude Debugs a Postgres Alarm: Multixacts, SLRU Caches, and a False Crisis

TinyWind: A pixel pirate sailing game with real wind physics (380k+ kms sailed)

How Brexit has made Britain poorer – in charts

Sovereign AI is not just about building a national AI model

Would You Believe That This Is It?

UCCL-EP: DeepEP-style expert parallelism on any NIC, no GPU-initiated comms

Darkbloom Dashboard

Compiling Haskell into Lean: Common Abstract Syntax for Haskell and Provers

Memory safety CVEs differ between Rust and C/C++

I Talked to a Squirrel Today

Over half of parents of 18-25 year-olds track adult children w smartphone apps

To study how chips work, MIT researchers built their own operating system

Can Ukraine Isolate Crimea?