I am tired of Anthropic's rate limits. Is there a coding model + coding harness combination that I can run 100% locally? I need a local AI model and coding harness that matches Claude Code in performance (ideally Opus-level, but Sonnet-level would also be amazing). With Claude Code's Mar 31 leak, we now have free, open source competing harnesses like https://github.com/ultraworkers/claw-code and https://github.com/Gitlawb/openclaude
Has anyone found a 100% local setup that rivals Claude's models in quality for professional-level agentic coding?
Some Context:
- I have about 20GB-30GB of space left on my Macbook Pro, which I can use for downloading a local model. I do not want to use Ollama Cloud and pay yet another $20/mo subscription.
- I have tried Gemma 4 on Ollama (Google's SOTA open source model that released Apr 2026), and it is absolute garbage. Stay away.