It uses a quantized Qwen 3 4B model and runs fully on your machine. I also added a Claude Code plugin via UserPromptSubmit that calls the CLI and swaps in the enriched prompt before the request goes to Claude.
It uses a quantized Qwen 3 4B model and runs fully on your machine. I also added a Claude Code plugin via UserPromptSubmit that calls the CLI and swaps in the enriched prompt before the request goes to Claude.