Concept: use Claude Code as the primary agent, and have it call models like GLM 4.6 to offload routine or automatable tasks, minimizing Claude's token consumption. Orchestrate models such as GLM 4.6, Gemini 2.5 Pro, Kimi K2, Qwen-Coder, and other open or paid API-accessible models to distribute work intelligently.
Think of it as a grep-like tool where, instead of running a grep command, the system delegates the search or transformation to other LLMs and executes there. For trusted models (e.g., GLM 4.6), returning only a summary of actions is sufficient, while the detailed changes are applied directly to code and persisted.
I’m looking for a framework that coordinates multiple models similar to how Claude Code uses subagents.
bigyabai•21h ago