Here's what's available:
- Google Gemini - 5 models including gemini-2.5-pro and gemini-2.5-flash. Up to 250K tokens per minute across all models. The pro model has a 1M context window on the free tier.
- Cohere - command-a-03-2025 and command-a-reasoning-08-2025. 1,000 calls per month, 256K context.
- Kilo Code - 4 models including Qwen 3.6 Plus, Nemotron 3 Super 120B, and Step 3.5 Flash. Around 200 requests per hour. Some support image and video input.
The whole point is to get started without spending anything. Connect one or two free providers, set up a routing config with fallbacks, and you already have a working setup. If Gemini hits its rate limit, Manifest falls back to Cohere or Kilo Code automatically.
More ready-to-setup free models are coming. hey are all listed here: https://manifest.build/free-models
We're still in beta and actively trying to understand how people use this. What does your setup look like? What providers are you using? If you run into anything weird or have feedback, we want to hear it.