We made the switch at Cosmico when building AI-native applications with Claude Code. Agentic search gives us better accuracy but at higher cost per query.
For those running production systems:
1. What prompted your switch (or decision to stick with RAG)? 2. What broke during the transition? 3. What's your hybrid approach (if any)?
Specifically curious about code search vs. document retrieval use cases, and how you handle the latency/cost trade-offs.
Our context: building custom software with AI agents in weeks, not months. Accuracy matters more than speed for our workflow, but that's not universal.
What's working (or not working) for you?