Our CI pipeline was calling live models during integration tests, which meant every pull request consumed tokens.
Mocking at the HTTP layer felt brittle. Hardcoded fixtures were unrealistic. Calling live models felt unnecessary just to validate application logic.
Curious how others here are handling this.
Are you stubbing SDK calls? Using deterministic fixtures? Or just running tests against live providers?
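To make the first two options concrete, here is a rough sketch of what I mean by stubbing at the SDK boundary with a deterministic fixture. It is illustrative only, not what we run: `CompletionClient`, `FakeCompletionClient`, and `classify_ticket` are hypothetical names, and it assumes the application talks to the provider through a thin interface of its own rather than calling a vendor SDK directly.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Protocol


class CompletionClient(Protocol):
    """Hypothetical interface the app depends on instead of a vendor SDK."""

    def complete(self, prompt: str) -> str: ...


@dataclass
class FakeCompletionClient:
    """Deterministic stand-in: returns canned replies and records prompts."""

    replies: Dict[str, str]
    default: str = "UNKNOWN"
    calls: List[str] = field(default_factory=list)

    def complete(self, prompt: str) -> str:
        self.calls.append(prompt)
        # Match on a substring so small prompt-template edits
        # do not invalidate the fixture.
        for key, reply in self.replies.items():
            if key in prompt:
                return reply
        return self.default


def classify_ticket(client: CompletionClient, text: str) -> str:
    """Hypothetical application logic under test."""
    raw = client.complete(f"Classify this support ticket: {text}")
    label = raw.strip().lower()
    # Anything outside the known labels falls back to "other".
    return label if label in {"billing", "bug", "other"} else "other"


def test_classify_ticket_normalizes_model_output() -> None:
    fake = FakeCompletionClient(replies={"refund": "  Billing\n"})
    assert classify_ticket(fake, "I want a refund") == "billing"
    assert classify_ticket(fake, "hello?") == "other"
    assert len(fake.calls) == 2
```

The test exercises the parsing and fallback logic without any network call or token spend. The obvious tradeoff is that the canned replies only cover the output shapes someone thought to write down, which is the realism problem mentioned above.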
Would love to hear what has worked in production.