I’ve been trying to understand what is actually blocking broad adoption of truly agentic or autonomous systems. Lots of enterprises are running agent pilots right now, but I’m curious how teams decide whether these agents are succeeding or failing.
Further, what would be a good generic framework for thinking about this?