How are others benchmarking the agentic fulfillment of intent?
I've started exploring this space with intent-bench: https://intent-bench.github.io/intent-bench
ryan4rtmx•16m ago