fp.
newest
Open in hackernews
Demystifying Evals for AI Agents
https://www.anthropic.com/engineering/demystifying-evals-for-ai-agents
1
•
i7l
•
1h ago