AI Engineer Summit: Agents at Work!

Agent Evals: Finally, With The Map

February 19, 2025

A systematic and principled map of the key aspects of AI Agent Evaluation is presented. Agent Evals are often approached as a laundry list of ad hoc metrics, making it hard to plan ahead towards a comprehensive quality assurance for your agents. In contrast, this presentation directly provides you with a solid foundation for your agent evaluation roadmap, towards making your agents reliable, effective and safe.


Watch Recording