The Agent Evaluation Revolution - Detailed Overview & Context

This video introduces a new series on testing AI ...
This lecture discusses the critical shift from ...
Jason Lopatecki, Co-Founder and CEO of Arize AI, dives into the world of ...
In this video, I explain why observability is essential when building AI ...
Hamel Husain and Shreya Shankar teach the world's most popular course on AI evals and have trained over 2000 PMs and ...

Harsh Nilesh Pathak, Tech Lead ML/AI at GoDaddy, presents "Principles of AI ...
... verbosity, self-enhancement bias 00:47:22 Best practices 00:54:06 Factuality 01:00:15

The agent evaluation revolution
Agent Evaluation & Benchmarks - Agentic AI MOOC 2025 Lecture 4 Summary
Agent evaluation with ADK & Vertex AI | The Agent Factory Podcast
Top 5 AI Agent Evaluation Tools (2025): Maxim AI, Langfuse, Arize | LLM Observability Comparison
AI Agent evaluation: A complete guide to measuring performance
LLM as a Judge: Scaling AI Evaluation Strategies
Evaluating Agents and Assistants: The AI Conference
How to Evaluate AI Agents: Comprehensive Strategies for Reliable, High‑Quality Agentic Systems
Evaluating and Debugging Non-Deterministic AI Agents
AI Evaluation: Autonomous Agent Evaluation: How to Measure AI That Plans and Acts Independently |...
Evaluation & Observability in AI agents
How to evaluate agents in practice