Operating AI Agents: Failure and Recovery
Released 2/2026
With Kesha Williams
MP4 | Video: H.264, 1280x720 | Audio: AAC, 44.1 kHz, 2 Ch
Skill level: Intermediate | Genre: eLearning | Language: English + subtitles | Duration: 41m | Size: 140 MB
Course details
As AI agents shift from experimentation to production, operational failures can create serious business risks. This intermediate course explores practical techniques for monitoring agent behavior, tracing execution paths, and identifying failure modes across single‑ and multi‑agent systems. Through hands-on GitHub Codespaces exercises, you learn how to implement rollback mechanisms, build automated recovery workflows, and create reports that surface agent health and system status in real time. By the end of the course, you'll have the skills to improve the safety and predictability of AI agents in production, and to respond quickly and effectively when failures occur.
Skills covered
AI Security; AI Policy, Governance, and Regulation; Agentic AI Development