From Prototype to Production: Bridging the AI Reliability Gap
Apr 17, 2025
America/Los_Angeles
Building a prototype AI application is easy, but getting it to work reliably in production is not. Just ask the developers of New York’s MyCity, whose AI was caught telling businesses to break the law. (And there are many other stories like this!) We envision a future where AI applications avoid such issues by design, through automated detection and remediation of bad responses. In this hands-on workshop, we’ll walk you through a case study of building a reliable retrieval-augmented generation (RAG) application. We’ll start by implementing a baseline application, then integrate Cleanlab’s automated real-time evaluations, which detect general issues such as knowledge gaps and hallucinations and also check responses against custom, application-specific criteria. Finally, we’ll add real-time remediation of these issues.
Workshop 3 | Gateway Pavilion - Pier 2 | 2 Marina Blvd, San Francisco, CA 94123