Developer Day
Apr 17, 2025
4:00 pm - 4:50 pm

From Prototype to Production: Bridging the AI Reliability Gap

Workshop 3 | Gateway Pavilion - Pier 2 | 2 Marina Blvd, San Francisco, CA 94123

About this session

Building a prototype AI application is easy, but getting it to work reliably in production is not. Just ask the developers of New York’s MyCity, whose AI got caught telling businesses to break the law. (And there are many other stories like this!) We envision a future where AI applications avoid such issues by design, using automated detection and remediation of bad responses. In this hands-on workshop, we’ll walk you through a case study of building a reliable retrieval-augmented generation (RAG) application. We’ll start by implementing a baseline application, then integrate Cleanlab’s automated real-time evaluations to detect general issues such as knowledge gaps and hallucinations as well as custom application-specific evaluation criteria, and finally add real-time remediation of these issues.
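The baseline, detect, remediate flow described above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the workshop's code and not Cleanlab's API: the documents, the `retrieve`, `generate`, and `evaluate` helpers, the word-overlap score, and the `THRESHOLD` value are all hypothetical stand-ins for a real retriever, LLM call, and evaluation service.

```python
# Toy RAG pipeline with a real-time check and a fallback remediation.
# Every name and value here is a hypothetical stand-in, not a real API.

DOCS = [
    "Returns are accepted within 30 days of purchase with a receipt.",
    "Standard shipping takes 3 to 5 business days.",
]
FALLBACK = "Sorry, I can't answer that reliably. Please contact support."
THRESHOLD = 0.2  # assumed cutoff; a real system would tune this


def words(text):
    """Lowercase the text and split it into a set of punctuation-stripped words."""
    return {w.strip(".,?!").lower() for w in text.split()}


def retrieve(query):
    """Baseline retrieval: pick the document sharing the most words with the query."""
    return max(DOCS, key=lambda doc: len(words(query) & words(doc)))


def generate(query, context):
    """Stand-in for an LLM call: simply returns the retrieved context."""
    return context


def evaluate(query, context):
    """Toy knowledge-gap score: share of longer query words found in the context."""
    content = {w for w in words(query) if len(w) > 3}
    if not content:
        return 0.0
    return len(content & words(context)) / len(content)


def answer(query):
    """Run the pipeline and replace low-scoring responses with a fallback."""
    context = retrieve(query)
    response = generate(query, context)
    if evaluate(query, context) < THRESHOLD:
        return FALLBACK  # remediation step
    return response


if __name__ == "__main__":
    print(answer("How long does shipping take?"))
    print(answer("What is the CEO's salary?"))
```

The point of the sketch is the shape of the control flow: the check runs on every response before it reaches the user, and the remediation is a separate, swappable step.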

Session Speaker

Anish Athalye

Co-Founder & CTO
Cleanlab
Access Prerequisites

More sessions

Developer Day
Apr 17, 2025
3:30 pm

Harnessing AI Context for Superior Code Quality

Discover how agentic workflows can revolutionize software development with Qodo's AI-powered tools. Learn to use context awareness and automated review workflows to ensure impeccable code integrity. This session will explore real-world applications, focusing on best-practice integration and AI-driven insights to optimize your development process.

Developer Day
Apr 17, 2025
12:40 pm

The Full Stack of Open Generative AI

Join an AI expert from Meta for an in-depth look at the latest advancements in the open generative AI stack. This session will cover, from the metal to the agent, how large-scale AI systems are built, the tools used, and how you can build your own with open source AI from Meta, including PyTorch, Triton, Llama, and more.

Developer Day
Apr 17, 2025
3:50 pm

Ensuring Responsible AI with Advanced Model Evaluation

This session will cover Sama's comprehensive Model Evaluation services, emphasizing the importance of aligning AI systems with ethical guidelines and improving their performance. Attendees will learn how to systematically assess AI outputs, identify inaccuracies and vulnerabilities, and employ strategies for accelerated time-to-market with robust model validation. Leveraging Sama's expert-driven platform, participants will discover how to build reliable, high-performing generative AI models while adhering to industry standards.