GenAI Testing Workshop: Hands-on Techniques for Building Reliable AI Apps Using Automated Evaluation and CI Integration

Building Reliable GenAI Applications: A Hands-on Testing & CI Workshop

Testing LLM-based applications has become one of the most crucial challenges in modern software development. While traditional software testing gives us clear pass/fail criteria, how do you verify that your AI is consistently giving good responses? When is a response "correct enough"? And how do you automate this testing process in a way that scales?In this hands-on workshop, we tackle these challenges head-on by building and testing three different types of AI applications. Rather than getting ...