From https://www.jfokus.se/talks.html?showid=2701
Testing is hard, which is why developers tend to avoid it. Testing non-deterministic things is even harder, which is unfortunate, since we're all writing AI-infused applications, and AI models are notoriously non-deterministic. What happens when the applications start using advanced features, such as RAG, tools, and agents? How do you test these applications? There must be some tools, technologies, and practices out there that can help, while not costing your organization lots of money!
Join Java Champions Oleg & Eric in this session as they revisit a topic they debuted at JFokus last year. The AI landscape changes at a breathtaking pace, so what new capabilities and strategies have come along in the last year?
Hopefully by the end of the presentation you will be able to answer the question "If I change my model/prompt/application, did I get better or worse"?