Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Building AI Applications with Java, LLMs, and S...

Building AI Applications with Java, LLMs, and Spring AI

This presentation will guide you through building Java applications with AI capabilities, leveraging Generative AI and Large Language Models (LLMs) using Spring AI. You will learn how to:

- Integrate chat models into your applications, including prompt design and managing structured outputs.
- Implement Retrieval-Augmented Generation (RAG) workflows to combine LLMs with your custom data, covering both basic and advanced use cases.
- Enable agentic behavior by integrating your APIs with LLMs through tool calling.
- Explore approaches for evaluating and monitoring LLM-based applications.
- Enhance the developer experience with tools such as Testcontainers and Ollama.

Avatar for Thomas Vitale

Thomas Vitale

November 19, 2025
Tweet

More Decks by Thomas Vitale

Other Decks in Technology

Transcript

  1. Machine Learning Subset of Arti fi cial Intelligence Platform/Infrastructure Platform

    Engineers HTTP API Application Developer Model Training Model Inference ML Engineers Data Preparation Data Scientists @thomasvitale.com
  2. Java for AI-Infused Applications Integrations with Model Inference Platform/Infrastructure Platform

    Engineers Model Training Model Inference ML Engineers Data Preparation Data Scientists Application Developers Application @thomasvitale.com
  3. Application Inference Service Consum es LLM s Architecture Database Reads/writes

    data Observability Platform Exports telem etry Spring Boot Application Vaadin Spring AI Arconia @thomasvitale.com
  4. Arconia Dev Services and OpenTelemetry arconia dev gradle bootRun mvn

    spring-boot:run @thomasvitale.com arconia.io
  5. Multimodality Inference Service Request Response Modalities and Structured Output Question

    Answer Application Format Instructions Output Converter @thomasvitale.com
  6. Chat Memory Inference Service Request Response Question Answer Application Multiple

    Interactions @thomasvitale.com Augment with Memory Memory Read Update Memory Write
  7. Prompt Stuffing Inference Service Request Response Augmenting Prompts with Context

    Answer Application @thomasvitale.com Question Context
  8. Retrieval Augmented Generation Inference Service Request Response Question Answer Application

    Augment with Context Web Search Engines Search Engine HTTP API @thomasvitale.com Context
  9. Retrieval Augmented Generation Inference Service Request Response Question Answer Application

    Augment with Context Vector Stores Vector Store Semantic Search @thomasvitale.com
  10. Retrieval Augmented Generation Inference Service Request Response Question Answer Application

    Augment with Context Prompt Augmentation with Retrieved Context Source Query @thomasvitale.com
  11. Tools Inference Service Request Tool Calling Question Response Answer Application

    API Tool Call Tool Execution Tool Call Request Tool Call Response @thomasvitale.com
  12. API MCP Server MCP Inference Service Request Tools Question Response

    Answer Application Tool Call MCP Client Tool Call Request Tool Call Response @thomasvitale.com