Concerto for Java and AI - Building Production-Ready LLM Applications (YOW! Australia 2024)

Concerto for Java and AI: Building Production-Ready LLM Applications
Thomas Vitale, YOW! Conference, December 2024
@thomasvitale.com

Thomas Vitale, Systematic

LLM, RAG, Prompting, Embeddings, Vector Stores, Hallucinations, Agents, Generative AI
Ph. Francisco Emilio Diaz

One Buzzword To Rule Them All

The WHY Factor
What problem does it solve?
How ready is it for production?
Yeah, but how about the DevEx?

Generative AI

Machine Learning: Subset of Artificial Intelligence
Platform/Infrastructure: Platform Engineers
HTTP API: Application Developer
Model Training, Model Inference: ML Engineers
Data Preparation: Data Scientists

If you like it, you should put an API on it

Model Inference via HTTP APIs
Application to Model Inference Service, over HTTP: Do you wanna build a snowman?
Application to Database Service, over JDBC: DELETE * FROM HYPE;
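
The point of this slide is that calling a model is an ordinary HTTP call. A minimal sketch using only the JDK HTTP client is below. It assumes a local Ollama server and its `/api/generate` endpoint; the base URL and the model name `llama3.2` are illustrative assumptions, and the JSON escaping is deliberately minimal.

```java
import java.net.URI;
import java.net.http.HttpRequest;
import java.time.Duration;

public class InferenceRequestSketch {

    // Minimal JSON string escaping for this sketch only; a real application
    // would use a JSON library instead.
    static String jsonEscape(String text) {
        return text.replace("\\", "\\\\")
                   .replace("\"", "\\\"")
                   .replace("\n", "\\n");
    }

    // Builds (but does not send) a POST request for a text-generation endpoint.
    // The path and body follow Ollama's /api/generate; baseUrl and model are assumptions.
    static HttpRequest buildRequest(String baseUrl, String model, String prompt) {
        String body = "{\"model\":\"" + jsonEscape(model) + "\","
                + "\"prompt\":\"" + jsonEscape(prompt) + "\","
                + "\"stream\":false}";
        return HttpRequest.newBuilder()
                .uri(URI.create(baseUrl + "/api/generate"))
                .timeout(Duration.ofSeconds(60))
                .header("Content-Type", "application/json")
                .POST(HttpRequest.BodyPublishers.ofString(body))
                .build();
    }

    public static void main(String[] args) {
        HttpRequest request = buildRequest("http://localhost:11434", "llama3.2",
                "Do you wanna build a snowman?");
        System.out.println(request.method() + " " + request.uri());
        // To actually call the service:
        // HttpClient.newHttpClient().send(request, HttpResponse.BodyHandlers.ofString());
    }
}
```

The request is only built here, so the sketch runs without a model service; sending it is the single commented line in `main`.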

Java for AI-Infused Applications: Integrations with Model Inference
Platform/Infrastructure: Platform Engineers
Model Training, Model Inference: ML Engineers
Data Preparation: Data Scientists
Application: Application Developers

Models and Inference Services

Inference Services and LLMs: How to choose?
Managed Service or Unmanaged Service
Cloud or On-Premises
Proprietary or Open Source

opensource.org/ai

ollama.com

Privacy

Chatbot
“Legacy software companies adding an AI chatbot to their product” @andykreed

Text Classification
Diagram labels: Application, Model Service, Classify, HARMONY
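
One way to sketch the classification flow in plain Java: build a prompt that asks for a single label, then map the model's free-text reply onto an enum with a safe fallback. The label set is hypothetical (only HARMONY appears on the slide, and treating it as a class label is an assumption), and the model call itself is left out.

```java
import java.util.Locale;

public class ClassificationSketch {

    // Hypothetical label set; only HARMONY appears on the slide.
    enum Category { HARMONY, MELODY, RHYTHM, UNKNOWN }

    // Builds a prompt that asks the model to answer with exactly one label.
    static String buildPrompt(String text) {
        return "Classify the text into one of: HARMONY, MELODY, RHYTHM.\n"
                + "Answer with the label only.\n\nText: " + text;
    }

    // Maps the model's free-text reply onto the enum; anything unexpected becomes UNKNOWN.
    static Category parse(String modelReply) {
        String normalized = modelReply.trim()
                .toUpperCase(Locale.ROOT)
                .replaceAll("[^A-Z]", "");
        try {
            return Category.valueOf(normalized);
        } catch (IllegalArgumentException e) {
            return Category.UNKNOWN;
        }
    }

    public static void main(String[] args) {
        System.out.println(parse(" harmony.\n"));
        System.out.println(parse("I think it is about jazz"));
    }
}
```

Falling back to UNKNOWN keeps a chatty or malformed reply from reaching the rest of the application as if it were a valid label.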

Resilience: LLM Applications, Resilience4J
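
The slide names Resilience4J; the sketch below does not use that library's API. It is a hand-rolled, JDK-only illustration of one resilience pattern, retry with exponential backoff, around a call that may fail, such as a model inference request. The method name and parameters are hypothetical.

```java
import java.util.function.Supplier;

public class RetrySketch {

    // Calls the supplier up to maxAttempts times, doubling the wait after each failure.
    static <T> T callWithRetry(Supplier<T> call, int maxAttempts, long initialDelayMillis) {
        if (maxAttempts < 1) {
            throw new IllegalArgumentException("maxAttempts must be at least 1");
        }
        long delay = initialDelayMillis;
        RuntimeException lastFailure = null;
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            try {
                return call.get();
            } catch (RuntimeException e) {
                lastFailure = e;
                if (attempt == maxAttempts) {
                    break; // no attempts left, rethrow below
                }
                try {
                    Thread.sleep(delay);
                } catch (InterruptedException interrupted) {
                    Thread.currentThread().interrupt();
                    throw new IllegalStateException("Interrupted while waiting to retry", interrupted);
                }
                delay *= 2;
            }
        }
        throw lastFailure;
    }

    public static void main(String[] args) {
        int[] calls = {0};
        // Simulated model call that fails twice before succeeding.
        String reply = callWithRetry(() -> {
            if (++calls[0] < 3) {
                throw new IllegalStateException("model service unavailable");
            }
            return "ok";
        }, 5, 10);
        System.out.println(reply + " after " + calls[0] + " calls");
    }
}
```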

LLM Observability
Generative AI, Request Rate, Errors, Duration, Prompt, Evaluation, Token Usage, Context Window

LLM Security Risks (1): OWASP Top 10 for LLM
Prompt Injection, Model Denial of Service, Sensitive Information Disclosure
OWASP Top 10 LLM Applications and Generative AI: https://genai.owasp.org/

Semantic Search: From Keywords to Meaning
Keywords: Application queries the SQL Store with LIKE '%melancholic%'
Meaning: Application sends "Melancholic" to the Embedding Model, receives [42…], and queries the Vector Store with [42…]
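
To make the "meaning" side concrete, here is a minimal sketch of what a vector store does at query time: compare the query embedding with stored embeddings and return the closest one. Cosine similarity is a common choice of metric, though the slide does not name one. The in-memory map and the toy two-dimensional vectors are assumptions for illustration; real embeddings come from an embedding model and have hundreds of dimensions or more.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class SemanticSearchSketch {

    // Cosine similarity: dot(a, b) / (|a| * |b|). Returns NaN for a zero vector.
    static double cosine(double[] a, double[] b) {
        if (a.length != b.length) {
            throw new IllegalArgumentException("Vectors must have the same length");
        }
        double dot = 0, normA = 0, normB = 0;
        for (int i = 0; i < a.length; i++) {
            dot += a[i] * b[i];
            normA += a[i] * a[i];
            normB += b[i] * b[i];
        }
        return dot / (Math.sqrt(normA) * Math.sqrt(normB));
    }

    // Returns the key of the stored vector most similar to the query, or null if the store is empty.
    static String mostSimilar(double[] query, Map<String, double[]> store) {
        String best = null;
        double bestScore = Double.NEGATIVE_INFINITY;
        for (Map.Entry<String, double[]> entry : store.entrySet()) {
            double score = cosine(query, entry.getValue());
            if (score > bestScore) {
                bestScore = score;
                best = entry.getKey();
            }
        }
        return best;
    }

    public static void main(String[] args) {
        // Toy embeddings, made up for this example.
        Map<String, double[]> store = new LinkedHashMap<>();
        store.put("A slow, sorrowful cello solo", new double[] {0.9, 0.1});
        store.put("An upbeat trumpet fanfare", new double[] {0.1, 0.9});
        double[] query = {1.0, 0.0}; // stands in for the embedding of "Melancholic"
        System.out.println(mostSimilar(query, store));
    }
}
```

Unlike the LIKE query, the match does not depend on the word "melancholic" appearing in the stored text.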

Data Ingestion
LLM Applications, JobRunr
Document Reader, Document Transformer, Document Writer

Question Answering with Docs: Retrieval Augmented Generation
Application sends "Melancholic instrument?" to the Embedding Model and receives [42…]
Get Similar Documents from the Vector Store
Send Question + Similar Documents to the Model
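
The last step of the flow, combining the question with the similar documents, can be sketched as plain prompt assembly. The prompt wording and the numbering of the documents are assumptions for illustration; retrieval and the model call are out of scope here.

```java
import java.util.List;

public class RagPromptSketch {

    // Combines the retrieved documents and the user's question into one prompt.
    static String buildPrompt(String question, List<String> similarDocuments) {
        StringBuilder prompt = new StringBuilder();
        prompt.append("Answer the question using only the context below.\n");
        prompt.append("If the context is not enough, say that you do not know.\n\n");
        prompt.append("Context:\n");
        for (int i = 0; i < similarDocuments.size(); i++) {
            prompt.append("[").append(i + 1).append("] ")
                  .append(similarDocuments.get(i)).append("\n");
        }
        prompt.append("\nQuestion: ").append(question);
        return prompt.toString();
    }

    public static void main(String[] args) {
        // Hypothetical documents standing in for the vector store results.
        List<String> documents = List.of(
                "The cello is often described as having a warm, mournful tone.",
                "The oboe is frequently used for plaintive melodies.");
        System.out.println(buildPrompt("Melancholic instrument?", documents));
    }
}
```

Telling the model to admit when the context is insufficient is one simple way to reduce made-up answers, though it does not eliminate them.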

Retrieval Augmented Generation: From Naive to Advanced

One Does Not Simply Test LLM Applications

Structured Data Extraction: From Text to JSON
Application sends Text to the Model (Text to Structured JSON), then saves the Structured JSON to the Database

Speech Transcription: From Speech to Text
Application sends Audio to the Audio Model (Audio to Text), then the Text to the Chat Model (Text to Structured JSON)

Image Processing: From Image to Text
Application sends the Image to the Image Model (Image to Text), then the Text to the Chat Model (Text to Structured JSON)

Hallucinations
Mitigating hallucination risks: Data Validation, JSON Schema, Humans in the Loop, Optional Values
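
A minimal sketch of how three of these mitigations might fit together in Java: an optional field so the model can leave a value out instead of inventing it, validation of whatever it did return, and a flag that routes failures to a human. The `Instrument` record, its fields, and the year range are hypothetical; JSON Schema enforcement is not shown.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Optional;

public class ExtractionValidationSketch {

    // Hypothetical extraction result; Optional marks a value the model may omit.
    record Instrument(String name, Optional<Integer> yearBuilt) {}

    // Returns a list of problems; an empty list means the record passed validation.
    static List<String> validate(Instrument instrument, int currentYear) {
        List<String> problems = new ArrayList<>();
        if (instrument.name() == null || instrument.name().isBlank()) {
            problems.add("name is missing");
        }
        instrument.yearBuilt().ifPresent(year -> {
            // Arbitrary plausibility range chosen for this example.
            if (year < 1000 || year > currentYear) {
                problems.add("yearBuilt out of range: " + year);
            }
        });
        return problems;
    }

    // Anything that fails validation goes to a human reviewer instead of the database.
    static boolean needsHumanReview(Instrument instrument, int currentYear) {
        return !validate(instrument, currentYear).isEmpty();
    }

    public static void main(String[] args) {
        Instrument plausible = new Instrument("Cello", Optional.empty());
        Instrument suspicious = new Instrument("Violin", Optional.of(3021));
        System.out.println(needsHumanReview(plausible, 2024));
        System.out.println(validate(suspicious, 2024));
    }
}
```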

LLM Security Risks (2): OWASP Top 10 for LLM
Insecure Output Handling, Excessive Agency, Insecure Plugin Design
OWASP Top 10 LLM Applications and Generative AI: https://genai.owasp.org/

Agents: Tools/Function Calling
Application sends the Question ("Is this instrument available?") to the Model
Model returns a Function Call Request
Application makes the Function Call to the API and gets the Result
Application sends the Function Call Result to the Model
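
The application side of this loop can be sketched as a small dispatcher: the model only asks for a function by name, and the application decides whether and how to run it. The request shape, the tool name `isInstrumentAvailable`, and the hard-coded availability rule are all hypothetical; a real tool would call an inventory API, and the model round trips are omitted.

```java
import java.util.Map;
import java.util.function.Function;

public class ToolCallingSketch {

    // A function call request as a model might return it: a tool name plus one argument.
    record FunctionCallRequest(String toolName, String argument) {}

    // Hypothetical tool registry; only tools listed here can ever be executed.
    static final Map<String, Function<String, String>> TOOLS = Map.of(
            "isInstrumentAvailable",
            instrument -> instrument.equalsIgnoreCase("cello") ? "available" : "not available");

    // Runs the requested tool; unknown tools are rejected rather than guessed.
    static String execute(FunctionCallRequest request) {
        Function<String, String> tool = TOOLS.get(request.toolName());
        if (tool == null) {
            return "error: unknown tool " + request.toolName();
        }
        return tool.apply(request.argument());
    }

    public static void main(String[] args) {
        // In a real flow this request would be parsed from the model's response.
        FunctionCallRequest request = new FunctionCallRequest("isInstrumentAvailable", "Cello");
        String result = execute(request);
        // The result would then be sent back to the model as the function call result.
        System.out.println(result);
    }
}
```

Keeping the registry explicit limits the model to the tools the application chose to expose.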

1979 IBM

Going to Production

“Friends don’t let friends write Dockerfiles!” - Josh Long

Cloud Native Buildpacks: From source code to container image
Build the Image with pack build or gradle bootBuildImage
https://buildpacks.io

Going to Production: Build & Deploy
Cloud Native Buildpacks, Kubernetes Service Binding, Native Executables with GraalVM

Service Bindings for Spring AI

Composer Assistant
https://github.com/ThomasVitale/concerto-for-java-and-ai
Ph. Francisco Emilio Diaz
@vitalethomas

Concerto for Java and AI: Building Production-Ready LLM Applications
Thomas Vitale, thomasvitale.com
https://github.com/ThomasVitale/llm-apps-java-spring-ai
https://github.com/ThomasVitale/concerto-for-java-and-ai
