Slide 1

Slide 1 text

Advanced RAG
Dynamically Selecting the best Retrievers for Queries with AI
Marco Frodl
@marcofrodl
Principal Consultant for Generative AI

Slide 2

Slide 2 text

Why is it important?
Generative AI: User Input, AI Processing, Generated Output
OpenAI: GPTx, DALL-E 3, GPT-4 Vision, Whisper, Text-to-Speech (TTS-1)
Input/output labels: Text, Text

Slide 3

Slide 3 text

What is RAG?
https://aws.amazon.com/what-is/retrieval-augmented-generation/
RAG = Ingestion + Retrieval

Slide 4

Slide 4 text

Demo: RAG

Slide 5

Slide 5 text

About Me
Marco Frodl
Principal Consultant for Generative AI
Thinktecture AG
X: @marcofrodl
E-Mail: [email protected]
https://www.thinktecture.com/thinktects/marco-frodl/

Slide 6

Slide 6 text

Ingestion
Simple RAG in a nutshell
Document -> Split (smaller) parts -> Embedding-Model -> Embedding (a, b, c, …) -> Vector-Database
Metadata: Reference to original document
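The ingestion steps on this slide can be sketched in a few lines of plain Python. This is a minimal illustration, not the code from the demo: `split_text`, `toy_embed`, `Record` and `ingest` are hypothetical names, the word-hashing "embedding" is only a stand-in for a real embedding model, and a Python list plays the role of the vector database.

```python
import hashlib
import math
from dataclasses import dataclass


def split_text(text: str, chunk_size: int = 40) -> list[str]:
    """Split a document into chunks of at most `chunk_size` words."""
    words = text.split()
    return [" ".join(words[i:i + chunk_size]) for i in range(0, len(words), chunk_size)]


def toy_embed(text: str, dim: int = 64) -> list[float]:
    """Assumption: stand-in for an embedding model. Hashes words into buckets, then L2-normalises."""
    vec = [0.0] * dim
    for word in text.lower().split():
        bucket = int(hashlib.md5(word.encode()).hexdigest(), 16) % dim
        vec[bucket] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


@dataclass
class Record:
    embedding: list[float]
    text: str
    metadata: dict  # reference back to the original document


def ingest(doc_id: str, text: str, store: list[Record], chunk_size: int = 40) -> None:
    """Split the document, embed every chunk, and append it to the (in-memory) store."""
    for n, chunk in enumerate(split_text(text, chunk_size)):
        store.append(Record(toy_embed(chunk), chunk, {"source": doc_id, "chunk": n}))
```

In a real setup the list would be replaced by a vector database client and `toy_embed` by a call to an embedding model; the metadata keeps the link to the source document so that search hits can be traced back.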

Slide 7

Slide 7 text

Demo: Ingestion

Slide 8

Slide 8 text

Ingestion++
HyQE: Hypothetical Question Embedding
Simple Advanced RAG in a nutshell
Chunk of Document -> LLM, e.g. GPT-3.5-turbo -> Transformed document -> Embedding-Model -> Embedding (a, b, c, …) -> Vector-Database
Prompt: Write 3 questions that are answered by the following document.
Metadata: content of original chunk
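A rough sketch of the HyQE idea: instead of embedding the chunk itself, an LLM writes questions the chunk answers, the questions are embedded, and the original chunk travels along as metadata. The names `hyqe_records`, `fake_llm` and `fake_embed` are hypothetical; the LLM and the embedding model are passed in as plain callables so the example runs without any external service.

```python
from typing import Callable

# Prompt adapted from the slide.
PROMPT = "Write 3 questions that are answered by the following document.\n\n{chunk}"


def hyqe_records(
    chunk: str,
    llm: Callable[[str], str],
    embed: Callable[[str], list[float]],
) -> list[dict]:
    """Return one record per generated question; each record carries the original chunk."""
    reply = llm(PROMPT.format(chunk=chunk))
    # Assumption: the model answers with one question per line.
    questions = [line.strip() for line in reply.splitlines() if line.strip()]
    return [
        {"embedding": embed(q), "question": q, "metadata": {"content": chunk}}
        for q in questions
    ]


def fake_llm(prompt: str) -> str:
    """Stand-in for a real model call; always returns the same three questions."""
    return "What is RAG?\nWhy split documents?\nWhere are vectors stored?"


def fake_embed(text: str) -> list[float]:
    """Stand-in for an embedding model: (length, number of spaces)."""
    return [float(len(text)), float(text.count(" "))]


chunk = "RAG combines ingestion and retrieval."
records = hyqe_records(chunk, fake_llm, fake_embed)
```

At query time the user's question is compared against question embeddings, which tend to be closer in form than a question and a document passage; the answer is then built from `metadata["content"]`.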

Slide 9

Slide 9 text

Watch the Webinar
Simple Advanced RAG in a nutshell
https://www.thinktecture.com/webinare/moderne-semantic-search-mit-llms-vektor-datenbanken-und-langchain/

Slide 10

Slide 10 text

Ask me anything
Simple RAG
Workflow labels: Question, Prepare Search, Search Results, Question, Answer, LLM, Vector DB, Embedding Model, Question as Vector
Terms
- Retriever
- Chain
Elements: Embedding-Model, Vector-DB, Python, LLM, Langchain 🦜🔗
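The Simple RAG workflow (question, question as vector, vector search, results plus question to the LLM) might look roughly like this in plain Python. It is a sketch under stated assumptions: `embed` is a bag-of-words stand-in for an embedding model, `Retriever` is a hypothetical in-memory class rather than a LangChain type, and the LLM is any callable that maps a prompt to text.

```python
import math
import re
from collections import Counter
from typing import Callable


def embed(text: str) -> Counter:
    """Assumption: bag-of-words counts stand in for a real embedding vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a if t in b)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


class Retriever:
    """Hypothetical retriever: holds embedded chunks and returns the k most similar ones."""

    def __init__(self, chunks: list[str]) -> None:
        self.index = [(embed(c), c) for c in chunks]

    def search(self, question: str, k: int = 2) -> list[str]:
        q = embed(question)  # question as vector
        ranked = sorted(self.index, key=lambda item: cosine(q, item[0]), reverse=True)
        return [chunk for _, chunk in ranked[:k]]


def answer(question: str, retriever: Retriever, llm: Callable[[str], str], k: int = 2) -> str:
    """The chain: search first, then hand search results and question to the LLM."""
    context = "\n".join(retriever.search(question, k))
    prompt = f"Answer using only this context:\n{context}\n\nQuestion: {question}"
    return llm(prompt)


retriever = Retriever([
    "Invoices are sent monthly by email.",
    "The API rate limit is 100 requests per minute.",
    "Support is available on weekdays.",
])
```

Calling `answer("What is the API rate limit?", retriever, llm)` with a real model callable would send the rate-limit chunk as context; with an echoing callable such as `lambda p: p` the assembled prompt can be inspected directly.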

Slide 11

Slide 11 text

Demo: Simple RAG

Slide 12

Slide 12 text

How to Debug/Trace Generative AI-Apps?

Slide 13

Slide 13 text

Demo: Debugging

Slide 14

Slide 14 text

Just one Vector DB?
What's wrong with Simple RAG?

Slide 15

Slide 15 text

Just one Vector DB/Retriever?
• Multiple GenAI-Apps
• Scaling and Load Balancing
• Query Params per Retriever
• Hosting (Environment, Product)
• Fast Updates & Re-Indexing
• Access Rights
• Custom Retriever
What's wrong with Simple RAG?
Diagram labels: On-Premise, AI-App 🦜🔗, Cloud; sources: Docs Public Tickets Features Website Sales Docs Internal Tickets
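Two of the points above, query params per retriever and access rights, can be illustrated with a small registry. This is only a sketch: `RetrieverConfig`, `REGISTRY`, `retrievers_for`, the role names and the `top_k` values are all hypothetical, and the retriever names are loosely based on the source labels in the diagram.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RetrieverConfig:
    name: str
    hosting: str                                  # e.g. "cloud" or "on-premise"
    top_k: int = 4                                # query parameter that differs per retriever
    allowed_roles: frozenset = frozenset({"employee"})


# Hypothetical registry; values are made up for illustration.
REGISTRY = [
    RetrieverConfig("docs", "cloud", top_k=4, allowed_roles=frozenset({"public", "employee"})),
    RetrieverConfig("public-tickets", "cloud", top_k=6, allowed_roles=frozenset({"public", "employee"})),
    RetrieverConfig("internal-tickets", "on-premise", top_k=8),
]


def retrievers_for(role: str) -> list[RetrieverConfig]:
    """Return only the retrievers the given role is allowed to query."""
    return [cfg for cfg in REGISTRY if role in cfg.allowed_roles]
```

With several independent retrievers, each source can be re-indexed, scaled and secured on its own, which a single shared vector database makes harder.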

Slide 16

Slide 16 text

Finding the best source before asking
Advanced RAG
Diagram labels: Question, Retriever Selection (0-N), Search Results, Question, Answer, LLM, Embedding Model, Vector DB A, Question as Vector, Vector DB B, LLM, Prepare Search, or
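The retriever selection step (an LLM picks zero to N sources before any search happens) could be sketched as follows. All names here are assumptions: `select_retrievers`, `DESCRIPTIONS` and `stub_llm` are hypothetical, the descriptions are invented, and the stub uses a keyword rule only so the example runs without a model.

```python
from typing import Callable

# Hypothetical catalogue: retriever name -> short description shown to the LLM.
DESCRIPTIONS = {
    "docs": "Product documentation and how-to guides",
    "tickets": "Support tickets and known issues",
    "features": "Planned and released product features",
}


def select_retrievers(
    question: str,
    descriptions: dict[str, str],
    llm: Callable[[str], str],
) -> list[str]:
    """Ask the LLM which retrievers fit the question; returns 0..N known names."""
    catalog = "\n".join(f"- {name}: {desc}" for name, desc in descriptions.items())
    prompt = (
        "Pick the sources that can answer the question. "
        "Reply with a comma-separated list of source names, or 'none'.\n"
        f"Sources:\n{catalog}\n"
        f"Question: {question}"
    )
    reply = llm(prompt)
    names = [n.strip().lower() for n in reply.split(",")]
    # Anything the model invents (or 'none') is dropped; an empty list means "no search".
    return [n for n in names if n in descriptions]


def stub_llm(prompt: str) -> str:
    """Stand-in for a real model: keyword match on the question part of the prompt."""
    question = prompt.rsplit("Question:", 1)[-1].lower()
    picks = [name for name in DESCRIPTIONS if name[:-1] in question]
    return ", ".join(picks) or "none"
```

An empty result lets the app skip retrieval and answer directly; one or more names mean the question is embedded and sent to each selected vector database, and the combined search results go to the LLM together with the question.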

Slide 17

Slide 17 text

Finding the best source before asking
Advanced RAG
Advanced RAG diagram labels: Question, Retriever Selection (0-N), Search Results, Question, Answer, LLM, Embedding Model, Vector DB A, Question as Vector, Vector DB B, LLM, Prepare Search, or
Simple RAG diagram labels: Question, Prepare Search, Search Results, Question, Answer, LLM, Vector DB, Embedding Model, Question as Vector

Slide 18

Slide 18 text

Demo: Dynamic Retriever Selection with AI