Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Advanced RAG – Dynamically Selecting the best R...

Advanced RAG – Dynamically Selecting the best Retrievers for Queries with AI

Retrieval Augmented Generation (RAG) uses data from retrieval systems such as Vector DBs to find the relevant information to answer a user query. When multiple retrieval systems are to be used, selecting the optimal retrieval system for a query can be a challenge. Marco Frodl will present the MultiRouteChain paradigm from the LangChain framework, which enables AI-based dynamic retrieval system selection based on semantic matching of the user query and the retrieval system focus. The live coding will illustrate how MultiRouteChain improves the performance of RAG for answering user queries by selecting the most relevant QA chain for retrieval for each query.

The talk for the AI-Meetup Frankfurt is very hands-on with lots of live demos and coding in Python and LangChain framework to show the difference between a simple RAG approach and this more real-world approach.

Marco Frodl

April 28, 2024
Tweet

More Decks by Marco Frodl

Other Decks in Programming

Transcript

  1. Generative AI Advanced RAG - AI-based Retriever Selection About Me

    Marco Frodl Principal Consultant for Generative AI Thinktecture AG @marcofrodl [email protected] https://www.thinktecture.com/thinktects/marco-frodl/
  2. Why is it important? Generative AI Advanced RAG - AI-based

    Retriever Selection Generative AI AI understands and generates natural language AI can access knowledge from the training phase
  3. Generative AI Advanced RAG - AI-based Retriever Selection What is

    RAG? https://aws.amazon.com/what-is/retrieval-augmented-generation/ RAG = Ingestion + Retrieval
  4. Ingestion Generative AI Advanced RAG - AI-based Retriever Selection Simple

    RAG in a nutshell Splitted (smaller) parts Embedding- Model Embedding 𝑎 𝑏 𝑐 … Vector- Database Document Metadata: Reference to original document
  5. Similarity search in a Vector DB Generative AI Advanced RAG

    - AI-based Retriever Selection Simple RAG in a nutshell
  6. Ingestion++ HyQE: Hypothetical Question Embedding Generative AI Advanced RAG -

    AI-based Retriever Selection Simple Advanced RAG in a nutshell LLM, e.g. GPT-3.5-turbo Transformed document Write 3 questions, which are answered by the following document. Chunk of Document Embedding- Model Embedding 𝑎 𝑏 𝑐 … Vector- Database Metadata: content of original chunk
  7. Ask me anything Generative AI Advanced RAG - AI-based Retriever

    Selection Simple RAG Question Prepare Search Search Results Question Answer LLM Vector DB Embedding Model Question as Vector Workflow Terms - Retriever - Chain Elements Embedding- Model Vector- DB Python LLM LangChain
  8. Similarity search in a Vector DB – Limits with K

    Generative AI Advanced RAG - AI-based Retriever Selection Simple RAG in a nutshell
  9. Similarity search in a Vector DB – Threshold Generative AI

    Advanced RAG - AI-based Retriever Selection Simple RAG in a nutshell
  10. Just one Vector DB/Retriever? • Multiple Generative AI-Apps • Scaling

    and Hosting • Query Parameter per Retriever • Prompts per Retriever • Fast Updates & Re-Indexing • Access Rights • Custom Retriever Generative AI Advanced RAG - AI-based Retriever Selection What’s wrong with Simple RAG? ✅ ✅ ✅ On-Premise AI-Apps Cloud Docs Public Tickets Features Website Sales Docs Internal Tickets
  11. Best source determination before the search Generative AI Advanced RAG

    - AI-based Retriever Selection Advanced RAG Question Retriever Selection 0-N Search Results Question Answer LLM Embedding Model Vector DB A Question as Vector Vector DB B LLM Prepare Search or
  12. Best source determination before the search Generative AI Advanced RAG

    - AI-based Retriever Selection Advanced RAG Retriever Selection LLM Vector DB A Vector DB B or
  13. Best source determination before the search Generative AI Advanced RAG

    - AI-based Retriever Selection Advanced RAG Question Retriever Selection 0-N Search Results Question Answer LLM Embedding Model Vector DB A Question as Vector Vector DB B LLM Prepare Search or Question Prepare Search Search Results Question Answer LLM Vector DB Embedding Model Question as Vector
  14. Dynamic Retriever Selection with AI Generative AI Advanced RAG -

    AI-based Retriever Selection Advanced RAG
  15. Dynamic Retriever Selection with AI Generative AI Advanced RAG -

    AI-based Retriever Selection Advanced RAG
  16. Dynamic Retriever Selection with AI Generative AI Advanced RAG -

    AI-based Retriever Selection Advanced RAG
  17. Dynamic Retriever Selection with AI Generative AI Advanced RAG -

    AI-based Retriever Selection Advanced RAG
  18. Dynamic Retriever Selection with AI Generative AI Advanced RAG -

    AI-based Retriever Selection Advanced RAG