
Marco Frodl
November 27, 2024

Advanced RAG AI-driven Retriever Selection with Turbo

Retrieval Augmented Generation (RAG) uses retrievers such as vector databases to find information relevant to answering a query. Complex RAG scenarios often draw on multiple data sources, and the appropriate retriever can be selected via a MultiRoute Chain, in which a Large Language Model (LLM) dynamically picks the semantically best data source.
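The MultiRoute idea can be sketched as follows. This is a minimal sketch, not the talk's actual code: the route names, descriptions, and `stub_llm` heuristic are illustrative stand-ins for a real hosted-model call, which is exactly the slow and costly part.

```python
# Sketch of LLM-driven retriever selection (the MultiRoute Chain idea).
# Route names/descriptions are illustrative; stub_llm stands in for a
# real LLM request, which adds latency and per-token cost.

ROUTES = {
    "hr_docs": "questions about vacation, payroll, and employee benefits",
    "product_docs": "questions about product features and the API",
}

def stub_llm(prompt: str) -> str:
    """Placeholder for a real LLM call: a naive keyword heuristic."""
    text = prompt.rsplit("Question:", 1)[-1].lower()
    if any(w in text for w in ("vacation", "payroll", "benefits")):
        return "hr_docs"
    return "product_docs"

def select_retriever(question: str) -> str:
    """Build a routing prompt and let the (stubbed) LLM pick a source."""
    route_list = "\n".join(f"- {name}: {desc}" for name, desc in ROUTES.items())
    prompt = (
        "Pick the best data source for the user's question.\n"
        f"Sources:\n{route_list}\n"
        f"Question: {question}\n"
        "Answer with the source name only."
    )
    return stub_llm(prompt)
```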

However, this approach is time-consuming and costly. A faster and more cost-effective alternative is the use of a Semantic Router, which uses an embedding model instead of an LLM for retriever selection. This approach offers comparable quality at significantly lower costs.
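The embedding-based routing step can be sketched like this, assuming routes are defined by example utterances. A toy bag-of-words vector stands in for a real embedding model (in practice you would use an encoder, e.g. via Aurelio AI's semantic-router library), so all names and utterances here are illustrative.

```python
# Minimal sketch of a Semantic Router: each route is defined by example
# utterances, and a query is routed to whichever route's utterance
# embedding is closest. The toy embed() below is a word-count stand-in
# for a real embedding model; route names/utterances are illustrative.
import math
from collections import Counter

ROUTES = {
    "hr_docs": [
        "how many vacation days do I have",
        "when is payroll processed",
    ],
    "product_docs": [
        "how do I authenticate against the API",
        "which features are in the pro plan",
    ],
}

def embed(text: str) -> Counter:
    """Toy embedding: bag-of-words counts. Swap in a real encoder."""
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[w] * b[w] for w in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0

def route(question: str) -> str:
    """Pick the route whose best utterance is closest to the question."""
    q = embed(question)
    scores = {
        name: max(cosine(q, embed(u)) for u in utterances)
        for name, utterances in ROUTES.items()
    }
    return max(scores, key=scores.get)
```

Because only embedding calls are involved (no token-by-token generation), this decision is far cheaper and faster than asking an LLM to choose.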

In a live-coding session, a MultiRoute Chain is implemented and then optimized by swapping it for the Semantic Router.
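Conceptually, only the selection step differs between the two variants; the rest of the chain stays the same. A sketch under that assumption, with stubbed retrievers and a stubbed final LLM call (route names, documents, and the `select` signature are illustrative):

```python
# Sketch: the selected route just swaps which retriever feeds the LLM.
# Whether `select` is an LLM-based MultiRoute Chain or an embedding-based
# Semantic Router, downstream retrieval and generation are unchanged.
# Retrievers and the final LLM call are stubbed; names are illustrative.

RETRIEVERS = {
    "hr_docs": lambda q: ["Employees receive 30 vacation days per year."],
    "product_docs": lambda q: ["The API uses OAuth2 bearer tokens."],
}

def rag_answer(question: str, select) -> str:
    """`select` maps a question to a route name (LLM or semantic router)."""
    route_name = select(question)
    context = " ".join(RETRIEVERS[route_name](question))
    # Stub for the final LLM call that would phrase the answer.
    return f"Based on {route_name}: {context}"
```

This separation is what makes the optimization a drop-in change: replacing the LLM-based selector with the embedding-based one leaves the rest of the chain untouched.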



Transcript

  1. Advanced RAG AI-driven Retriever Selection with Turbo. Turbo: https://www.aurelio.ai/semantic-router "Semantic Router is a superfast decision-making layer for your LLMs and agents. Rather than waiting for slow, unreliable LLM generations to make tool-use or safety decisions, we use the magic of semantic vector space — routing our requests using semantic meaning."
  2. (Continues slide 1.) "It’s perfect for: input guarding, topic routing, tool-use decisions."
  3. Turbo in Numbers: In my RAG example, a Semantic Router using remote services is 3.4 times faster than an LLM and 30 times less expensive. A local Semantic Router is 7.7 times faster than an LLM and 60 times less expensive.
  4. About Me: Marco Frodl, Principal Consultant for Generative AI, Thinktecture AG. X: @marcofrodl, E-Mail: [email protected], LinkedIn: https://www.linkedin.com/in/marcofrodl/, https://www.thinktecture.com/thinktects/marco-frodl/
  5. Refresher: What is RAG? "Retrieval-Augmented Generation (RAG) extends the capabilities of LLMs to an organization's internal knowledge, all without the need to retrain the model."
  6. Refresher: What is RAG? (https://aws.amazon.com/what-is/retrieval-augmented-generation/) "Retrieval-Augmented Generation (RAG) extends the capabilities of LLMs to an organization's internal knowledge, all without the need to retrain the model. It references an authoritative knowledge base outside of its training data sources before generating a response."
  7. Ask me anything. Simple RAG workflow: Question → Prepare Search (Embedding Model turns the Question into a Vector) → Search (Vector DB) → Search Results → LLM. Terms: Retriever, Chain. Elements: Embedding Model, Vector DB, Python, LLM, LangChain.
  8. Best source determination before the search. Advanced RAG workflow: Question → Retriever Selection (LLM) → Prepare Search (Embedding Model turns the Question into a Vector) → Search (Vector DB A or Vector DB B) → 0-N Search Results → LLM.
  9. (Repeat of slide 8.)
  10. Best source determination before the search. Advanced RAG with Semantic Router: Question → Retriever Selection (Embedding Model) → Prepare Search (Embedding Model turns the Question into a Vector) → Search (Vector DB A or Vector DB B) → 0-N Search Results → LLM.
  11. Your feedback is important to me: please rate my talk in the conference app, and I look forward to your questions and comments.