Agent-Powered Retrieval with Haystack and Qdrant

Agent-Powered Retrieval with Haystack and Qdrant Promise and Pitfalls Bilge
Yücel, Developer Relations Engineer @ deepset

Hello 👋 Bilge Yücel • Developer Relations Engineer at deepset
🥑 • B.Sc. Computer Science • M.Sc. Artificial Intelligence • Learning & teaching how to build with AI in/bilge-yucel @bilgeyucl

Who is deepset? Company Solving Custom AI challenges since 2018.
HQ in Berlin and NYC. Backed by: Leading open source framework & commercial platforms for custom enterprise-grade AI Products Used by 70 Thought leaders

Scan the QR Code to receive materials and ask your
questions!

Are agents the future of search, or do classic retrievers
still rule?

Traditional Retrieval • Retrieve relevant information based on query •
Different algorithms: keyword-based, semantic, hybrid, …

Agent-Powered Retrieval • Use agent architecture for retrieval • Retrieval
as a tool → function calling

Movie Recommendation Assistant with Qdrant and Haystack 🍿

🍿 Tech Stack • 🎬 Pablinho/movies-dataset → 10k movies with
title, rating, description,... • Haystack → AI Orchestration Framework (⭐22.8k, open source, python) • Qdrant → Vector Database (free cloud cluster) • FastEmbed (Qdrant/minicoil-v1) → Sparse Neural Embedding • OpenAI (gpt-4o-mini) → Cheap, powerful

Load Data and Create Embeddings

Upload Data to Qdrant Collection

Retrieval Pipeline

Retrieval Pipeline + Filtering

Agent-Based Retrieval

🛠 Define the Tool • Name → retrieval_tool • Description
→ "Use this tool to get movies fitting to the user criteria" • Parameter → JSON schema • Function

JSON Schema • Define each parameter, query and metadata_filters •
Be precise with the metadata_filter by providing field names and options in detail • 💡Alternative: Treat field names as parameters

Agent • Bring them all together, create a Tool •
Pass that to your Agent alongside prompt and the LLM (OpenAIChatGenerator)

Test the Agent

Observability? Covered 👍 • Get insights on the tool names
and tool calls • Connect it to LLM observability tools

Advantages of Agent-Powered Retrieval

1⃣ Query Rewrite • Query: I love Titanic. Which movie
should I watch next?

2⃣ Metadata Extraction • Query: Find me a highly-rated action
movie about car racing

3⃣ Error/Retry mechanism • Query: Can you get me a
turkish movie rated above 8 that I can watch with my kids? Preferably an animation • Answer: It seems the search for Turkish animated movies rated above 8 was quite narrow, as I couldn't find any that met all your criteria….

Reality Check

Comparison Traditional Retrieval Agent-based Retrieval • Works with complex, ambiguous
queries • Error recovery & Retry • Filter generation • Slower 4 secs) • Less cheap :) • Fails with complex queries • No error recovery • Manual filtering • Faster 0.5 secs) • Cheap

🤝 Middle Ground • Filter generation (more performant than pure
retrieval) ✅ • Cheaper (than agent-based) ✅ • Faster (than agent-based) ✅ • Not every LLM supports ❌ • No query enhancements ❌ • No error handling ❌ Structured Output Generation + Retrieval

Thank you Weʼre hiring! deepset.ai/careers deepset.ai | haystack.deepset.ai in/bilge-yucel @bilgeyucl
Receive materials and ask questions

Agent-Powered Retrieval with Haystack and Qdrant

Agent-Powered Retrieval with Haystack and Qdrant

Bilge Yücel

More Decks by Bilge Yücel

Other Decks in Technology

Featured

Transcript