Most AI Agents Are Broken. Let’s Fix That

Most AI Agents Are Broken. Letʼs Fix That Bilge Yücel,
Sr. DevRel Engineer @ deepset

Hello 👋 Bilge Yücel • Sr. DevRel Engineer at deepset
• B.Sc. Computer Science • M.Sc. Artificial Intelligence • Learning & teaching how to build with AI in/bilge-yucel @bilgeyucl

Who is deepset? Company Solving Custom AI challenges since 2018.
Offices in Berlin, Munich, Hamburg, London and Barcelona. Backed by: Leading open source framework & commercial platforms for custom enterprise-grade AI Products Used by 70 Thought leaders

Where are you on your agent journey?

What is an AI agent? An AI agent is an
LM-based system that autonomously pursues a goal by interacting with its environment using tools. Human LLM Call Environment Action Feedback Stop

Why Agents Fail? 🤔

“Agent involves a lot more work than expected and 90%
of it is pure engineering. Nothing really to do with the LLM, but it is how to blend LLM into an agentic workflow that makes senseˮ - a Haystack community member

Agent Harness Agent = Model + Harness Harness Engineering: How
to Build Reliable AI Agents by Engineering the System, Not the Model

Failure Reason 1 Too Much Trust, Too Little Design 🛠
Too Many Tools Complexity grows exponentially with every additional tool. 📉 No Fallback Plans No contingency when the primary agentic path, tools fails. 🔍 No Observability Without tracing, debugging failures becomes nearly impossible. 🛡 No Guardrails Missing safety and validation checks for agent inputs/outputs.

Failure Reason 2 Wrong Use Case • You have complex,
multi-step problems requiring diverse actions • Tasks involve multiple tools / sources • Goals are clear but the optimal path to achieve them isn't predetermined

multi-step problems requiring diverse actions • Tasks involve multiple tools / sources • Goals are clear but the optimal path to achieve them isn't predetermined • When interactions can follow predictable patterns (e.g. Q&A • Tasks can be decomposed into clear steps • Stability, robustness and efficiency are prioritized over automation potential

multi-step problems requiring diverse actions • Tasks involve multiple tools / sources • Goals are clear but the optimal path to achieve them isn't predetermined • When interactions can follow predictable patterns (e.g. Q&A • Tasks can be decomposed into clear steps • Stability, robustness and efficiency are prioritized over automation potential Agents for Reasoning, Pipelines for Defined Flow

How to design reliable agents?

• Open-source AI orchestration framework by deepset • Backbone of
the Haystack Enterprise Platform • Agents, RAG & Context Engineering with visibility, control and modularity • Building blocks: Components & Pipelines & Agents Component Component Component Component Pipeline pip install haystack-ai

Haystack Agents User Request Generated Answer Agent LM (e.g. OpenAI,
Anthropic, Google, Open Models) System Prompt Tools  Python Functions  External APIs  Haystack Pipelines Components Agents  MCP Servers

Letʼs Build!

Itinerary Agent A multi-agent travel itinerary planning system that to
create comprehensive travel plans with accommodation optimization and detailed daily itineraries. Orchestration Haystack Agent Tools Various MCP Servers • Perplexity • Google Maps • Optimal Route Model Provider OpenAI (gpt-4.1 Deployment Hayhooks REST API layer User Interface Open WebUI Guardrails, streaming, & observability

https://itinerary-agent.deepset.ai

Itinerary Agent - Agent Architecture

Observability - Langfuse

Itinerary Agent - LLM

Itinerary Agent - Guardrails

Itinerary Agent - Tools

Itinerary Agent - Tools Specify the tools

Itinerary Agent - Tools BM25 search for progressive discovery Specify
the tools

Itinerary Agent - Sub Agents

Itinerary Agent - Sub Agents Agents as tools through ComponentTool
Only last_message is returned

Itinerary Agent

Itinerary Agent Error Handling If a tool call fails, the
error is sent back to the LLM by default. Resilience Ensure uptime with FallbackChatGenerator for model connections. Termination Stops on direct text output or reaching max_agent_step limit.

Soon: Haystack 3.0 ☀  Simplified UX Improving the user
experience of the Agent component for better accessibility.  Enhanced Harness Expanding harness capabilities to provide more robust agents.  Advanced Streaming Improved streaming capabilities for real-time data processing.

Practical Tips For Building Agents Optimal Autonomy Maintain control over
your system by finding the right level of autonomy. Multi-Agent Systems Split tasks between specialized agents like planners, coders, and fixers. 'Fat' Tools Encapsulate logic, retries, and fallbacks directly within the tool logic. Guardrails & Observability Validate all inputs/outputs and log every step with comprehensive tracing. Human-in-the-Loop Ensure agents are overseen and guided by people for optimal reliability. Minimize Tools Avoid unnecessary complexity in context; every additional tool increases the risk of failure.

Unbreakable AI Agents with Haystack  Human-in-the-Loop View Tutorial 
Mem0 Memory Store View Cookbook  AI Guardrails & Safety Explore Content Moderation  Multi-Agent Systems System Design Guide  Context Engineering for Agentic Systems Deep Dive into Agent Logic

Thank you! 🤖 Demo: itinerary-agent.deepset.ai 🌐 Demo GitHub: github.com/deepset-ai/itinerary-agent STAY
CONNECTED 🌟 Haystack GitHub: github.com/deepset-ai/haystack 🤝 Ambassadors: haystack.deepset.ai/ambassadors in/bilge-yucel @bilgeyucl Get the presentation

Most AI Agents Are Broken. Let’s Fix That

Most AI Agents Are Broken. Let’s Fix That

Bilge Yücel

More Decks by Bilge Yücel

Other Decks in Programming

Featured

Transcript

Most AI Agents Are Broken. Letʼs Fix That Bilge Yücel,

Hello 👋 Bilge Yücel • Sr. DevRel Engineer at deepset

Who is deepset? Company Solving Custom AI challenges since 2018.

Where are you on your agent journey?

What is an AI agent? An AI agent is an

Why Agents Fail? 🤔

“Agent involves a lot more work than expected and 90%

Agent Harness Agent = Model + Harness Harness Engineering: How

Failure Reason 1 Too Much Trust, Too Little Design 🛠

Failure Reason 2 Wrong Use Case • You have complex,

Failure Reason 2 Wrong Use Case • You have complex,

Failure Reason 2 Wrong Use Case • You have complex,

How to design reliable agents?

• Open-source AI orchestration framework by deepset • Backbone of

Haystack Agents User Request Generated Answer Agent LM (e.g. OpenAI,

Letʼs Build!

Itinerary Agent A multi-agent travel itinerary planning system that to

https://itinerary-agent.deepset.ai

Itinerary Agent - Agent Architecture

Observability - Langfuse

Itinerary Agent - LLM

Itinerary Agent - Guardrails

Itinerary Agent - Tools

Itinerary Agent - Tools Specify the tools

Itinerary Agent - Tools BM25 search for progressive discovery Specify

Itinerary Agent - Sub Agents

Itinerary Agent - Sub Agents Agents as tools through ComponentTool

Itinerary Agent

Itinerary Agent Error Handling If a tool call fails, the

Soon: Haystack 3.0 ☀  Simplified UX Improving the user

Practical Tips For Building Agents Optimal Autonomy Maintain control over

Unbreakable AI Agents with Haystack  Human-in-the-Loop View Tutorial 

Thank you! 🤖 Demo: itinerary-agent.deepset.ai 🌐 Demo GitHub: github.com/deepset-ai/itinerary-agent STAY