The Retrieval Bottleneck: Why Your AI Strategy is Still a Search Strategy

The industry has spent the last year obsessing over model parameters and prompt engineering, yet the biggest point of failure in Generative AI remains the one thing SEOs know best: Information Retrieval.
In this session, we peel back the magic of AI to reveal the Retrieval-Augmented Generation (RAG) architecture that actually powers it. We will demonstrate why an LLM is only as smart as the data it can successfully pull from an index. When AI fails to answer correctly, it’s rarely a "hallucination" problem; it’s usually a structure, freshness, or relevance problem.

Attendees will move beyond the AI hype to understand the mechanical reality of how content is processed, chunked, and surfaced. We’ll discuss why the future of visibility isn't about gaming an algorithm, but about building a technical foundation that makes your content impossible for an AI to miss.

Key Takeaways:

The RAG Reality Check: Why retrieval is the unsung hero (and single point of failure) of the AI era.

SEO’s New Mandate: Shifting focus from Ranking #1 to Contextual Retrievability.

The Architecture of Knowledge: How document hierarchy and metadata now serve as the primary map for AI models.

Dawn Anderson

March 20, 2026

Transcript

  1. The Retrieval Bottleneck

    ...Why Your AI Strategy is Still a Search Strategy
  2. SEO & GEO Summit 2026 | Dawn Anderson | Bertey | Founder & SEO Consultant

    The AI Goldrush
    Every company now claims an AI strategy. Billions invested in foundation models. Endless discussions about:
    • Tokens
    • Model size
    • Prompts
    • Temperatures
  3. What Everyone Talks About

    Most AI conversations focus on:
    • Prompt engineering
    • Model parameters
    • Fine-tuning
    • LLM benchmarks
    But very few people talk about… RETRIEVAL
  4. Understanding Why Information Retrieval (IR)

    ...is the true engine behind AI answers
  5. Who is Dawn Anderson?

    • SEO practitioner for almost 20 years
    • International SEO conference speaker since 2017
    • Boutique agency owner
    • Enterprise SEO consultant
    • Information retrieval & AI search world interloper
    • Pomeranian lover ;P
  6. Under every AI answer lies:

    • an index
    • a retrieval engine
    • a ranking system
  7. The Retrieval Bottleneck

    LLMs cannot generate what they cannot retrieve. Failures usually come from:
    • missing documents
    • outdated data
    • poor structure
    • weak relevance signals
  8. The Hallucination Reality

    Common explanation: “LLMs hallucinate.”
    Reality: Often a retrieval failure.
  9. 'The Hallucination Narrative'

    Just filling in the probability gaps as the model sees fit in the face of uncertainty.
  10. A Bit Like 'GuesSEO' and the Misinformation Super Highway
  11. Open Book Analogy

    LLMs are like students in an open-book exam. But the answers depend on which pages were in the book.
  12. What is RAG?

    Retrieval-Augmented Generation combines:
    • Search
    • Indexing
    • Language models
  13. Why RAG Exists

    LLMs alone have limitations:
    • Static knowledge
    • Outdated training data
    • Hallucination risk
    RAG adds external retrieval.
  14. The Return of IR

    Information Retrieval research now powers AI search.
  15. The RAG Pipeline
  16. Where Errors Actually Start

    Typical failure chain:
    • Missing documents
    • Weak retrieval
    • Incomplete context
    • Model guesses (educated guesstimating)
  17. Generation is the Last Step
  18. Retrieval Determines Truth

    AI answers depend on:
    • What was retrieved?
    • What was ranked highest (importance and consensus for LLMs)?
    • What entered context?
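
The retrieve, rank, context, generate order described in these slides can be sketched in a few lines of Python. This is a toy illustration under stated assumptions, not how any production system is built: the three-document `INDEX`, the term-overlap scoring, and the `generate` stand-in (which only echoes its context instead of calling a model) are all hypothetical.

```python
from collections import Counter

# Hypothetical three-document index; real systems index far more.
INDEX = {
    "doc-a": "RAG combines search indexing and language models",
    "doc-b": "Pomeranians are small fluffy dogs",
    "doc-c": "Retrieval quality decides what enters the model context",
}

def tokenize(text):
    return text.lower().split()

def retrieve(query):
    """Score each document by how many query-term occurrences it contains."""
    q_terms = set(tokenize(query))
    scored = []
    for doc_id, text in INDEX.items():
        counts = Counter(tokenize(text))
        score = sum(counts[t] for t in q_terms)
        if score > 0:
            scored.append((doc_id, score))
    return scored

def rank(candidates, top_k=2):
    """Keep only the highest-scoring candidates."""
    return sorted(candidates, key=lambda c: c[1], reverse=True)[:top_k]

def build_context(ranked):
    """Only ranked documents ever reach the model."""
    return "\n".join(INDEX[doc_id] for doc_id, _ in ranked)

def generate(query, context):
    """Stand-in for an LLM call: it can only use what retrieval supplied."""
    if not context:
        return "No supporting context retrieved; any answer would be a guess."
    return f"Answer to {query!r} based on:\n{context}"

for q in ("retrieval context", "pricing"):
    print(generate(q, build_context(rank(retrieve(q)))))
```

The second query finds nothing in the index, so the generation step has nothing to work from, which is the failure chain from slide 16 in miniature.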
  19. The Retrieval Bottleneck

    AI systems fail when:
    • content isn’t indexed
    • context isn’t retrieved
    • relevance signals are weak
  20. Where SEO Fits

    SEO influences:
    • Crawlability
    • Indexing
    • Structure
    • Retrievability
    SEO is becoming even more foundational.
  21. Sparse vs Dense Retrieval

    Two retrieval approaches with many models:
    • sparse (includes BM25)
    • dense (includes vector search)
  22. Reality of Modern Hybrid Retrieval

    Modern search uses hybrid retrieval: BM25 + vector search
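
A minimal sketch of hybrid retrieval, assuming a three-document corpus. The BM25 function follows the standard formula with a non-negative IDF variant; the dense side uses hand-written three-dimensional vectors as stand-ins for real embeddings. The fusion step uses reciprocal rank fusion (RRF), which is one common way to merge rankings and is an assumption here, since the slides do not say how the two result lists are combined.

```python
import math
from collections import Counter

DOCS = {
    "d1": "bm25 ranks documents by exact keyword overlap",
    "d2": "vector search matches meaning using embeddings",
    "d3": "hybrid retrieval blends keyword and vector search",
}
# Hypothetical hand-written "embeddings"; a real system would use a trained model.
VECTORS = {"d1": [0.9, 0.1, 0.0], "d2": [0.1, 0.9, 0.2], "d3": [0.6, 0.6, 0.1]}

def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Sparse scoring: exact term matches, weighted by rarity and doc length."""
    tokenized = {d: text.split() for d, text in docs.items()}
    n_docs = len(tokenized)
    avg_len = sum(len(t) for t in tokenized.values()) / n_docs
    df = Counter(term for toks in tokenized.values() for term in set(toks))
    scores = {}
    for doc_id, toks in tokenized.items():
        tf = Counter(toks)
        score = 0.0
        for term in query.split():
            if tf[term] == 0:
                continue
            idf = math.log((n_docs - df[term] + 0.5) / (df[term] + 0.5) + 1)
            norm = tf[term] + k1 * (1 - b + b * len(toks) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores[doc_id] = score
    return scores

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b)))

def dense_scores(query_vec, vectors):
    """Dense scoring: similarity between the query vector and each doc vector."""
    return {d: cosine(query_vec, v) for d, v in vectors.items()}

def to_ranking(scores):
    return sorted(scores, key=scores.get, reverse=True)

def rrf(rankings, k=60):
    """Reciprocal rank fusion: reward documents ranked well in any list."""
    fused = Counter()
    for ranking in rankings:
        for position, doc_id in enumerate(ranking, start=1):
            fused[doc_id] += 1 / (k + position)
    return [doc_id for doc_id, _ in fused.most_common()]

sparse = to_ranking(bm25_scores("keyword search", DOCS))
dense = to_ranking(dense_scores([0.5, 0.7, 0.1], VECTORS))
hybrid = rrf([sparse, dense])
print(sparse, dense, hybrid)
```

Fusing ranks rather than raw scores avoids having to put BM25 scores and cosine similarities on a common scale.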
  23. 'Query Fan Out'

    AI systems expand queries into multiple searches. Works with the notion of 'recall' improvement to move from 'sparse' to 'dense' retrieval.
  24. Why 'Fan Out' Exists

    Fan-out improves:
    • coverage
    • recall
    • context
  25. Example 'Fan Out'

    Query: “How does AI search work?”
    Subqueries:
    • RAG architecture
    • vector search
    • ranking models
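
The fan-out example can be sketched as follows. The subqueries come from the slide; everything else is an assumption made for illustration: the `FAN_OUT` lookup table (real systems typically generate subqueries with a model), the four-page `CORPUS`, and the strict all-terms-must-match `search` function.

```python
# Hypothetical expansion table using the slide's example subqueries.
FAN_OUT = {
    "how does ai search work?": ["RAG architecture", "vector search", "ranking models"],
}

# Hypothetical corpus of page titles.
CORPUS = {
    "p1": "An overview of RAG architecture for search teams",
    "p2": "How vector search uses embeddings",
    "p3": "Learning to rank: ranking models explained",
    "p4": "Pomeranian grooming tips",
}

def fan_out(query):
    """Expand a query into subqueries; fall back to the query itself."""
    return FAN_OUT.get(query.strip().lower(), [query])

def search(subquery, corpus):
    """Return ids of documents containing every term of the subquery."""
    terms = subquery.lower().split()
    return [
        doc_id for doc_id, text in corpus.items()
        if all(t in text.lower().split() for t in terms)
    ]

def fan_out_search(query, corpus):
    """Run every subquery and record which subqueries surfaced each document."""
    hits = {}
    for sub in fan_out(query):
        for doc_id in search(sub, corpus):
            hits.setdefault(doc_id, []).append(sub)
    return hits

question = "How does AI search work?"
print(search(question, CORPUS))          # the literal question on its own
print(fan_out_search(question, CORPUS))  # the expanded subqueries
```

In this toy corpus the literal question matches nothing, while the three subqueries each surface a relevant page, which is the recall and coverage gain the previous slide describes.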
  26. Documents Become Data

    AI systems process content as:
    • chunks
    • embeddings
    • vectors
    NOT PAGES
  27. Chunking

    Documents are broken into chunks. Each chunk becomes a retrieval unit. Poor chunking can cause:
    • fragmented context
    • incomplete answers
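
To make "fragmented context" concrete, here is a small sketch comparing two chunking strategies on a hypothetical two-paragraph policy text. The function names and the six-word window are illustrative assumptions; real pipelines usually count model tokens rather than words.

```python
# Hypothetical source text: two self-contained paragraphs.
TEXT = (
    "Returns are accepted within 30 days of delivery.\n\n"
    "Refunds are issued to the original payment method."
)

def chunk_fixed(text, size=6):
    """Split on a fixed word count, ignoring sentence and paragraph boundaries."""
    words = text.split()
    return [" ".join(words[i:i + size]) for i in range(0, len(words), size)]

def chunk_by_paragraph(text):
    """Split on blank lines so each chunk is a self-contained paragraph."""
    return [" ".join(p.split()) for p in text.split("\n\n") if p.strip()]

for chunk in chunk_fixed(TEXT):
    print("fixed:    ", chunk)
for chunk in chunk_by_paragraph(TEXT):
    print("paragraph:", chunk)
```

The fixed-size splitter produces a middle chunk that mixes the tail of the returns sentence with the head of the refunds sentence, so neither fact can be retrieved whole. The paragraph splitter keeps each statement intact.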
  28. LLMs have limited context windows

    Only a small number of chunks can be included.
  29. Ranking Still Matters

    Retrieval → Ranking → Context
    Ranking decides:
    • which chunks are used
    • which are ignored
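
A minimal sketch of how ranking and a limited context window interact, assuming a greedy selection policy and a word count as a crude stand-in for a tokenizer. The scores, chunk texts, and the 13-word budget are hypothetical.

```python
def select_for_context(ranked_chunks, token_budget):
    """Take chunks in descending score order, skipping any that do not fit.

    ranked_chunks is a list of (score, text) pairs.
    """
    chosen, used = [], 0
    for score, text in sorted(ranked_chunks, key=lambda c: c[0], reverse=True):
        cost = len(text.split())  # crude stand-in for a real token count
        if used + cost <= token_budget:
            chosen.append(text)
            used += cost
    return chosen

chunks = [
    (0.40, "Our office dog is a Pomeranian."),
    (0.92, "Returns are accepted within 30 days."),
    (0.85, "Refunds go to the original payment method."),
]
print(select_for_context(chunks, token_budget=13))
```

With this budget only the two highest-scoring chunks enter the context; the lowest-ranked chunk is ignored no matter how accurate it is.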
  30. Semantic Collapse

    Large models risk semantic collapse... Meaning distinctions blur.
  31. Semantic Metadata As Navigation

    Semantic Metadata helps AI understand:
    • hierarchy
    • topic boundaries
    • relationships
  32. Traditional SEO Goal

    Traditional objective: Rank #1
    Focus on:
    • Keywords
    • Backlinks
    • SERP positions
  33. The New Objective

    BE RETRIEVED... BE EXTRACTED... BE CITED... BE TRUSTED...
  34. The Retrieval Question

    As well as asking... "Can we rank this?"
    Ask... "Can we get this retrieved / extracted / cited?"
  35. Contextual Visibility

    • Semantic Clarity
    • Context Relevance
    • Entity Relationships
  36. Contextual Retrievability

    Content must be:
    • structured
    • understandable
    • machine interpretable
  37. Structure is the Map

    Structure helps machines understand:
    • hierarchy
    • relationships
    • topic organisation
  38. Why Headings Matter

    Headings create:
    • semantic anchors
    • clear topical sections
    THOSE HEADINGS WERE NEVER JUST FOR DESIGN PURPOSES. Use them ALL down the hierarchy to h6.
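
One way to see headings acting as semantic anchors is to turn a page's h1 to h6 tags into a breadcrumb path per section, which could then be attached to each chunk as metadata. The sketch below uses Python's standard `html.parser`; the `HeadingOutline` class name and the sample HTML are hypothetical, and whether any particular AI system builds such paths is an assumption, not something the slides state.

```python
from html.parser import HTMLParser

class HeadingOutline(HTMLParser):
    """Collect one breadcrumb path (e.g. 'Returns > Refunds') per heading."""

    def __init__(self):
        super().__init__()
        self._level = None  # level of the heading currently being read, if any
        self._text = []     # text fragments of that heading
        self._stack = []    # (level, title) path down to the current heading
        self.paths = []     # breadcrumb string for every heading seen

    def handle_starttag(self, tag, attrs):
        if len(tag) == 2 and tag[0] == "h" and tag[1] in "123456":
            self._level = int(tag[1])
            self._text = []

    def handle_data(self, data):
        if self._level is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if self._level is not None and tag == f"h{self._level}":
            title = "".join(self._text).strip()
            # Drop headings at the same or a deeper level before descending.
            while self._stack and self._stack[-1][0] >= self._level:
                self._stack.pop()
            self._stack.append((self._level, title))
            self.paths.append(" > ".join(t for _, t in self._stack))
            self._level = None

# Hypothetical page markup.
HTML = (
    "<h1>Returns</h1><p>Policy overview.</p>"
    "<h2>Refunds</h2><h3>Timing</h3>"
    "<h2>Exchanges</h2>"
)

outline = HeadingOutline()
outline.feed(HTML)
print(outline.paths)
```

A chunk taken from under "Timing" would carry the path "Returns > Refunds > Timing", so it stays interpretable even when it is retrieved without the rest of the page.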
  39. Internal Links

    Internal links create: knowledge pathways & Hierarchical Navigable Small World Graphs (HNSW)
    Emulations of 'real world' entities and their connections
  40. Context Graphs

    These connected documents form 'context graphs' to aid both relevance and disambiguation.
    "You shall know content by the pages it is connected to".
    Firthian approaches & improved retrieval reasoning
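
The idea of knowing content by the pages it is connected to can be illustrated with a plain adjacency map of internal links. This is a simple link graph, not an HNSW index, and the URLs and the `neighbours` helper are hypothetical.

```python
# Hypothetical site: each page maps to the pages it links to.
LINKS = {
    "/jaguar-cars": ["/car-reviews", "/luxury-vehicles"],
    "/jaguar-animal": ["/big-cats", "/rainforest-wildlife"],
    "/car-reviews": ["/jaguar-cars"],
    "/big-cats": ["/jaguar-animal"],
}

def neighbours(page, links):
    """Pages linked from or linking to `page` (undirected view of the graph)."""
    out_links = set(links.get(page, []))
    in_links = {src for src, targets in links.items() if page in targets}
    return out_links | in_links

for page in ("/jaguar-cars", "/jaguar-animal"):
    print(page, "->", sorted(neighbours(page, LINKS)))
```

Both pages are about "jaguar", but their neighbourhoods do not overlap: one sits among vehicle pages and the other among wildlife pages. That surrounding context is what lets a system tell the two meanings apart.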
  41. Topic Networks

    Build topic clusters
    • Strengthen relationships and themes between pages
    • Semantic similarity is reinforced via the power of 'Firthian' linguistics
  42. Entity Based Content

    AI systems understand entities. Content should clarify:
    • people
    • organisations
    • concepts
  43. Knowledge Graphs

    Knowledge graphs connect:
    • entities
    • documents
    • relationships
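
A knowledge graph is often modelled as subject, predicate, object triples. The sketch below stores three facts taken from this deck's own title slide; the predicate names (`founderOf`, `speakerAt`, `presentedBy`) and the helper functions are hypothetical choices made for illustration.

```python
# Facts taken from the deck's title slides; predicate names are hypothetical.
TRIPLES = [
    ("Dawn Anderson", "founderOf", "Bertey"),
    ("Dawn Anderson", "speakerAt", "SEO & GEO Summit 2026"),
    ("The Retrieval Bottleneck", "presentedBy", "Dawn Anderson"),
]

def match(triples, subject=None, predicate=None, obj=None):
    """Return the triples that agree with every field that is not None."""
    return [
        (s, p, o) for s, p, o in triples
        if subject in (None, s) and predicate in (None, p) and obj in (None, o)
    ]

def related_entities(triples, entity):
    """Everything directly connected to `entity`, in either direction."""
    outgoing = {o for s, _, o in triples if s == entity}
    incoming = {s for s, _, o in triples if o == entity}
    return outgoing | incoming

print(match(TRIPLES, predicate="founderOf"))
print(sorted(related_entities(TRIPLES, "Dawn Anderson")))
```

Here a person, an organisation, an event, and a document are all connected through one entity, which is the entity, document, and relationship linking the slide lists.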
  44. AI Needs Fresh Data

    LLMs rely on retrieval for freshness
  45. The Freshness Problem

    Outdated content → outdated answers
  46. Retrieval and Time

    Retrieval systems evaluate:
    • relevance (precision)
    • authority (consensus / trust)
    • freshness (what can add to the closed book value?)
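
As a rough illustration of weighing the three signals together, the sketch below combines relevance, authority, and an exponentially decaying freshness value into one score. The weights, the 180-day half-life, and the input values are all hypothetical; the slides do not describe how any real system combines these signals.

```python
from datetime import date, timedelta

def freshness(published, today, half_life_days=180):
    """Exponential decay: 1.0 when new, 0.5 after one half-life."""
    age_days = max((today - published).days, 0)
    return 0.5 ** (age_days / half_life_days)

def combined_score(relevance, authority, published, today,
                   weights=(0.6, 0.25, 0.15)):
    """Weighted sum of relevance, authority, and freshness (all in 0..1)."""
    w_rel, w_auth, w_fresh = weights
    return (w_rel * relevance
            + w_auth * authority
            + w_fresh * freshness(published, today))

today = date(2026, 3, 20)
fresh_page = combined_score(0.8, 0.5, today - timedelta(days=10), today)
stale_page = combined_score(0.8, 0.5, today - timedelta(days=900), today)
print(round(fresh_page, 3), round(stale_page, 3))
```

Two pages with identical relevance and authority end up with different scores purely because of age, which is how outdated content can lose its place in the context.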
  47. The New SEO Mission

    Solid SEO as: Engineering retrievability (ranking optimisation and retrieval optimisation)
  48. Retrieval First Content

    Design content for:
    • human-first trust
    • focused-brevity
    • topical connections
    • structure
    • think 'inverted pyramid of journalism'
  49. Machine Readable Knowledge

    Content must be:
    • structured
    • semantically clear
    • connected
  50. Indexability Still Matters

    AI systems rely on:
    • crawlable pages
    • accessible content
    • stable URLs
    Watch out for a 'crawl budget' comeback as a huge SEO topic.
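
A quick way to check whether a page is crawlable for a given bot is Python's standard `urllib.robotparser`. The robots.txt content and the example.com URLs below are hypothetical; `GPTBot` is used only as an example of an AI crawler user agent, and `OtherBot` is a made-up name.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for illustration.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /private/

User-agent: *
Disallow: /admin/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

def is_crawlable(user_agent, url):
    """True if the parsed robots.txt rules allow this agent to fetch the URL."""
    return parser.can_fetch(user_agent, url)

for agent, url in [
    ("GPTBot", "https://example.com/blog/retrieval"),
    ("GPTBot", "https://example.com/private/report"),
    ("OtherBot", "https://example.com/admin/"),
]:
    print(agent, url, is_crawlable(agent, url))
```

A page that is blocked here never reaches an index, so no amount of content quality can get it retrieved.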
  51. AI Bots are Rampant Nosey Crows
  52. Retrieval Optimisation

    Focus on:
    • structure
    • metadata
    • entities
    • semantic similarity
    • topic focus
    • avoiding semantic drift
  53. Retrieval Friendly Content

    Include:
    • clear sections
    • descriptive headings
    • contextual explanations
    • multi-modal aspects
  54. Context Window Design

    Brevity and self-contained topics and sub-topics have never been more important. 'Chunking' matters but don't fall for what is potentially the modern day 'keyword stuffing'.
  55. Don't 'Chunk' Spam

    Chunks and context windows are important for AI Search, but it's arguably not the job of SEOs to create tiny blocks of content.
  56. Solid Technical Foundation

    Everything rests on solid technical foundations. Signals include:
    • architecture
    • canonicalisation
    • linking
    • structured data
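
For the structured data signal, a minimal sketch of generating schema.org `Article` markup as JSON-LD is shown below. The property names are standard schema.org vocabulary; the helper name `article_jsonld` and the example.com URL are hypothetical, and the headline, author, and date are taken from this deck's own metadata.

```python
import json

def article_jsonld(headline, author_name, date_published, url):
    """Build a schema.org Article description as a plain dictionary."""
    return {
        "@context": "https://schema.org",
        "@type": "Article",
        "headline": headline,
        "author": {"@type": "Person", "name": author_name},
        "datePublished": date_published,
        "mainEntityOfPage": url,
    }

markup = json.dumps(
    article_jsonld(
        headline="The Retrieval Bottleneck: Why Your AI Strategy is Still a Search Strategy",
        author_name="Dawn Anderson",
        date_published="2026-03-20",
        url="https://example.com/retrieval-bottleneck",  # hypothetical URL
    ),
    indent=2,
)
print(markup)
```

The resulting JSON would typically be embedded in the page inside a `script` element of type `application/ld+json`.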
  57. The Retrieval Bottleneck

    AI systems fail when retrieval fails.
    'Closed book' trained models are NOT enough.
    Other knowledge / search MUST augment.
  58. The New Visibility Model

    Retrievability → Context → Generation
  59. The Real AI Strategy

    A real AI strategy includes:
    • knowledge architecture
    • retrieval infrastructure
    • content design & structure
  60. Why SEOs Matter

    SEOs already understand:
    • crawling
    • indexing
    • ranking
    • structure
  61. The Future of Search

    Search evolves from: documents → knowledge
  62. Key Takeaways

    1. Retrieval is the biggest AI failure point
    2. SEO shifts to retrievability
    3. Knowledge architecture determines visibility
  63. The organisations that understand and optimise for retrieval will be the ones whose knowledge becomes visible in the AI search era.
  64. Thank you

    X - @dawnieando
    Bluesky - @dawnieando
    LinkedIn - Ms Dawn Anderson
    Threads - @dawnieando
    Bertey.com