Upgrade to Pro — share decks privately, control downloads, hide ads and more …

From Search Results to Insights

From Search Results to Insights

GenAI services have been adopted successfully in no time across various digital business models, but what if your data has the better answers? How could this innovative technology be combined with a companies knowledge and data?

In this talk, we delve into the intricacies of Large Language Models (LLMs) and their augmentation with custom data through the use of Retrieval-Augmented Generation (RAG). Learn about Statista’s pioneering journey in moving from extensive search results to concise and well-founded answers, using their LLM-based application, ResearchAI. We will tackle the challenges faced, including building a skilled team for such an emerging technology, the impact of exclusive data sources on answer quality, high product costs and latency per request, and the tendency of LLMs to produce hallucinations despite the availability of accurate data. This session offers a realistic look at the hurdles encountered, and the strategies employed, providing valuable lessons on building and optimizing RAG applications in the real world.

Benedikt Stemmildt

January 20, 2025
Tweet

More Decks by Benedikt Stemmildt

Other Decks in Programming

Transcript

  1. Matthias Lau #machinelearning #developer #founder Ingo Schellhammer #biztech #cto Bene

    Stemmildt #socio-technical arch #networker #cto #velominatus
  2. 1. Add Traceability 2. Define your Metrics Relevance 3. Create

    a Reference Dataset 4. Measure a Baseline 5. Experiment and Measure Delta Recap Optimization Playbook.
  3. Retrieval Query Rewrite Query How tall is the Eiffel Tower?

    It looked so high when I was there last year? What is the height of the Eiffel Tower?
  4. Retrieval Query Rewrite Query Variants Retrieval Retrieval Reranking Which company

    had more revenue 2015 to 2020, Apple or Microsoft? Apple revenue from 2015 to 2020. Microsoft revenue from 2015 to 2020. Microsoft and Apple revenue comparison from 2015 to 2020.
  5. Retrieval HyDE Query Which company had more revenue 2015 to

    2020, Apple or Microsoft? Between 2015 and 2020, Apple consistently had higher revenue than Microsoft. Apple’s revenue grew from approximately $233.7 billion in 2015 to $274.5 billion in 2020, while Microsoft’s revenue increased from about $93.6 billion in 2015 to $143 billion in 2020.
  6. There’s no One-Fits-All. Query with good Retrievals Llama 3.1 8B

    👍 Query with the need for complex conclusions Llama 3.1 8B 👎
  7. 24.7 -10% 2.8c -65% 72% +140% 14.6 -41% 1.6c -43%

    72% +/- 0% POST HEUREKA IMPROVEMENTS TODAY (Sep. ‘24) 27.6 8.1c 30% START (Sep. ‘23)
  8. Meeting our quality ambition in a high-growth game 3,1 bn

    (+9%) 275 mn (-4%) 72 mn (+16%) 29 k (+11%) Quality Monthly visits