

RAG: Accuracy and Explainability in GenAI Applications

Accuracy and explainability are critical in GenAI applications. When information from AI-integrated solutions is inaccurate, it can impact businesses, people’s health, financial decisions, and even legal policy, causing cascading repercussions. Having the best data at the right time is vital.
LLMs are not able to handle this on their own, but retrieval augmented generation (RAG) can help by providing curated data as context to an LLM, guiding it to an appropriate answer. This session will explore how vector and graph RAG address the shortcomings of LLMs, explaining their shared functionality as well as some ways they handle it differently. Finally, we will see how to build a GenAI application with RAG to see these concepts in action.
Code: https://github.com/JMHReif/rag-vector-graph


Jennifer Reif

August 21, 2025



Transcript

  1. RAG: Accuracy and Explainability in GenAI Applications. Jennifer Reif, [email protected], @JMHReif, github.com/JMHReif, jmhreif.com, linkedin.com/in/jmhreif
  2. Who is Jennifer Reif? Developer Advocate, Neo4j • Tech speaker, author, blogger, podcaster • Continuous learner • Java Champion • Other: geek. Jennifer Reif, [email protected], @JMHReif, github.com/JMHReif, jmhreif.com, linkedin.com/in/jmhreif
  3. Negative AI stories: Even well-respected companies get it wrong • Hallucinating non-existent policy, legal cases • Chatbot produces Python • Legally binding vehicle offer • Harmful health advice • Threatening users • Inventing new language • Illegal activities (insider trading + local health laws)
  4. Standalone LLM: Doesn't often work • Design: human-consumable output, creative variation (probabilistic answers) • Problems: too little detail / vague prompt, missing information (recent or private knowledge), probabilistic ~= inconsistent
  5. How do we avoid this? Add as much context as possible • Guide LLM to relevant ideas and content • Focuses / narrows search area • Adds to LLM knowledge • Reduces margin of error (Photo by Ali Alauda on Unsplash)
  6. RAG architecture • Retrieval: data retrieved from external source • Augmented: augments response with facts • Generation: response in natural language [Diagram: (1) user prompt via chat API, (2) database search returning relevant results / documents, (3) prompt + relevant information sent to the LLM API, which generates the response]
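To make the three steps concrete, here is a minimal, hedged sketch of a RAG loop in Python. It is not the code from the linked repository; the `search_documents` retrieval helper and the model name are assumptions for illustration.

```python
# Minimal RAG loop sketch (illustrative, not the repository code).
# Assumes a search_documents(question) retrieval helper supplied by the caller,
# plus the OpenAI Python client (pip install openai) with OPENAI_API_KEY set.
from openai import OpenAI

client = OpenAI()

def answer_with_rag(question: str, search_documents) -> str:
    # 1. Retrieval: fetch relevant documents from an external source
    documents = search_documents(question)
    context = "\n\n".join(doc["text"] for doc in documents)

    # 2. Augmentation: combine the user prompt with the retrieved facts
    prompt = (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}"
    )

    # 3. Generation: the LLM produces a natural-language response
    response = client.chat.completions.create(
        model="gpt-4o-mini",  # assumed model name; swap in any chat model
        messages=[{"role": "user", "content": prompt}],
    )
    return response.choices[0].message.content
```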
  7. Types of context • Databases (all kinds) • Documents/folders • Media (music/images/video) • Charts (reports/visuals)
  8. Embeddings / Vectors: Convert data to a point in space • Data -> array of floating point numbers • Captures meaning and how it's used • Comparable format!
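As a hedged illustration of "data -> array of floating point numbers", the sketch below calls the OpenAI embeddings endpoint; the model name and dimensionality are assumptions, and any embedding model works the same way.

```python
# Sketch: convert text into an embedding (a point in vector space).
# Assumes the OpenAI Python client with OPENAI_API_KEY set; the model
# name is an assumption, not something prescribed by the talk.
from openai import OpenAI

client = OpenAI()

text = "Graphs store entities and the relationships between them."
result = client.embeddings.create(model="text-embedding-3-small", input=text)

vector = result.data[0].embedding   # list of floats, e.g. 1536 dimensions
print(len(vector), vector[:5])      # same-format vectors are comparable
```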
  9. Searching the data …efficiently • Vectors -> semantic similarity • Proximity in vector space • Example: library book classification - genre vs location of plot • More specific + more relevant! (Photo by Martin Adams on Unsplash)
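Proximity in vector space is typically measured with cosine similarity; the small NumPy sketch below uses made-up three-dimensional vectors just to show the comparison.

```python
# Cosine similarity: how closely two vectors point in the same direction.
import numpy as np

def cosine_similarity(a: np.ndarray, b: np.ndarray) -> float:
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

# Toy 3-dimensional "embeddings" for illustration only.
mystery_novel  = np.array([0.9, 0.1, 0.0])
crime_thriller = np.array([0.8, 0.2, 0.1])
cookbook       = np.array([0.0, 0.1, 0.9])

print(cosine_similarity(mystery_novel, crime_thriller))  # high: semantically close
print(cosine_similarity(mystery_novel, cookbook))        # low: unrelated
```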
  10. Vector database: Store and retrieve data efficiently • Create/store/search embeddings • Index is top unit of data storage • Prioritize stats • Some metadata/connections • Highly efficient semantic search
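As one concrete option (an assumption, since the slide names no specific product), a recent Neo4j 5.x database can serve as the vector store; the sketch below creates a vector index and runs a semantic search through the official Python driver. The index name, label, property, and dimension count are illustrative.

```python
# Sketch: vector index + semantic search in Neo4j 5.x via the Python driver.
# Index name, label, property, and dimension count are illustrative assumptions.
from neo4j import GraphDatabase

driver = GraphDatabase.driver("neo4j://localhost:7687", auth=("neo4j", "password"))

# Create a vector index over Chunk.embedding using cosine similarity.
driver.execute_query("""
    CREATE VECTOR INDEX chunk_embeddings IF NOT EXISTS
    FOR (c:Chunk) ON (c.embedding)
    OPTIONS {indexConfig: {
        `vector.dimensions`: 1536,
        `vector.similarity_function`: 'cosine'
    }}
""")

def semantic_search(question_embedding: list[float], k: int = 5):
    # Return the k chunks whose embeddings are closest to the question.
    records, _, _ = driver.execute_query(
        """
        CALL db.index.vector.queryNodes('chunk_embeddings', $k, $embedding)
        YIELD node, score
        RETURN node.text AS text, score
        """,
        k=k, embedding=question_embedding,
    )
    return [(r["text"], r["score"]) for r in records]
```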
  11. Where do vectors fall flat? How do you… • Limited metadata / connections • Verify vector representations? • Explain how it got to the answer? • Similar is not always the most relevant
  12. “Unstructured” data: Is it though? • Might be unstructured in appearance, length, styling • Structure is there! • Less obvious + consistent • Structure: themes, moments, stories
  13. Graphs -> how/why connections: Analyze data from relationships • Relationships + entities • Designed for storing / navigating networks • Traceable • Explainable
  14. Nodes (vertices): Objects or entities • Can have labels • May have properties [Diagram: example graph with Person nodes (Jennifer, Michael, Dominic), a School node (SIUE), a Company node (Edward Jones), and Degree nodes (Music, CMIS, CS)]
  15. Relationships (edges): Connect entities • Must have type (label) • Must have direction • May have properties [Diagram: same example graph with ATTENDED, WORKED_FOR, ENROLLED_IN, and COMPLETED relationships]
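The example graph on these two slides could be built with a few MERGE statements; the sketch below is a hedged reconstruction (relationship directions and the name property are assumptions) using the Neo4j Python driver.

```python
# Sketch: part of the slides' example graph (Person, School, Degree, Company).
# Relationship directions and the `name` property are illustrative assumptions.
from neo4j import GraphDatabase

driver = GraphDatabase.driver("neo4j://localhost:7687", auth=("neo4j", "password"))

driver.execute_query("""
    MERGE (jennifer:Person {name: 'Jennifer'})
    MERGE (siue:School {name: 'SIUE'})
    MERGE (cmis:Degree {name: 'CMIS'})
    MERGE (ej:Company {name: 'Edward Jones'})
    MERGE (jennifer)-[:ATTENDED]->(siue)      // relationships have a type...
    MERGE (jennifer)-[:ENROLLED_IN]->(cmis)   // ...and a direction
    MERGE (jennifer)-[:COMPLETED]->(cmis)
    MERGE (jennifer)-[:WORKED_FOR]->(ej)
""")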
  16. What is a graph? Answers through relationships • Who has similar journeys/experiences? • New friends/club members? • Where did an employee/student come from? • What makes 2 people similar? [Diagram: same example graph]
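Questions like "what makes two people similar?" then become traversals over those relationships; the query below is a hedged sketch with illustrative labels and property names.

```python
# Sketch: "What makes two people similar?" as a graph traversal.
# Finds anything (school, degree, company) both people connect to.
from neo4j import GraphDatabase

driver = GraphDatabase.driver("neo4j://localhost:7687", auth=("neo4j", "password"))

records, _, _ = driver.execute_query(
    """
    MATCH (p1:Person {name: $person1})-[r1]->(shared)<-[r2]-(p2:Person {name: $person2})
    RETURN shared.name AS shared_thing, labels(shared) AS kind,
           type(r1) AS rel_from_p1, type(r2) AS rel_from_p2
    """,
    person1="Jennifer", person2="Michael",
)
for record in records:
    print(record["shared_thing"], record["kind"],
          record["rel_from_p1"], record["rel_from_p2"])
```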
  17. Graphs connect the dots: Structured + Unstructured • Flexible schema = natural fit • Document connections, paths, etc. • Connect varying data structures
  18. GraphRAG: Using a graph as a data source for RAG • Can be one of many sources • Layer relationships from structured data • Caveat - still not a silver bullet!
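A common GraphRAG retrieval pattern (a general sketch, not the repository's exact query) combines the earlier vector search with a graph expansion around each hit, so the LLM receives both similar text and its connected facts; the index name, labels, and the MENTIONS relationship below are assumptions.

```python
# Sketch: GraphRAG retrieval = vector similarity search + graph expansion.
# Index name, labels, and the MENTIONS relationship are illustrative assumptions.
from neo4j import GraphDatabase

driver = GraphDatabase.driver("neo4j://localhost:7687", auth=("neo4j", "password"))

def graph_rag_retrieve(question_embedding: list[float], k: int = 5) -> list[str]:
    records, _, _ = driver.execute_query(
        """
        CALL db.index.vector.queryNodes('chunk_embeddings', $k, $embedding)
        YIELD node, score
        // Expand from each similar chunk to the entities it mentions
        OPTIONAL MATCH (node)-[:MENTIONS]->(entity)
        RETURN node.text AS text, score, collect(entity.name) AS related
        """,
        k=k, embedding=question_embedding,
    )
    # Hand both the text and its connected facts to the LLM as context
    return [f"{r['text']} (related: {', '.join(r['related'])})" for r in records]
```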
  19. Layers: GenAI systems • Vector search • Additional context retrieval • Lexical search / filtering • Polyglot system • Agents / tools • LLM decides which to use (and order) • Range: automation -> autonomous • MCP (https://www.anthropic.com/engineering/building-effective-agents)
  20. Resources • GitHub repository (today's code): github.com/JMHReif/rag-vector-graph • GraphAcademy LLM courses: graphacademy.neo4j.com/knowledge-graph-rag • Knowledge graph ebook: dev.neo4j.com/jmhreif-kg-ebook. Jennifer Reif, [email protected], @JMHReif, github.com/JMHReif, jmhreif.com, linkedin.com/in/jmhreif