Upgrade to Pro — share decks privately, control downloads, hide ads and more …

.NET Day 2025: Enhancing Legal Document Analysi...

.NET Day 2025: Enhancing Legal Document Analysis with Reflection Agents, Semantic Kernel, and Azure AI Search

Avatar for dotnetday

dotnetday

August 29, 2025
Tweet

More Decks by dotnetday

Other Decks in Technology

Transcript

  1. AGENDA LLM Basics Customer Project - FTA The Microsoft Way

    - Azure AI Services Step-by-Step Improvements - FTA Conclusion
  2. AZURE AI SERVICES OpenAI Vision Speech Language Content Safety Face

    Document Intelligence Azure AI services AI Search AI Agent Service AI Model Inference AI Foundry
  3. 1ST ITERATION Focus on 50 documents Index Laws per article

    Index PDFs per page Azure Open AI – on your data Vector Search
  4. EVALUATION TYPES LLM evaluation How good the foundation models performs

    on a certain task. LLM system evaluation How good the LLM performs in your specific use case, on your data, in your domain.
  5. MEAI.EVALUATION OVERVIEW • Open-source • Predefined LLM-based evaluators • Interface

    for custom-evaluators • Local and Azure Storage Account • In Preview
  6. EVALUATION RESULTS 0 10 20 30 40 50 60 70

    80 90 100 Provided source Applied source Retrieval Step - % of documents provided and applied Vector Search Hybrid Search
  7. EVALUATION RESULTS 0.8 0.948 0.914 0.85 0.948 0.917 0.7 0.75

    0.8 0.85 0.9 0.95 1 Relevance Groundedness Cosin Sim Answer Generation – Quality Metrics Vector Search Hybrid Search
  8. EVALUATION RESULTS 0 10 20 30 40 50 60 70

    80 90 100 Provided source Applied source Retrieval Step - % of documents provided and applied Vector Search Hybrid Search Hybrid Search with Summary Hybrid Search with Reranking
  9. EVALUATION RESULTS 0.8 0.948 0.914 0.85 0.948 0.917 0.86 0.945

    0.917 0.9 0.988 0.927 0.7 0.75 0.8 0.85 0.9 0.95 1 Relevance Groundedness Cosin Sim Answer Generation – Quality Metrics Vector Search Hybrid Search Hybrid Search with Summary Hybrid Search with Reranking
  10. IMPLEMENTATION • Not supported by Azure OpenAI - On your

    data • Derive Search Query • Using Azure AI Search SDK + Autogen
  11. EVALUATION – EXECUTION TIME 2889 1999 2966 0 500 1000

    1500 2000 2500 3000 3500 Azure OYOD - Hybrid Custom - Hybrid Multiagent (with Reranking) ms LLM system evaluation – Mean Execution Time
  12. EVALUATION RESULTS 0.86 0.88 0.9 0.92 0.94 0.96 0.98 1

    Relevance Groundedness Cos Sim LLM System evaluation – MultiAgent Single Agent MultiAgent
  13. SURVEY RESULTS 86% OF ANSWERS RATED POSITIVELY 89% OF THE

    USERS WANT TO USE THE SYSTEM PRODUCTIVELY
  14. CONCLUSION • SK is your SDK of choice • Azure

    AI Search for unstructured data • Use advanced capabilities • Start Evaluating early • What is your Use case • Business Value & Innovation The dotnet Stack is ready for productive AI Applications