Turbocharging AI Innovation: How AI Platforms Enable The Bulletproof Deployment of GenAI Use Cases. #SAG2024

qaware.de Turbocharging AI Innovation How AI Platforms Enable The Bulletproof
Deployment of GenAI Use Cases Mario-Leander Reimer © 2024 QAware

2 Mario-Leander Reimer Managing Director | CTO @LeanderReimer #cloudnativenerd #qaware
#gernperDude

Platform engineering is the discipline of designing and building toolchains
and workﬂows that enable self-service capabilities for software engineering organizations in the cloud-native era. Platform engineers provide an integrated product most often referred to as an “Internal Developer Platform” covering the operational necessities of the entire lifecycle of an application. https://platformengineering.org/blog/what-is-platform-engineering

“Too much cognitive load will become a bottleneck for fast
flow and high productivity for many teams.” ▪ Intrinsic Cognitive Load Relates to fundamental aspects and knowledge in the problem space (e.g. languages, APIs, frameworks) ▪ Extraneous Cognitive Load Relates to the environment (e.g. console command, deployment, configuration) ▪ Germane Cognitive Load Relates to specific aspects of the business domain (aka. „value added“ thinking)

An IDP and your platform engineers are key enablers for
high productivity of the stream-aligned DevOps teams. QAware | 5 ▪ Responsible to build and operation a platform to enable and support the teams in their day to day development work. ▪ The platform aims to hide the inherent complexity to reduce the cognitive load for the other teams. – Standardization (Compliance, Security, …) – Developer Self-Service ▪ Fully automated software delivery is the goal! https://hennyportman.wordpress.com/2020/05/25/review-team-topologies/

AI platform engineering is the discipline of designing and building
toolchains and workﬂows to provide self-service capabilities for data and AI driven organizations. Business experts, data engineers as well as software engineers work together in an integrated platform from now on referred to as an “Enterprise AI Platform” covering the operational necessities of the entire lifecycle of AI use cases. © 2024, M.-Leander Reimer

Endless Possibilities and Use Cases Chatbots, CWYD, Content Creation

The most common uses for GenAI tools are in marketing,
sales, product development and service operations. Source: https://www.mckinsey.com/capabilities/quantumblack/our-insights/the-state-of-ai-in-2023-generative-ais-breakout-year

Use Case: Customer Support Support Ressourcen Recherche Support Flow Customer
Data Images and Icons were generated with the assistance of AI RAG Automation Intent Recognition Text2Speech Speech2Text Anomaly Detection Similarity Matching AI Assistant Call Chat Multi Agent Workﬂow

RAG in a Nutshell. Index, e.g. Vector DB Indexing (Chunking
& Embedding) Documents Ingestion Phase Query Encoding Retrieval Phase Context Prompt LLM with world knowhow Response

From input to embedding: this is how a high-performance semantic
search using vector databases works. Embedding Model Images were generated with the assistance of AI { 23.567, 45.899, 76.345, …}

Chatbots and AI Assistants The more speciﬁc the use case,
the more complex it becomes. ChatGPT or comparable with world knowhow ChatGPT with organisational context knowledge Specialized AI Assistent ▪ Retrieval Augment Generation ▪ Transfer Learning ▪ Specially trained model ▪ Process automation Complexity Beneﬁt ▪ Easy to use and cost efficient ▪ Requires guidelines on data protection and compliance

Why do we need an AI platform?

The 80% Fallacy Juan Pablo Bottaro, LinkedIn Engineering Blog

Key challenges: models, tools and skills. Source: https://www.mckinsey.com/capabilities/quantumblack/our-insights/the-state-of-ai-in-2023-generative-ais-breakout-year

Each stakeholder involved has a different expertise and thus a
different focus. Domain Expert Software Engineers and Architects This image was generated with the assistance of AI Data Scientists, AI Experts Platform Engineers

It all starts with an understanding of data and use
cases. This is crucial for a structured start in any AI project. Business Understanding Data Understanding Data Preparation Modelling Evaluation Deployment

"But we are already doing this!" Really? MLOps only covers
part of the tasks related to GenAI. Source: https://neptune.ai/blog/mlops

Our proposal for an AI platform architecture

Integration & Delivery Plane Service Plane Platform Plane Resource Plane
Quality Plane Compliance Plane Foundation Foundation Interfaces Domain Service Domain Service Domain Service

Compliance Plane Integration & Delivery Plane Service Plane Data Plane
Platform Plane Observability Operability Resource Plane User Serving Plane Access Plane / APIs Orchestration Plane Data Modelling Plane Model Plane Compute Data Integration Security Delivery FinOps Quality Plane

Quality Plane Integration & Delivery Plane Service Plane Access Plane/APIs
User Serving Plane Orchestration Plane Data Modelling Pl. Data Plane Model Plane Compliance Plane Platform Plane Observability: Monito- ring, Logging, Tracing Security: Secrets, IAM Encryption, Certs, … Scale, Backups, Recovery, … Delivery: CI/DC, Registry Pipelines, Orchestrator, … FinOps Resource Plane Compute: CPU and GPU Data: Vector DBasS, other Storage, … Integration: Self-hosted LLMs Public LLMs Managed AI Services

User Serving Plane „Convenience UIs“, Self Service, RAG per Drag and Drop, … (a) LLM, Embedding, (b) RAG, Chatbot, … (c) Data Access, … Orchestration Plane Data Modelling Pl. Playground Prompt Engineering Konﬁguration Runtime, Instantiation, Orchestration, Scaling, Conﬁguration Data Plane Model Plane Compliance Plane Platform Plane Observability: Monito- ring, Logging, Tracing Security: Secrets, IAM Encryption, Certs, … Scale, Backups, Recovery, … Delivery: CI/DC, Registry Pipelines, Orchestrator, … FinOps Resource Plane Compute: CPU and GPU Data: Vector DBasS, other Storage, … Integration: Self-hosted LLMs Public LLMs Managed AI Services

User Serving Plane Technical and Business Metrics like Accuracy, Harmfulness, … Test Automation for LLMs „Convenience UIs“, Self Service, RAG per Drag and Drop, … (a) LLM, Embedding, (b) RAG, Chatbot, … (c) Data Access, … Orchestration Plane Data Modelling Pl. Playground Prompt Engineering Konﬁguration Runtime, Instantiation, Orchestration, Scaling, Conﬁguration Data Plane Ingestion Pipelines Data Versioning Embeddings & Vectorization Model Plane MLOps: Model Registry Model Management Experiment Tracking Model Serving Compliance Plane Tonality, Bias Security, Data Protection Platform Plane Observability: Monito- ring, Logging, Tracing Security: Secrets, IAM Encryption, Certs, … Scale, Backups, Recovery, … Delivery: CI/DC, Registry Pipelines, Orchestrator, … FinOps Resource Plane Compute: CPU and GPU Data: Vector DBasS, other Storage, … Integration: Self-hosted LLMs Public LLMs Managed AI Services

From concept to realisation: possible variants

All roads lead to Rome. Depending on the context, one
or other variant makes sense. Buy an AI platform solution Combination of cloud provider building blocks Custom platform with open source components

Azure AI Studio (Preview) Azure AI Content Safety Quality Plane
Integration & Delivery Plane Service Plane Azure API Management Access Plane Azure AI Studio (Preview) User Serving Plane Azure AI Studio (Preview) Semantic Kernel Orchestration Plane Azure AI Document Intelligence Data Modelling Pl. Azure AI Search with Indexers, Indices incl. Vector DBs. OneLake, Fabric Data Plane Azure OpenAI Azure Machine Learning Model Plane Azure AI Content Safety Compliance Plane Platform Plane Observability Security Scale, Backups, Recovery, … Delivery FinOps Resource Plane Compute Data Azure OpenAI Azure AI Language Speech Service Azure AI Translator Integration Overview on Azure AI Services: https://learn.microsoft.com/en-us/azure/ai-services/what-are-ai-services

Azure AI Studio (Preview) Azure AI Content Safety Quality Plane
Integration & Delivery Plane Service Plane Azure API Management Access Plane Azure AI Studio (Preview) User Serving Plane Azure AI Studio (Preview) Semantic Kernel Orchestration Plane Azure AI Document Intelligence Data Modelling Pl. Azure AI Search with Indexers, Indices incl. Vector DBs. OneLake, Fabric Data Plane Azure OpenAI Azure Machine Learning Model Plane Azure AI Content Safety Compliance Plane Platform Plane Observability Security Scale, Backups, Recovery, … Delivery FinOps Resource Plane Compute Data Azure OpenAI Azure AI Language Speech Service Azure AI Translator Integration Overview on Azure AI Services: https://learn.microsoft.com/en-us/azure/ai-services/what-are-ai-services Just give it a try. Or ask Azure experts.

mlflow, Evidently AI, RAGAS (for RAG), DeepEval (for LLM) Quality
Plane Integration & Delivery Plane Service Plane API Gateways Access Plane Build your own User Serving Plane Kubeflow Orchestration Plane Jupyter Kubeflow Data Modelling Pl. Weaviate, neo4J, … Custom Pipelines Data Plane mlflow (Registry) BentoML (Serving) Kubeflow (Serving) Model Plane Build your own Compliance Plane Platform Plane Observability Security Scale, Backups, Recovery, … Delivery FinOps Resource Plane Compute Data LLMs: Llama, Mistral, … mlflow BentoML Integration

mlflow, Evidently AI, RAGAS (for RAG), DeepEval (for LLM) Quality
Plane Integration & Delivery Plane Service Plane API Gateways Access Plane Build your own User Serving Plane Kubeflow Orchestration Plane Jupyter Kubeflow Data Modelling Pl. Weaviate, neo4J, … Custom Pipelines Data Plane mlflow (Registry) BentoML (Serving) Kubeflow (Serving) Model Plane Build your own Compliance Plane Platform Plane Observability Security Scale, Backups, Recovery, … Delivery FinOps Resource Plane Compute Data LLMs: Llama, Mistral, … mlflow BentoML Integration Use at your own risk! Or ask an AI platform expert.

Which one is right for me?

Start lean and agile! Tailored to the domain and problem
instead of ‘One size ﬁts all’. Use Case Identiﬁcation Business Understanding Skill, Resource & Requirements Analysis Building Block Mapping & Prioritization Implementation Evaluation Commoditization

Turbocharging AI Innovation: How AI Platforms E...

Turbocharging AI Innovation: How AI Platforms Enable The Bulletproof Deployment of GenAI Use Cases. #SAG2024

M.-Leander Reimer PRO

More Decks by M.-Leander Reimer

Other Decks in Technology

Featured

Transcript

qaware.de Turbocharging AI Innovation How AI Platforms Enable The Bulletproof

2 Mario-Leander Reimer Managing Director | CTO @LeanderReimer #cloudnativenerd #qaware

Platform engineering is the discipline of designing and building toolchains

“Too much cognitive load will become a bottleneck for fast

An IDP and your platform engineers are key enablers for

AI platform engineering is the discipline of designing and building

Endless Possibilities and Use Cases Chatbots, CWYD, Content Creation

The most common uses for GenAI tools are in marketing,

Use Case: Customer Support Support Ressourcen Recherche Support Flow Customer

RAG in a Nutshell. Index, e.g. Vector DB Indexing (Chunking

From input to embedding: this is how a high-performance semantic

Chatbots and AI Assistants The more speciﬁc the use case,

Why do we need an AI platform?

The 80% Fallacy Juan Pablo Bottaro, LinkedIn Engineering Blog

Key challenges: models, tools and skills. Source: https://www.mckinsey.com/capabilities/quantumblack/our-insights/the-state-of-ai-in-2023-generative-ais-breakout-year

Each stakeholder involved has a different expertise and thus a

It all starts with an understanding of data and use

"But we are already doing this!" Really? MLOps only covers

Our proposal for an AI platform architecture

Integration & Delivery Plane Service Plane Platform Plane Resource Plane

Compliance Plane Integration & Delivery Plane Service Plane Data Plane

Quality Plane Integration & Delivery Plane Service Plane Access Plane/APIs

Quality Plane Integration & Delivery Plane Service Plane Access Plane/APIs

Quality Plane Integration & Delivery Plane Service Plane Access Plane/APIs

From concept to realisation: possible variants

All roads lead to Rome. Depending on the context, one

Azure AI Studio (Preview) Azure AI Content Safety Quality Plane

Azure AI Studio (Preview) Azure AI Content Safety Quality Plane

mlﬂow, Evidently AI, RAGAS (for RAG), DeepEval (for LLM) Quality

mlﬂow, Evidently AI, RAGAS (for RAG), DeepEval (for LLM) Quality

Which one is right for me?

Start lean and agile! Tailored to the domain and problem

QAware GmbH | Aschauer Straße 30 | 81549 München |