Slide 1

Slide 1 text

Build Your Own Secured AI Platform with Google Cloud Vertex AI by unleashing the Power of Multi-LLM MONTRÉAL Eckarath Khounsombath

Slide 2

Slide 2 text

Business & Technology Architect Eckarath Khounsombath From Internet Pioneers to Cloud Innovators and AI Trailblazers

Slide 3

Slide 3 text

The Dawn of GenAI Why ? What ? How ? What’s Next Agenda

Slide 4

Slide 4 text

The Dawn of GenAI A Modern fairytale MONTRÉAL

Slide 5

Slide 5 text

We stand at the cusp of a technological revolution—the era of generative AI. GenAI is rapidly transforming industries and redefining what's possible. GenAI is unlocking unprecedented opportunities for businesses of all sizes

Slide 6

Slide 6 text

Samsung employees pasted sensitive data into ChatGPT to help into their day-to-day work MONTRÉAL

Slide 7

Slide 7 text

Samsung employees pasted sensitive data into ChatGPT to help into their day-to-day work MONTRÉAL Unawareness of how ChatGPT handles data (it's sent to OpenAI for training).

Slide 8

Slide 8 text

Samsung employees pasted sensitive data into ChatGPT to help into their day-to-day work MONTRÉAL Unawareness of how ChatGPT handles data (it's sent to OpenAI for training). Potential compromise of confidential information, reputational damage.

Slide 9

Slide 9 text

50 More than new LLMs models popup since ChatGPT appears

Slide 10

Slide 10 text

Every tech company is now an AI company.

Slide 11

Slide 11 text

Why ? A GenAI Governance MONTRÉAL

Slide 12

Slide 12 text

GenAI is transforming how businesses operate. To maximize benefits and minimize risks, enterprises must establish clear governance frameworks

Slide 13

Slide 13 text

1. Vision 2. Value 3. Risk 4. Adoption Build a GenAI Strategy GenAI needs to pivot from a futuristic novelty to practical, actionable business applications. Companies needs to capitalize on its capabilities to solve real-world business problems and achieve tangible outcomes with this new technology. A GenAI strategy must anchor their approach in four fundamental pillars 4 Fundamental Pillars

Slide 14

Slide 14 text

What ? GenAI - Use Cases MONTRÉAL

Slide 15

Slide 15 text

Common Use Cases AI Content Creation AI Content Discovery AI Conversational AI Simulations Writing/Text, Speech, Audio, Image, Video, Code Search, Analysis, Knowledge Management Virtual Assistant, Translation Prediction, Synthetic Data, Simulation

Slide 16

Slide 16 text

High Level Design Infrastructure DataLake LLM Mesh Observability Private LLM Custom LLM Vector Store Security Monitoring API GenAI Apps Internal Apps GenAI Studio Reporting GenAI Services API Discovery Content Conversation

Slide 17

Slide 17 text

Google Cloud Platform brings us the perfect backbone to build our GenAI Enterprise Platform.

Slide 18

Slide 18 text

How ? Google Cloud Platform fit it all MONTRÉAL

Slide 19

Slide 19 text

Low Level Design - Compute & Data Layer Infrastructure DataLake Use to deploy complex LLM model Use to deploy simplest model Use GPU/TPU to execute models Use data services for : ● Usage in GenAI ● Training/Augmenting LLM

Slide 20

Slide 20 text

Low Level Design - LLM Mesh LLM Mesh Private LLM Custom LLM Vector Store API Cloud API Vertex AI Model Garden CloudRun with GPUs CloudRun

Slide 21

Slide 21 text

Low Level Design - GenAI Services GenAI Services API Discovery Content Conversation Vertex AI Agent Builder Build your own RAG Vertex AI Agent Builder Build your own Chatbot/Assistant Google Translate Google AI Service Expose secured API for general usage LLM Mesh API

Slide 22

Slide 22 text

Low Level Design - Observability Observability Security Monitoring Reporting Cloud Monitoring Cloud Logging Cloud Trace Identity & Access Management Security Command Center BigQuery Pub/Sub LLM Mesh GenAI Services Google Data Studio

Slide 23

Slide 23 text

LLM Mesh Technical Design - Multi-LLMs Vertex AI Model Garden Mistral Anthropic LLama API / SDK CloudRun LLM API CloudRun Ollama Mistral (1) Use Model Garden to let VertexAI to manage LLM models deploiement for you. You will able to access to LLM model through a VertexAI Endpoint or with VertexAI SDK if it has been embedded (1’) Use CloudRun with GPUs option to deploy Ollama and available LLM. You will be able to access to LLM model through Ollama API. (2) Call dedicated API or SDK related to the selected LLM to answer the request Observability E 1 2 1 1’ 3 4 5 (5) Send all usage of Services by logging, sending request to PubSub/BQ, tracing, … API Generic LLM inference (1) Receive a request to find an answer with a specific LLM (4) Send generic answer (3) Send an answer in selected LLM format

Slide 24

Slide 24 text

GenAI Services (1) Use Agent Builder Datastore to RAGify your data to be interrogated. Technical Design - Knowledge Base w/ Agent Builder Agent Builder DataStore Web GCS, Drive Database API CloudRun GenAI Services API Agent Builder Search App (2) Use Agent Builder Search Application to create dedicated Knowledge following a list of datastore (2)Call Vertex AI Endpoint to use search app to get an answser from request (5) Send all usage of Services by logging, sending request to PubSub/BQ, tracing, … Observability WebUI Automation AgentBuilder DataStore & Search App can be create through Cloud API Limitation AgentBuilder DataStore & Search App can be only use with Google LLMs models E 4 2 1 1 2 3 5 (3) Send a formatted answer with sources Service Knowledge Base (1) Receive a request to find an answer in a specific Knowledge Base (4) Send an enrich answer with link to the sources

Slide 25

Slide 25 text

GenAI Services LLM Mesh Technical Design - Knowledge Base w/ Langchain CloudRun GenAI Services API (1) Use Cloud Storage to store (un)structured documents for your Knowledge Base Observability CloudRun (Jobs) Embeddings CloudSQL Vector Store Cloud Storage Documents E 1 2 3 5 4 5 1 2 4 3 6 (2) Use EventArc to trigger job to ingest documents received (3) Use Cloud Run Jobs to ingest documents using Langchain framework (read, chunk, embeddings) (4) Use LLM Mesh API to create embeddings with LLM models available (5) Use CloudSQL PostgreSQL with pgvector plugin to store embeddings vector (2) Embed the request (6) Send all usage of Services by logging, sending request to PubSub/BQ, tracing, … (4) Call specific LLM to generate an answer from request + documents chunk retrieved (1) Receive a request to find an answer in a specific Knowledge Base (5) Send an enrich answer with link to the sources (3) Use embeddings to do a vector search

Slide 26

Slide 26 text

High Level Design - Assistant Anatomy 1 2 3 Goals High level description of the goal the assistant intends to accomplish Instructions Tools A step-by-step execution instructions to accomplish target goal Assistant can use selected tools (API, documents, …) to generate response Agentic Framework

Slide 27

Slide 27 text

GenAI Services (1) Use Tools connector providing by Agent Builder to empower your assistan. Technical Design - Assistant w/ Agent Builder Agent Builder DataStore API CloudRun GenAI Services API Agent Builder Agent (2) Use Agent Builder Agent Application to create dedicated Knowledge following a list of datastore (2)Call Vertex AI Endpoint to use search app to get an answser from request (5) Send all usage of Services by logging, sending request to PubSub/BQ, tracing, … Observability WebUI Automation AgentBuilder Agent can be create through Cloud API Limitation AgentBuilder DataStore & Search App can be only use with Google LLMs models E 4 2 1 2 3 5 (3) Send a formatted answer with sources Service Knowledge Base (1) Receive a request to find an answer in a specific Knowledge Base (4) Send an enrich answer with link to the sources Function Calling Tools OpenAPI 1

Slide 28

Slide 28 text

What’s Next ? From Passive Tools to Proactive Agents MONTRÉAL

Slide 29

Slide 29 text

( 1 ) Key Takeaways GenAI Governance Establish governance frameworks to maximize GenAI benefits and minimize risks. ( 2 ) GenAI Platform Use a Multi-LLM approach to package GenAI services to be served in an API first paradigm ( 3 ) Google Cloud for GenAI Platform Leverage GCP services like Vertex AI, CloudRun, and LangChain to create a multi-LLM, knowledge-driven GenAI platform.

Slide 30

Slide 30 text

( 1 ) Towards Agentic AI GenAI (Generative AI) has rapidly evolved from narrow, constrained models to large, flexible language models. ( 2 ) These models are demonstrating increasingly advanced language understanding and generation capabilities. ( 3 ) But GenAI is poised to become even more "agentic" - gaining the ability to reason, plan, and take autonomous actions.

Slide 31

Slide 31 text

REMEMBER Every prompt we send creates a carbon footprint. Let's train our AIs to be smarter, not just bigger, and make every interaction count for a sustainable digital future.