Slide 1

Slide 1 text

©2023 Databricks Inc. — All rights reserved 1 Frank Munz, Principal TM Engineer, Databricks / April 2024 Generative AI for Data Platforms Cutting to the Chase

Slide 2

Slide 2 text

©2022 Databricks Inc. — All rights reserved Hi, I am Frank! • Principal @Databricks. TMM for Data, Analytics and AI products • Large scale data & compute • Based in 🍻 ⛰ 🥨 󰎲 Munich • Formerly AWS Tech Evangelist, SW architect, data scientist, published author etc. • @frankmunz / LindedIn

Slide 3

Slide 3 text

©2023 Databricks Inc. — All rights reserved 10,000+ global customers $1.5B+ in revenue $4B in investment Inventor of the lakehouse & Pioneer of generative AI Gartner-recognized Leader Database Management Systems + Data Science and Machine Learning Platforms The data and AI company Creator of

Slide 4

Slide 4 text

©2023 Databricks Inc. — All rights reserved Confidential and Proprietary 4 Streaming Data

Slide 5

Slide 5 text

©2023 Databricks Inc. — All rights reserved Streaming Data ● Small sized data ● Continuously produced ● Expectation -> processed in time ● Programming paradigm ○ Right-time vs real-time 5

Slide 6

Slide 6 text

©2023 Databricks Inc. — All rights reserved Streaming Data Think “right-time” instead of “real-time” 6 Manually Continually Scheduled Latency Cost

Slide 7

Slide 7 text

©2023 Databricks Inc. — All rights reserved Confidential and Proprietary 7 Latency vs Throughput

Slide 8

Slide 8 text

©2023 Databricks Inc. — All rights reserved 8

Slide 9

Slide 9 text

©2023 Databricks Inc. — All rights reserved TPC-DS Benchark from Barcelona HPC Center 2.2x faster with Photon than previous record for DWH 9

Slide 10

Slide 10 text

©2023 Databricks Inc. — All rights reserved Confidential and Proprietary 10 But how about Latency?

Slide 11

Slide 11 text

©2023 Databricks Inc. — All rights reserved 11 Project Lightspeed https://www.databricks.com/blog/project-lightspeed-update-advancing-apache-spark-structured-streaming

Slide 12

Slide 12 text

©2023 Databricks Inc. — All rights reserved Subsecond Latency for Stateless Pipelines 12

Slide 13

Slide 13 text

©2023 Databricks Inc. — All rights reserved Confidential and Proprietary 13 Serverless

Slide 14

Slide 14 text

©2023 Databricks Inc. — All rights reserved SIMPLE and FAST EFFICIENT RELIABLE Serverless Compute for Data Platforms Serverless Compute Hands-off auto-optimized compute No knobs Fast startup For any practitioner Fully managed and versionless Paying only what you use Strong cost governance Secure by default Stable with smart fail-overs Storage Notebooks with Spark Pipelines AI Model hosting SQL DWH "Put your vendor T-shirts down" 14 multi-cloud

Slide 15

Slide 15 text

©2023 Databricks Inc. — All rights reserved Confidential and Proprietary 15 System Architecture

Slide 16

Slide 16 text

©2023 Databricks Inc. — All rights reserved Confidential and Proprietary Architecture 16

Slide 17

Slide 17 text

©2023 Databricks Inc. — All rights reserved Confidential and Proprietary Walk trough 17 ● Single Page App (S3) ● Kinesis Stream ○ JSON Structure ○ Kinesis Ingest with EFO ● Delta Live Tables (ETL) ● Spark Streaming Data Analytics ○ Histogram streaming data ○ Window-based aggregation ● Databricks Workflows ● Databricks SQL

Slide 18

Slide 18 text

©2024 Databricks Inc. — All rights reserved Databricks Data Intelligence Platform Use generative AI to understand the semantics of your data Data Intelligence Engine Open Data Lake (lake first approach: S3, ADLS, GCS) Databricks SQL Text-to-SQL Workflows optimized based on past runs Delta Live Tables Automated data qualility Mosaic AI Create, tune, and serve custom LLMs Unity Catalog Securely get insights in natural language Delta Lake with Delta UniForm Data layout is automatically optimized based on usage patterns

Slide 19

Slide 19 text

©2023 Databricks Inc. — All rights reserved Streaming ETL with Delta Live Tables Pipelines Python or SQL. STs for ingestion and MVs for transformation Bronze cloud_files CREATE STREAMING TABLE Use a short retention period to avoid compliance risks and reduce costs Avoid complex transformations that could have bugs or drop important data Retain infinite history Easy to perform GDPR and other compliance tasks CREATE MATERIALIZED VIEW Materialized views automatically handle complex joins / aggregations, and propagate updates and deletes. Silver/Gold Ad-hoc DML for GDPR / Corrections

Slide 20

Slide 20 text

©2023 Databricks Inc. — All rights reserved 20 Delta Live Tables ● Serverless Compute (zero compute settings) ● Streaming Ingest from Message Buses with SQL read_kafka(), read_kinesis(), … ● Incrementally computed Materialized Views Link to blog

Slide 21

Slide 21 text

©2023 Databricks Inc. — All rights reserved Building Blocks of Databricks Workflows 21 A unit of orchestration in Databricks Workflows is called a Job. Databricks Notebooks Python Scripts Python Wheels SQL Files/Queries Delta Live Tables Pipeline dbt Java JAR file Spark Submit Jobs consist of one or more Tasks Sequential Parallel Conditionals (Run If) Jobs-as-a-Task (Modular) Control flows can be established between Tasks. Jobs supports different Triggers DBSQL Dashboards Manual Trigger Scheduled (Cron) API Trigger File Arrival Triggers Table Triggers Continuous (Streaming) Preview Coming Soon

Slide 22

Slide 22 text

©2023 Databricks Inc. — All rights reserved Confidential and Proprietary 22 gen AI for Data Platforms

Slide 23

Slide 23 text

©2024 Databricks Inc. — All rights reserved Databricks Data Intelligence Platform Use generative AI to understand the semantics of your data Data Intelligence Engine Open Data Lake (lake first approach: S3, ADLS, GCS) Databricks SQL Text-to-SQL Workflows optimized based on past runs Delta Live Tables Automated data qualility Mosaic AI Create, tune, and serve custom LLMs Unity Catalog Securely get insights in natural language Delta Lake with Delta UniForm Data layout is automatically optimized based on usage patterns

Slide 24

Slide 24 text

©2023 Databricks Inc. — All rights reserved Confidential and Proprietary We’re infusing AI in our experiences AI-generated docs + semantic search in Catalog Explorer Databricks Assistant SQL to Dashboard Data Rooms (Project Genie)

Slide 25

Slide 25 text

©2023 Databricks Inc. — All rights reserved Confidential and Proprietary 25 Data Platforms for gen AI

Slide 26

Slide 26 text

©2023 Databricks Inc. — All rights reserved MosaicML Model Serving MosaicML Model Serving Vector Search MLflow AI Gateway Model Serving MLflow AI Gateway MLflow Evaluation MLflow Prompt Engg Generative AI Solutions Enable every architectural pattern Prompt Engineering and Chains Retrieval Augmented Generation (RAG) Fine-tuning Pre-training Unity Catalog | Lakehouse Monitoring Crafting specialized prompts to guide LLM behavior Combining an LLM with enterprise data Adapting a pre-trained LLM to specific data sets or domains Training an LLM from scratch Complexity / Compute-intensiveness

Slide 27

Slide 27 text

©2023 Databricks Inc. — All rights reserved Hallucination Lacking enterprise context Gen AI gone wrong

Slide 28

Slide 28 text

Model Serving Custom Models Foundation Models APIs External Models Deploy any model as a REST API with Serverless compute, managed via MLflow. CPU and GPU. Integration with Feature Store and Vector Search. Govern external models and APIs. This provides the governance of MLflow Deployments for LLMs, plus the monitoring and payload logging of traditional Databricks Model Serving. Databricks curates top Foundation Models and provides them behind simple APIs. You can start experimentation immediately, without setting up serving yourself. Databricks Model Serving Unified UI, API & SDK for managing all types of AI Models

Slide 29

Slide 29 text

©2024 Databricks Inc. — All rights reserved Built-in governance with permissions and lineage Automatically synchronizes streaming source data with vector db. No separate data pipelines Vector DB Serverless vector database for RAG

Slide 30

Slide 30 text

©2024 Databricks Inc. — All rights reserved Finetuning Finetune your LLM on your data Serverless: no need to reserve or pick GPUs Pick the data from Unity Catalog or from Huggingface Maintain control and ownership of the model. It is your Intellectual Property.

Slide 31

Slide 31 text

©2024 Databricks Inc. — All rights reserved Mosaic AI Training Up to 7X faster and cheaper training of large AI Models Simplified, scalable, and cost-effective training of large AI models. Train or fine-tune your own generative AI model with your data in your secure environment. Full control of your model and privacy of your data. Your data, your model, built in your secure environment.

Slide 32

Slide 32 text

©2024 Databricks Inc. — All rights reserved Databricks Marketplace Share data sets with notebooks, and OSS / proprietary AI models Based on OSS Delta Sharing One click Instant Access 32

Slide 33

Slide 33 text

©2024 Databricks Inc. — All rights reserved DBRX Databricks OSS LLM

Slide 34

Slide 34 text

©2024 Databricks Inc. — All rights reserved Model architecture ● Sparse Mixture-of-Experts (MoE) ● 4 of 16 experts for a given input Model training ● Pre-trained on 3072 NVIDIA H100s in 3 months. ● on Databricks Data Intelligence Platform, Notebooks, Jobs, etc. The models ● DBRX Base for fine-tuning ● DBRX Instruct for RAG chains ● 132B parameters ● 32k token context length License and data ● Open-source for commercial use ● Pretrained on publicly available 12T tokens ● Designed for enterprises Introducing DBRX’s details

Slide 35

Slide 35 text

©2024 Databricks Inc. — All rights reserved DBRX outperforms established open source models on language understanding (MMLU), Programming (HumanEval), and Math (GSM8K).

Slide 36

Slide 36 text

©2024 Databricks Inc. — All rights reserved DBRX outperforms GPT 3.5 on language understanding (MMLU), Programming (HumanEval), and Math (GSM8K).

Slide 37

Slide 37 text

©2024 Databricks Inc. — All rights reserved Hugging Face WildBench Leaderboard 37

Slide 38

Slide 38 text

©2024 Databricks Inc. — All rights reserved How can I try DBRX? Hugging Face Spaces Databricks FM API AI Playground labs.perplexity you.com, poe.com

Slide 39

Slide 39 text

©2023 Databricks Inc. — All rights reserved Confidential and Proprietary 39 Gen AI meets Data Platforms Data Intelligence Engine + Unified Governance -> Assistant, Intelligent Search, automated documentation, natural language queries and better scheduling, automated data quality

Slide 40

Slide 40 text

©2023 Databricks Inc. — All rights reserved Confidential and Proprietary 40 Data Platform meets gen AI "There is no good model with bad data"

Slide 41

Slide 41 text

©2023 Databricks Inc. — All rights reserved 41 New Databricks Demo Center databricks.com/demos Todays demo

Slide 42

Slide 42 text

©2023 Databricks Inc. — All rights reserved 42 Thank You! @frankmunz Please rate this presentation!