Slide 1

Slide 1 text

AI for Data Infrastructure Sync Computing | 2024 Prepared for IT Press Tour Declarative Computing The future of automated cloud infrastructure

Slide 2

Slide 2 text

[Confidential] 01. 02. 03. 04. Agenda Introduction to Sync What is declarative computing? Introducing Gradient by Sync Looking ahead

Slide 3

Slide 3 text

[Confidential] Introduction to Sync Staff & Advisors From Founder’s story: Hailing from MIT / UC Berkeley with a background in high performance computing

Slide 4

Slide 4 text

Tech Partners ● Databricks Technology Partner ● NVIDIA Inception Program member ● Native integrations for Airflow, Azure Data Factory, NVIDIA RAPIDS, etc.

Slide 5

Slide 5 text

[Confidential] 01. 02. 03. 04. Agenda Introduction to Sync What is declarative computing? Introducing Gradient by Sync Looking ahead

Slide 6

Slide 6 text

[Confidential] The Resource Allocation Problem: Today Cost: $100 Runtime: 1 hour Latency: 300 ms Compute Resources

Slide 7

Slide 7 text

Business Problems with Today’s Architecture Can’t miss SLA deadlines Compute cost are too high Cannot tune at scale

Slide 8

Slide 8 text

[Confidential] Declarative Compute Resources Cost: $50 Runtime: 1 hour Latency: 100 ms Cost: Minimum Runtime: 1 hour Latency: 100 ms

Slide 9

Slide 9 text

Reinforcement ML Model for Cloud Infra Key Concept: Closed-loop feedback enables automatic tuning towards infrastructure goals

Slide 10

Slide 10 text

[Confidential] 01. 02. 03. 04. 05. Agenda Introduction to Sync What is declarative computing? Introducing Gradient by Sync How are we different? Looking ahead

Slide 11

Slide 11 text

Introducing Gradient by Sync Can’t miss our SLA deadlines Teams are relying on mission critical pipelines finishing on time- hitting SLA goals is critical Cannot tune at scale Platform teams struggle to meet business demands due to lack the Spark expertise to make changes confidently Customer’s Problems Gradient’s Solutions Visibility to make teams faster Gradient’s insights and visibility helps data engineers quickly diagnose problems and get to solutions faster Free data engineering time With automatic cluster tuning, data teams are free to focus on more business relevant tasks, freeing up precious time Databricks cost are too high Databricks is a high percentage of cloud spend with ample opportunity to save costs Maximize cloud computing performance Gradient’s ML model custom trains and tunes each cluster to reduce costs and improve efficiency

Slide 12

Slide 12 text

How is Gradient Different? Cannot scale As you scale your data infrastructure, platform teams struggle to tune clusters to meet business demands due to expertise, or the sheer volume of workloads running (e.g. you cannot manually tune 10K jobs) Alternatives Gradient Opinionated optimization Leverage years of research and millions of DBUs managed that have shaped opinions about the right metrics to monitor and power Gradient’s intelligent insights & custom optimizations. Advanced ML models Gradient’s self-improving ML algorithms were developed at MIT. They use closed-loop feedback to continue to improve Passive recommendations Lists of optimizations that might have an impact, can only take you so far. Active management of data infrastructure Gradient automates compute optimization with ML-powered optimizations, customized per workload

Slide 13

Slide 13 text

Gradient’s Goals Lower costs Faster runtimes Hit your SLAs Save time with automation

Slide 14

Slide 14 text

User results cost savings 2:1 ROI The average starting ROI customers see from Gradient 2x Faster runtimes, while hitting mission critical SLAs

Slide 15

Slide 15 text

User Results in Production

Slide 16

Slide 16 text

Gradient Usage

Slide 17

Slide 17 text

[Confidential] 01. 02. 03. 04. Agenda Introduction to Sync What is declarative computing? Introducing Gradient by Sync Looking ahead

Slide 18

Slide 18 text

[Confidential] Databricks is Just The Start

Slide 19

Slide 19 text

Solutions to Today’s Architecture SLAs met at scale Optimized compute costs Automated tuning at scale

Slide 20

Slide 20 text

AI for your data infrastructure