20+ End User Trials v1.0 GA v2.0 GA v3.0 GA Series A = $15M Exits Stealth- Mode Series B = $40M Series A1 = $25M Record- Breaking Y1 Series C = $100M $1.2B Valuation C O N F I D E N T I A L O V E R V I E W
Mike Wing President Shachar Feinblit VP, R&D and Co-Founder Avery Pham VP, Operations Jeff Denworth VP, Products & Co-Founder C O N F I D E N T I A L O V E R V I E W
6 First Full Year Performance Gross Profit Revenue z z z Customer Adoption Dozens Across 4 Continents Average Customer Y1 Spend $1,020,000 Selling Like a Unicorn, Spending Like a Camel Clear Path to Breakeven; $140M of Cash in Bank C O N F I D E N T I A L O V E R V I E W
DATA Write at 3D XPoint speeds, Read at TB/s, Ms of IOPS ALL-NVMe PERFORMANCE exceeds the capacity needs of any size organization EXABYTE-SCALE FILE & OBJECT STORAGE Engineered at every level to deliver unrivaled system efficiency TIER 5 COST EFFICIENCY C O N F I D E N T I A L O V E R V I E W
DATA UNIVERSAL STORAGE 3D XPoint QLC Flash NVMe Enclosure N V M E F A B R I C : C O M M O D I T Y E T H E R N E T O R I N F I N I B A N D CLIENTS: NFS, NFSoRDMA, SMB, S3, K8S CSI VAST Storage Servers (Containers) 3D XPoint QLC Flash NVMe Enclosure 3D XPoint QLC Flash NVMe Enclosure 3D XPoint QLC Flash NVMe Enclosure G L O B A L N A M E S P A C E 1 4 C O N F I D E N T I A L O V E R V I E W
1 0 µ s D I S T A N C E T O S T O R A G E S Y S T E M S T A T E 3D XPoint QLC Flash NVMe Enclosure 3D XPoint QLC Flash NVMe Enclosure 3D XPoint QLC Flash NVMe Enclosure G L O B A L N A M E S P A C E 1 5 A FOUNDATION FOR GLOBAL ALGORITHMS global namespace | global flash translation | global data protection | global data reduction STATELESS ARCHITECTURE, ELIMINATES EAST-WEST TRAFFIC no cache coherency challenges | no batteries | no rebuilds during server failure | docker-based auto-scaling C O N F I D E N T I A L O V E R V I E W
Enclosure 3D XPoint QLC Flash NVMe Enclosure 3D XPoint QLC Flash NVMe Enclosure 3D XPoint QLC Flash NVMe Enclosure G L O B A L N A M E S P A C E 1 6 One Storage System To Consolidate Noisy Neighbors & Islands of Storage S E G M E N T B Y A P P L I C A T I O N O R N E T W O R K S E G M E N T B Y A P P L I C A T I O N O R N E T W O R K C O N F I D E N T I A L O V E R V I E W
400TB and 675TB Options 4 x 100Gb NVMeF Connectivity 2U of Space NVMe Fabric 100Gb Ethernet or InfiniBand VAST Container Servers 4 Servers in 2U Chassis Ethernet and InfiniBand Connectivity Capacity-Optimized vs. HDD Options: - Scale-Out NAS: 6.7PB - VAST Data @2:1: 23PB (2.4x more dense) Performance-Optimized vs. Flash Options: - Scale-Out NAS: 2.3PB - VAST Data @2:1: 11PB (3.8x more dense) C O N F I D E N T I A L O V E R V I E W
NAMESPACE 1 9 SMB S3 NFS tbd Protocol Layer Attributes: Rich, Shared-Everything Metadata Structures POSIX & S3 metadata, multi-protocol mappings, links, lock state, snap state, keys Data store: Capacity-Efficient a byte-granular, thin-provisioned, sharded, infinitely-scalable data store Element Store C O N F I D E N T I A L O V E R V I E W
A L O V E R V I E W 2 0 L E G A C Y S H A R E D - N O T H I N G S T O R A G E THE PROBLEM WITH STATEFUL PROTOCOLS legacy NAS systems do not effectively mirror SMB state across all controllers locks, leases in DRAM
A L O V E R V I E W 2 1 L E G A C Y S H A R E D - N O T H I N G S T O R A G E THE PROBLEM WITH STATEFUL PROTOCOLS “with isilon, because it takes so long to do the upgrades in a rolling manner it is just hard to say for the next two days your smb connection may drop at any time” - customer quote from product requirements gathering session
HA Enclosure QLC Flash HA Enclosure C O N F I D E N T I A L O V E R V I E W 2 2 N V M e F a b r i c G L O B A L N A M E S P A C E INTRODUCING: RESILIENT SMB stateless 3D XPoint 3D XPoint 3D XPoint 3D XPoint metadata
A L O V E R V I E W 2 3 3D XPoint QLC Flash HA Enclosure N V M e F a b r i c 3D XPoint QLC Flash HA Enclosure 3D XPoint QLC Flash HA Enclosure 3D XPoint QLC Flash HA Enclosure G L O B A L N A M E S P A C E INTRODUCING: RESILIENT SMB State, locks, leases: fail over in microseconds. Upgrades are simple. Uptime is mastered.
of Tradeoffs GAME CHANGING STORAGE INNOVATIONS 2 5 Global Erasure Codes Global Data Reduction QLC Flash Saves 85% Save Up To Another 66% Unprecedented Efficiency MLC/ TLC QLC COST C O N F I D E N T I A L O V E R V I E W
64KB Erase Block = 200MB #1 4K Write Consumes 64K Page #2 Garbage Collection #3 Short lived data ages out #4 More Garbage Collection +16x +2x +2x… C O N F I D E N T I A L O V E R V I E W 2 6
E R C O N T R O L L E R C O N T R O L L E R C O N T R O L L E R DI ST RI B UT E D, P E RSI ST E NT XP OI NT B UFFE R 100x LARGER THAN DRAM C O N F I D E N T I A L O V E R V I E W
O N F I D E N T I A L O V E R V I E W 2 9 F L A S H B U F F E R X P O I N T X P O I N T X P O I N T X P O I N T • Shared Nothing Designs • Small, Volatile Write Caches • Reed-Solomon Historic Barriers to Wide Striping VAST codes accelerate rebuild speed by using a new type of algorithm that gets faster with more redundancy data. • 36+4: Much faster than classic RAID, more resilience, 9% overhead • 146+4: 60M years of mean-time-to-data-loss, only 2.6% overhead VAST’s Next-Generation Locally-Decodable Erasure Encoding
O N F I D E N T I A L O V E R V I E W 3 0 Data is fingerprinted in large blocks after the write is persisted in SCM Fingerprints are compared to measure relative distance, similar chunks are clustered Clustered data is compressed together; byte-level deltas are extracted & stored DELTAS REFERENCE subsequent reads are serviced within 1ms using locally decodable compression algorithms
3:1 SEARCH 4:1 BACKUP APPLIANCES 3:1 pre-compressed and pre-deduplicated pre-compressed pre-compressed pre-compressed un-compressed HPC 3:1 MARKET DATA 8:1 pre-compressed BACKUP 20:1 C O N F I D E N T I A L O V E R V I E W
D E N T I A L O V E R V I E W 3 6 Up to 200GB/s per Client (100x Faster Than. TCP NFS) Linear Scaling to Exabytes NAS (over RDMA) Simplicity 80% Less Cost N O T R A D E O F F S THE NEW AI STORAGE PARADIGM T O D A Y ’ S S H A D O W A I O P T I O N S SCALE-OUT NAS Simple & Versatile, But Slow HPC FILE SYSTEMS Fast & Scalable, But Complex Both Are Prohibitively Expensive for All-Flash
A L O V E R V I E W 3 7 DISAGGREGATION IS THE PATH TO EMBARASSINGLY PARALLEL SCALE ONLY VAST IS TRULY LINEAR N V M e F A B R I C G L O B A L N A M E S P A C E
EFFECTIVE COST/PB HDD Flash C O N F I D E N T I A L O V E R V I E W 3 8 PIONEERING RADICAL FLASH SAVINGS VAST ALL-FLASH IS IDEAL FOR LAUNCHING NEXT-GEN AI INITIATIVES $2M $760K $400K Universal Storage brings flash TCO in line with tiered storage approaches to de-risk customer storage decisions all while freeing up HW capital for GPUs. 200TB @ $3.5/GB 800TB @ $0.2/GB QLC + 2.5% Erasure Codes + Similarity-Based Data Reduction Our formula for compounded flash savings: C O N F I D E N T I A L O V E R V I E W 3 8
N I V E R S A L S T O R A G E : O N E P L A T F O R M F O R A L L D A T A C E N T E R D A T A Secondary Storage Big Data & AI Vertical Solutions Genomics Finance Content HPC Enterprise Infrastructure C O N F I D E N T I A L O V E R V I E W
I D E N T I A L O V E R V I E W 4 1 A I • Q U A N T I T A T I V E T R A D I N G • L I F E S C I E N C E S • M E D I A • S E A R C H A N I M A T I O N & V F X • B A C K U P • C L U S T E R C O M P U T I N G • C O N T A I N E R S
beyond the hard drive era, software-enabled storage architectures helps us modernize our scientific agenda and enable AI- driven research with the power of flash.” Jose Arrieta, CIO of HHS “We invest in many cutting-edge technologies, which includes VAST Data’s Universal Storage platform, to support our most intensive computing efforts, for hundreds of researchers on a global scale" Olivier Delahaye, CTOof Squarepoint “VAST provides Zebra a solution to all of our A.I. storage challenges by delivering performance superior to what is possible with traditional NAS while also providing a simple, scalable appliance that requires no effort to deploy and manage.” Eyal Toledano, CTO of Zebra Medical ““VAST Data provides Ginkgo the potential to ride the declining cost curve of flash while also providing near- infinite scale.” Austin Che, CTO of Ginkgo Bioworks C O N F I D E N T I A L O V E R V I E W
(TBu @ 2:1) Random Writes (GB/s) Random Read 2019 Enclosure (GB/s) Random Read 2020 Enclosure (GB/s) IOPS (4K random Read) 1x1 1.1PBu 5 20 40 225K 2x2 2.3PBu 10 40 80 500K 3x3 3.5PBu 15 60 120 750K 4x4 4.6PBu 20 80 160 1M 5x5 5.7PBu 25 100 200 1.25M 6x6 6.8PBu 30 120 240 1.5M 7x7 7.9PBu 35 140 280 1.75M 8x8 9.1PBu 40 160 320 2M 9x9 10.3PBu 45 180 360 2.25M At 20GB/s per enclosure, NIH saw linear scale to 9 of VAST’s 2019 enclosures in a single cluster C O N F I D E N T I A L O V E R V I E W
A L O V E R V I E W 4 4 18TB 3D XPoint 675TB QLC Flash HA Enclosure Render Servers 7x faster than Isilon Character Cache Servers 2ms or less, Same as Avere AI GPU Machines Unrivaled NAS Throughput – 5x Faster than Isilon
A L O V E R V I E W 4 5 ingest editing enterprise Universal Storage in Action 18TB 3D XPoint 675TB QLC Flash HA Enclosure 18TB 3D XPoint 675TB QLC Flash HA Enclosure 18TB 3D XPoint 675TB QLC Flash HA Enclosure
A L O V E R V I E W 4 6 I N I T I A L O R D E R 10 PB ($Ms) A D D I T I O N A L U P S I D E 2,000+ PB (SW) U N S E A T I N G NetApp in one their largest accounts V A S T A D V A N T A G E DAS Economics. Appliance Simplicity. Exabyte Scale. C A S E S T U D Y LEADING US TELCO