organization Eliminate data silos to accelerate innovation Multi-Workload Massive ingest bandwidth Mixed read/write handling Ultra-low latency Multi-Performant Across on-premises, cloud, or hybrid environments Easy data mobility between locations Multi-Location Easily scale along with project Up & down, elastically Without disruption or degradation Multi-Scale Data Source Data Source Data Source Data Source Data Source Data Source Data Platform
file and object performance • No tuning required Effortless Sustainability • Reduce carbon emissions • Cut data pipeline idle time • Extend the usable life of your hardware • Move workloads to the cloud. Seductive Simplicity • Eliminates storage silos across on-prem and cloud. • Single, easy-to-use data platform for whole pipeline Infinite Scale • Linear scale • Scale compute and storage independently • Trillions of files of all data types and sizes.
Industry-leading metadata performance Operational efficiency with storing billions of small files Read and write performance on small and large file operations
1PB WEKA 1PB Other vs 88x More Read IOPS 14x More Write Bandwidth 7x More Read Bandwidth 34x More Write IOPS WEKA Data Platform DDN Lustre AI400X2 IBM Spectrum Scale Pure Flashblade NetApp AFF800 Vast Data IOPS: 1PB of Storage More IOPS = Faster Training Traditional Parallel FS All Flash NAS & Cloud Native Storage Source: WEKA comparisons based on publicly available data from other vendors
Resources Single High-Performance Pool for Apps and Data Lower Costs and Energy 80% GPU Utilization 40x Data Performance Pushing the Frontier of Generative AI Combining LLM, text, images, video, and audio into immersive generative experiences
access 12X faster epoch times 2x faster modal training “ WEKA running in Cloud is game- changing for us. We’re able to run experiments in less than a week instead of three months or more. ” Jon Sorenson, PhD, VP of Technology Development, Atomwise
all types of IO. • Massive parallelization of all data and metadata operations • Use of kernel bypass technologies • Patented data layout, protection and more • Software defined allows it to run on-prem or in the cloud • Flexibility to use any hardware (physical or virtual) and utilize latest clients • Rapid adoption of new HW as needed • Performance Benchmarking strategy • Use published, audited, repeatable and transparent benchmarks to show performance leadership • Performance may be raw #, synthetic metric ($/IOP, #jobs/TB, #cores/GB/sec, etc), efficiency, and more • Mix On-prem with cloud benchmarks to showcase flexibility Confidential: Under Embargo Until March 14.
and 16x d64sv3 clients Show efficiency in Azure for AI workloads Beat Qumulo SPEC_ai run by 175% in raw performance, while only being 64% of the infrastructure cost. When a latency factor is put in, WEKA effectively can do 2.5x the number of jobs in the same time that Qumulo takes, and our effective cost per job is only 25% of Qumulo. SPEC_ai_image AWS 40x i3en.24xlarge backends and 40x c5n.18xlarge clients Dominate Qumulo result, #1 position in category 6x higher load count than Qumulo. Infrastructure costs impacted the effective cost per job, but still only76% the cost per job of Qumulo. Big or small, in multiple clouds, WEKA is faster and has a better cost-per-job than a competitor. #1 result SPEC_eda_blended AWS 40x i3en.24xlarge backends and 40x c5n.18xlarge clients Beat NetApp on-prem result. #1 position in category NetApp:6300 jobs at 1.39ms ORT. WEKA in AWS: 6310 jobs at 0.87ms ORT. WEKA is 60% faster response time. In the cloud. Against the NetApp fastest 8-node on-prem system they've got (A900 NVMe). Effective result is that WEKA can process over 10,000 jobs in the same time they can do 6300. #1 result Confidential: Under Embargo Until March 14.
ends and 40x c5n.18xlarge clients #1 position in category WEKA still owns the #1 spot from 2 years ago. (8000 streams) We beat it with 12000 streams. On-prem or in cloud, WEKA is the highest performing video platform around. #1 result SPEC_genomics AWS 40x i3en.24xlarge back ends and 40x c5n.18xlarge clients #1 position in category No direct competitor in category, so take over #1 spot from niche DAS player (UBIX technology). 2200 jobs achieved for the #1 result SPEC_swbuild AWS 40x i3en.24xlarge back ends and 40x c5n.18xlarge clients Beat NetApp, #1 position in category Raw, WEKA achieved 3500builds with a ORT of 0.74ms. So, #2 overall. BUT WAIT... NetApp 8-node A900 NVMe system did 6120 builds at 1.58ms ORT. WEKA's advantage of being ½ the latency means an effective 7472 builds in the same time. So, we have an effective #1 result. Confidential: Under Embargo Until March 14.
AI workloads, Ability to scale up for AI workloads as needed § Overall performance against ANY workload with no ongoing tuning (Placement groups, number of FE’s) § Metadata/mixed IO/latency performance matters: It’s not just about throughput § Cloud Performance that can beat On-Prem § #1 in all SPEC categories, whether it's a raw number or effective# • Futures • STAC-M3, ML-Perf, STAC-ML, IO-500 and others • Mix of cloud and on-prem benchmarking including WEKApod base config. • Transparent, publicly documented and repeatable synthetic results (FIO, El Bencho, VDbench, etc.) Confidential: Under Embargo Until March 14.
(GDS) One of the first NVIDIA DGX BasePOD-certified datastores Reference architecture for NVIDIA DGX BasePOD with DGX H100 Systems WEKApod certified for DGX SuperPOD with DGX H100 Systems + 2019 NVIDIA Invests in WEKA Series C Round Confidential: Under Embargo Until Made Public (WEKA TO CONFIRM)
Certified for NVIDIA DGX SuperPOD™ Systems Integrated with NVIDIA Base Command Manager From 8 to hundreds of nodes Simplifies the WEKA experience Confidential: Under Embargo Until Made Public (WEKA TO CONFIRM)
765 GB/s 18.3M IOPS +0.5PB* +382 GB/s +9.1M IOPS Best-in-Class Performance *Usable capacity with 5+2 striping and 1 virtual hot spare Confidential: Under Embargo Until Made Public (WEKA TO CONFIRM)
4.8M IOPS 7.0 kW draw 1.0 TB/s W BW 25.0M IOPS 11 Rack Units 2 Racks 80 Rack Units Same Bandwidth 5x More IOPS 7x Less Rack Space 8.4X Less Power Draw 1/4 Rack Confidential: Under Embargo Until Made Public (WEKA TO CONFIRM)
28.8M IOPS 25.4 kW draw 1.0 TB/s W BW 91.3M IOPS 40 Rack Units 10 Racks 420 Rack Units 1 Rack Same Bandwidth More IOPS 10x Less Rack Space 12X Less Power Draw Confidential: Under Embargo Until Made Public (WEKA TO CONFIRM)
Scale World’s Fastest AI Infrastructure The WEKA Data Platform on WEKApod Performant infrastructure for highly parallelized workloads On-Premises, Cloud, Hybrid, GPU Cloud, and Turnkey Solutions Better power efficiency, GPU efficiency Certified high performance datastore Confidential: Under Embargo Until Made Public (WEKA TO CONFIRM)