Slide 1

Slide 1 text

1 © 2024 | www.HAMMERSPACE.com GPU Data Orchestration Accelerating Access to S3 Data IT Press Tour, June 2024 Announcement date: June 12 @5am PT

Slide 2

Slide 2 text

2 © 2024 | www.HAMMERSPACE.com Extreme Parallel Performance to Feed GPUs Assemble Data from Disparate Sources Multi-Site, Hybrid-Cloud, Multi-Cloud Agility Efficiently Move Data to GPU Resources Automate Pipelines and Workflows Simplify Data Governance and Security Hammerspace Global Data Platform Hyperscale NAS Architecture Data-in-Place Assimilation Global File System Data Orchestration Programmable Metadata Advanced Data Services Deep Learning Generative AI Scientific Computing Video and Image Rendering Data Analytics LLM Training

Slide 3

Slide 3 text

3 © 2024 | www.HAMMERSPACE.com Building on Record Business Momentum in 2023 January: Added support for data on tape February: Unveiled Hyperscale NAS Architecture March: Meta publishes details on their use of Hammerspace as part of their Gen AI architecture April: Introduced erasure coding June: GPU Data Orchestration for S3 Applications

Slide 4

Slide 4 text

4 © 2024 | www.HAMMERSPACE.com Pain Points and Barriers to AI Success Data Issues1 • Siloed in disparate locations • Difficulties assembling large data sets • Data governance challenges 1Source: NVIDIA 2024 State of AI Surveys across various industries Poor Tech Infrastructure1 Existing file and object storage systems lack performance to feed GPUs GPUs Not Close to Data 49% of companies expect to run AI projects both in-cloud and on-prem by 20251

Slide 5

Slide 5 text

5 © 2024 | www.HAMMERSPACE.com Massive Need for Data Orchestration to GPUs “Spend needs to be on mobilizing the data to where the GPU's are…”, Anthony Robins, NVIDIA VP Federal at the NVIDIA AI Summit

Slide 6

Slide 6 text

6 © 2024 | www.HAMMERSPACE.com S3 Interface Enables New AI Pipelines and Workflows Primary Datacenter Hammerspace Global Data Platform (Single namespace, global parallel file system, automated data orchestration) Ingest Data via S3 S3 Remote Site Data Users / AI Engineers SMB NFS NFS4.2 NAS 1 COTS Storage Cloud Storage NAS 3 NFS 4.2 Local GPU Data Processing Cloud Datacenter Cloud GPU Resources NFS 4.2 Ingest from S3 Endpoints S3 Edge Sites

Slide 7

Slide 7 text

7 © 2024 | www.HAMMERSPACE.com Primary Datacenter Ingest Data via S3 S3 Remote Site Data Users / AI Engineers SMB NFS NFS4.2 NFS 4.2 Local GPU Data Processing Cloud Datacenter Cloud GPU Resources NFS 4.2 How It Works: Data-in-Place Assimilation NAS 1 Cloud Storage NAS 3 Metadata Layer Data Layer Data stays in place Files visible in seconds Non-Disruptive COTS Storage Ingest from S3 Endpoints S3 Edge Sites

Slide 8

Slide 8 text

8 © 2024 | www.HAMMERSPACE.com Primary Datacenter Ingest Data via S3 S3 Remote Site Data Users / AI Engineers SMB NFS NFS4.2 NFS 4.2 Local GPU Data Processing Cloud Datacenter Cloud GPU Resources NFS 4.2 Metadata Layer Data Layer How It Works: Global File System & Data Orchestration NAS 1 Cloud Storage NAS 3 Spans sites and clouds Data movement is transparent File-granular, Objective-Based Data Policies Single data set COTS Storage Ingest from S3 Endpoints S3 Edge Sites

Slide 9

Slide 9 text

9 © 2024 | www.HAMMERSPACE.com Definition | Data Orchestration WHERE Data Mover HOW Resource Allocation & Optimization WHAT Service Level Objectives WHEN Business Logic

Slide 10

Slide 10 text

10 © 2024 | www.HAMMERSPACE.com Creating the Objectives

Slide 11

Slide 11 text

11 © 2024 | www.HAMMERSPACE.com Add One or More “If” Statements to Build the Objective GUI-based slider for simple time-based tiering Creating the Objectives | Putting Metadata to Work

Slide 12

Slide 12 text

12 © 2024 | www.HAMMERSPACE.com Hammerscript Enables More Detailed Business Rules Advanced Objectives | Hammerscript

Slide 13

Slide 13 text

13 © 2024 | www.HAMMERSPACE.com Automated Data Profiling

Slide 14

Slide 14 text

14 © 2024 | www.HAMMERSPACE.com Real-Time Monitor to See If Data & Storage are in Alignment with Policy Objectives Ensuring Data and Storage are Aligned with Objectives

Slide 15

Slide 15 text

15 © 2024 | www.HAMMERSPACE.com Visualization of Data Mobilities

Slide 16

Slide 16 text

16 © 2024 | www.HAMMERSPACE.com Use Case: Orchestrating Data for Edge Computing

Slide 17

Slide 17 text

17 © 2024 | www.HAMMERSPACE.com Use Case: Orchestrating Data for AI Factory Data from Multiple Sources Parallel Global File System Metadata-Driven AI Workflow Objective-based Data Placement Automated Data Orchestration

Slide 18

Slide 18 text

18 © 2024 | www.HAMMERSPACE.com S3 Interface Details and Discussion • Market Perspective • Many customers have developed applications that use S3 interface • Estimated 100+ Exabytes of data created and stored with S3 today (100 trillions+ objects with estimate file size of 1MB) • Pricing • No added cost, part of standard license • Availability • Early access – customers are already using today • Late 2024 – GA Release

Slide 19

Slide 19 text

19 © 2024 | www.HAMMERSPACE.com Thank you

Slide 20

Slide 20 text

20 © 2024 | www.HAMMERSPACE.com Hammerspace Global Data Platform Global Data Platform Parallel Global File System Scale linearly from a few nodes to thousands Extreme performance for mixed I/O Spans multiple locations: edge-core-cloud Automatically move data non- disruptively Standards-based enterprise NAS features and RAS Assimilate metadata from existing storage Software-defined and storage agnostic Harness metadata to unlock value Data Orchestration Automate data placement and management across silos, sites, and clouds Hyperscale NAS Parallel file system performance with NAS simplicity

Slide 21

Slide 21 text

21 © 2024 | www.HAMMERSPACE.com The Most Options for High-Performance Workloads Data Processing Data Consumption and Visualization Hammerspace Global Data Platform Spans Sites and Clouds Build Hyperscale NAS clusters with standard Linux servers or third-party storage Build high-performance, highly- resilient storage clusters on commodity hardware with Hammerspace erasure coding Support for third party storage: NAS, Block, Object, Cloud, Tape

Slide 22

Slide 22 text

22 © 2024 | www.HAMMERSPACE.com Migrate Data to Commodity Hardware Non-Disruptively 1 Import existing metadata into Hammerspace using Assimilation. Data stays in place, users can access files in minutes. 3 Decommission old storage, all without disrupting user access. Users now read/write data from/to the new storage. 2 Set an Objective to copy data from old storage to new storage Process happens in the background, concurrent with file access, and is transparent to users.

Slide 23

Slide 23 text

23 © 2024 | www.HAMMERSPACE.com About Hammerspace Erasure Coding • Implemented via a Hammerspace EC-Group • Based on Mojette Transform erasure code • Up to 2x faster than traditional EC schemes • Standards-based components and protocols • Software defined using x86 servers • HDD and/or flash storage Storage Servers (4+) with HDD and/or Flash NFS Metadata Metadata Parallel Inter-Node Communication Start with at least 4 nodes Scale UP by adding drives Scale OUT by adding nodes Expansions are non-disruptive Many architectures are possible

Slide 24

Slide 24 text

24 © 2024 | www.HAMMERSPACE.com • 2x Reed-Solomon EC performance • No performance penalty on failure • 4KB block size for low latency • Highly parallel operation • No single point of failure • Full node failure(s) supported • Up to ¼ of the drives may fail without data loss • Self healing (from drives to nodes) • Additional data integrity protection via CRC Erasure encoding is a key component of modern storage systems, but it can be slow. The Mojette Transform boosts performance instead of throttling it Performance Resilience About the Mojette Transform Erasure Code