Slide 1

Slide 1 text

Object Storage Overview

Slide 2

Slide 2 text

Why use Object Storage? Object Storage addresses; • Data Storage Scalability • Cost of storing and accessing data • Cost of managing storage infrastructure • Different way to access data with S3 API • Stores Metadata with data to facilitate intelligent data – do more with your data • Provides greater data protection capabilities • Provides single namespace regardless of geography • Maximises data availability 2

Slide 3

Slide 3 text

Scalability - Block & File vs. Object Storage 3

Slide 4

Slide 4 text

Scale Out Object Storage & Use Cases Restful API –S3 Single Namespace Backup & Archive Big Data Analytics File Collaboration Web Content Storage Storage As a Service 4 Designed to be consumed by Apps OID1… OID2… OID3… etc

Slide 5

Slide 5 text

Object Storage • Flat Hierarchical system – scalable • Object is a Binary Large Object (BLOB) • Bucket is a collection of objects • Controlled by user permissions • Applications connect using Restful API S3 App S3 App S3 App 5

Slide 6

Slide 6 text

• User views data store through the S3 client o Buckets (similar to LUN / file system) o Folders (easy way to organize data) o Data Objects (your data & metadata) • Applications connect using Restful API o Endpoint - Bucket.hyperstore.com o S3 Region endpoint – region.hyperstore.com o Access key - unique user assigned key o Secret key - unique user assigned key • Object name includes folder name Ex. Videos/2014/birthday/video1.wmv 6 Object Storage S3 App S3 App S3 App

Slide 7

Slide 7 text

Object Storage / Metadata Storage Data objects are stored on Hard Disk drives formatted with ext4 filesystem Metadata is stored on mirrored SSD Drives along with boot, HyperStore OS & Software Metadata provides look up to data Data Object 7

Slide 8

Slide 8 text

Anatomy of an Object (ex. X-Ray image) BLOB - Image file stored as jpg/tif, etc. System metadata; Date - Object creation date Content-Length - Object size in bytes Last-Modified - Creation date or last modified date, whichever is the latest User-defined metadata; x-amz-meta-Patient: Homer x-amz-meta-Age: 50 x-amz-meta-”Scan of”: Brain x-amz-meta-Scanner: Xray 1 x-amz-meta-operator: Bart ……. 8

Slide 9

Slide 9 text

9 Metadata Search User Data Metadata User Data Metadata User Data Metadata Metadata Synch Elastic Search Data Visualization • Find and visualize assets • Search metadata with Elastic Search • Visualize data with Kibana HyperStore Data “label” Copy of metadata Video Surveillance Genomics

Slide 10

Slide 10 text

S3 Functionality S3 is a modern storage protocol combining • Connectivity • Access control • Data management • Reporting & Notifications • Advanced Functionality • Tiering • Encryption • Billing • Monitoring • Notifications • Replication • Time limited access …. that can be used across the internet. Standard Storage Protocols S3 API 10

Slide 11

Slide 11 text

11 Why S3 Compatibility Matters 1. Cloud storage standard established at Amazon; world’s largest object storage environment 2. MSPs worldwide adopting S3 for cloud storage services 3. Hundreds of applications now support S3, many more in development S3 Compatibility = Investment Protection

Slide 12

Slide 12 text

12 HyperStore is not a gateway: - More interoperable - More scalable Example: Multi-part upload Inherently interoperable & scalable architecture Gateway Architecture OBJECT PART 1 PART 2 PART 3 PART 4 PART n … Gateway Gateway Competition Bottleneck Native S3 API – Why It Matters Lost in Translation? OBJECT PART 1 PART 2 PART 3 PART 4 PART n … Native S3 Connect Load is distributed, no translation

Slide 13

Slide 13 text

13 On-Premises Storage is Essential Why: • Performance • Security • Compliance • Cost of access APPLICATIONS ON PREM STORAGE DATA CENTER Challenge: Multiple Storage Environments to Manage

Slide 14

Slide 14 text

Splunking with S2 and S3 Splunk SmartStore with S3 Object Storage (Cloudian HyperStore)

Slide 15

Slide 15 text

Splunk SmartStore (S2) 7.2.5 Next generation architecture Goals • Scale Splunk deployments to meet your needs • Minimize data on local storage, while maintaining the fast indexing and search capabilities SmartStore • Decouples storage and compute layers • Elastically scale compute on-demand for search and indexing workloads • Grow storage independently to accommodate retention requirements • Cost savings with more flexible storage options How? • S3 support

Slide 16

Slide 16 text

Splunk indexer infrastructure for classic • Indexer storage for classic is SSD and/or NVME based storage • Indexer storage sized for Hot + Warm buckets per storage retention policy • Hot + Warm buckets need data durability or replication. • Default replication setting : x3 • Cold buckets on block / file storage • Replication used for data durability © 2017 Cloudian, Inc. 17 Indexer Hot bucket Warm bucket NVME / Flash / High performance disks NAS / SAN / JBOD

Slide 17

Slide 17 text

Indexer Indexer infrastructure for SmartStore • Indexer storage for S2 is also SSD and/or NVME based storage • Indexer storage sized ONLY for Hot buckets per storage retention policy • Warm buckets moved to remote S3 storage and temporarily in SmartStore cache • Replicated Hot buckets on indexers. • Default replication setting : x3 • Warm buckets moved to inexpensive object storage tier • inbuilt data protection with either replication or erasure coding © 2017 Cloudian, Inc. 18 Hot bucket SmartStore Cache Warm buckets

Slide 18

Slide 18 text

© 2017 Cloudian, Inc. 19 Key Benefits • Storage tier is no longer tied to indexer hardware • Separation of storage and compute • Indexer failures is no longer tied to storage failures • Local storage is now simply a search cache • No longer need to size local storage • used to store - 90days, 6 months, 3 yrs etc • Just need enough local storage for search (1- 7 days)

Slide 19

Slide 19 text

20 Data protection, Geo-distribution Hybrid / Multi-Cloud enabled Granular management, bucket level Start small and grow 100% Native S3 API. Highest compatibility Encryption, Access control, WORM

Slide 20

Slide 20 text

No content