Flexibility on Splunk infrastructure with Splunk SmartStore

Object Storage Overview

Why use Object Storage?
Object Storage addresses:
• Data storage scalability
• Cost of storing and accessing data
• Cost of managing storage infrastructure
• Different way to access data with the S3 API
• Stores metadata with data to facilitate intelligent data (do more with your data)
• Provides greater data protection capabilities
• Provides a single namespace regardless of geography
• Maximises data availability

Scalability - Block & File vs. Object Storage

Scale Out Object Storage & Use Cases
• RESTful API (S3)
• Single namespace
• Designed to be consumed by apps: OID1… OID2… OID3… etc.
Use cases:
• Backup & Archive
• Big Data Analytics
• File Collaboration
• Web Content Storage
• Storage as a Service

Object Storage
• Flat (non-hierarchical) system, which makes it scalable
• An object is a Binary Large Object (BLOB)
• A bucket is a collection of objects
• Controlled by user permissions
• Applications connect using the RESTful API
(Diagram: several S3 apps connecting to object storage)

Object Storage
• User views the data store through the S3 client
  o Buckets (similar to a LUN / file system)
  o Folders (an easy way to organize data)
  o Data objects (your data & metadata)
• Applications connect using the RESTful API
  o Endpoint - Bucket.hyperstore.com
  o S3 region endpoint - region.hyperstore.com
  o Access key - unique user-assigned key
  o Secret key - unique user-assigned key
• Object name includes the folder name
  Ex. Videos/2014/birthday/video1.wmv
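Because the object name includes the folder name, "folders" are only a naming convention on top of a flat set of keys. The following is a minimal Python sketch of that idea, not HyperStore or S3 client code: a plain dict stands in for a bucket, `list_folder` is a hypothetical helper, and every key other than the video1.wmv example from the slide is invented for illustration.

```python
def list_folder(bucket, prefix, delimiter="/"):
    """Emulate an S3-style prefix/delimiter listing over a flat key space.

    `bucket` is just a dict of key -> bytes (an assumption of this sketch).
    Returns (objects directly under the prefix, immediate sub-"folders").
    """
    objects, subfolders = [], set()
    for key in sorted(bucket):
        if not key.startswith(prefix):
            continue
        rest = key[len(prefix):]
        if delimiter in rest:
            # Deeper key: report only the next path component as a folder.
            subfolders.add(prefix + rest.split(delimiter, 1)[0] + delimiter)
        else:
            objects.append(key)
    return objects, sorted(subfolders)


# Flat key space; only the first key comes from the slide.
bucket = {
    "Videos/2014/birthday/video1.wmv": b"...",
    "Videos/2014/birthday/video2.wmv": b"...",
    "Videos/2015/holiday/video3.wmv": b"...",
}

top_objects, top_folders = list_folder(bucket, "Videos/")
birthday_objects, birthday_folders = list_folder(bucket, "Videos/2014/birthday/")
```

Listing "Videos/" yields no objects and two apparent folders, while listing "Videos/2014/birthday/" yields the two video objects. Nothing hierarchical is stored; the structure is derived from the key strings.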

Object Storage / Metadata Storage
• Data objects are stored on hard disk drives formatted with the ext4 filesystem
• Metadata is stored on mirrored SSD drives, along with boot, HyperStore OS & software
• Metadata provides the lookup to data

Anatomy of an Object (ex. X-Ray image)
BLOB - image file stored as jpg/tif, etc.
System metadata:
• Date - object creation date
• Content-Length - object size in bytes
• Last-Modified - creation date or last modified date, whichever is the latest
User-defined metadata:
• x-amz-meta-Patient: Homer
• x-amz-meta-Age: 50
• x-amz-meta-"Scan of": Brain
• x-amz-meta-Scanner: Xray 1
• x-amz-meta-operator: Bart
• …….
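User-defined metadata travels with the object as headers carrying the `x-amz-meta-` prefix. Below is a small Python sketch of how an application might turn a dict of attributes into such headers. `build_metadata_headers` is a hypothetical helper, not part of any S3 SDK. Lowercasing the names and replacing spaces with hyphens is an assumption of this sketch, made because HTTP header names cannot contain spaces.

```python
def build_metadata_headers(user_metadata):
    """Map user attributes to x-amz-meta-* header names (illustrative only)."""
    headers = {}
    for name, value in user_metadata.items():
        # Assumption of this sketch: normalise names to lowercase, no spaces.
        safe_name = name.strip().lower().replace(" ", "-")
        headers["x-amz-meta-" + safe_name] = str(value)
    return headers


# Values taken from the X-ray example on the slide.
xray_headers = build_metadata_headers({
    "Patient": "Homer",
    "Age": 50,
    "Scan of": "Brain",
    "Scanner": "Xray 1",
    "operator": "Bart",
})
```

The resulting dict could be passed as request headers when uploading the image, so that the descriptive attributes are stored alongside the BLOB.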

Metadata Search
• Find and visualize assets
• Search metadata with Elasticsearch
• Visualize data with Kibana
(Diagram: HyperStore stores user data with its metadata as a data "label"; a copy of the metadata is synced to Elasticsearch for data visualization. Examples: video surveillance, genomics.)

S3 Functionality
S3 is a modern storage protocol combining
• Connectivity
• Access control
• Data management
• Reporting & notifications
• Advanced functionality
• Tiering
• Encryption
• Billing
• Monitoring
• Notifications
• Replication
• Time-limited access
…. that can be used across the internet.
(Diagram: standard storage protocols vs. the S3 API)

Why S3 Compatibility Matters
1. Cloud storage standard established at Amazon; world’s largest object storage environment
2. MSPs worldwide adopting S3 for cloud storage services
3. Hundreds of applications now support S3, many more in development
S3 Compatibility = Investment Protection

Native S3 API - Why It Matters
HyperStore is not a gateway:
- More interoperable
- More scalable
Example: Multi-part upload
• Gateway architecture (competition): the OBJECT is split into PART 1, PART 2, PART 3, PART 4, … PART n, and every part passes through a gateway. Bottleneck. Lost in translation?
• Native S3 connect: the OBJECT is split into PART 1, PART 2, PART 3, PART 4, … PART n, and the load is distributed, with no translation.
Inherently interoperable & scalable architecture
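The multi-part upload example rests on one idea: an object is cut into numbered parts that can be sent independently and reassembled in order. Here is a minimal Python sketch of only the split and reassembly steps. It makes no network calls, and `split_into_parts` and `assemble` are hypothetical helper names. The tiny part size is for illustration; AWS S3 documents a 5 MiB minimum for every part except the last.

```python
def split_into_parts(data, part_size):
    """Cut `data` into (part_number, chunk) pairs, numbered from 1."""
    if part_size <= 0:
        raise ValueError("part_size must be positive")
    return [
        (offset // part_size + 1, data[offset:offset + part_size])
        for offset in range(0, len(data), part_size)
    ]


def assemble(parts):
    """Rebuild the object from parts that may arrive in any order."""
    ordered = sorted(parts, key=lambda part: part[0])
    return b"".join(chunk for _, chunk in ordered)


payload = bytes(range(10))
parts = split_into_parts(payload, 4)       # parts of 4, 4 and 2 bytes
rebuilt = assemble(list(reversed(parts)))  # arrival order does not matter
```

Because each part carries its own number, parts can be sent to different nodes in parallel. That is the property the slide contrasts with a gateway, which funnels every part through one translation point.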

On-Premises Storage is Essential
Why:
• Performance
• Security
• Compliance
• Cost of access
(Diagram: applications and on-prem storage in the data center)
Challenge: Multiple Storage Environments to Manage

Splunking with S2 and S3
Splunk SmartStore with S3 Object Storage (Cloudian HyperStore)

Splunk SmartStore (S2) 7.2.5
Next generation architecture
Goals
• Scale Splunk deployments to meet your needs
• Minimize data on local storage while maintaining fast indexing and search capabilities
SmartStore
• Decouples storage and compute layers
• Elastically scale compute on demand for search and indexing workloads
• Grow storage independently to accommodate retention requirements
• Cost savings with more flexible storage options
How?
• S3 support
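In practice, "S3 support" means pointing Splunk indexes at a remote S3 volume in indexes.conf. The Python sketch below renders such a fragment as text. The setting names (`storageType`, `path`, `remote.s3.endpoint`, `remotePath`) follow Splunk's indexes.conf documentation as I understand it and should be checked against the spec for your version. The bucket name, endpoint URL, volume name, and the `render_smartstore_conf` helper are all hypothetical. Credentials (the `remote.s3.access_key` and `remote.s3.secret_key` settings) are deliberately left out.

```python
def render_smartstore_conf(bucket, endpoint, volume="remote_store"):
    """Render an illustrative indexes.conf fragment for a remote S3 volume."""
    lines = [
        f"[volume:{volume}]",
        "storageType = remote",
        f"path = s3://{bucket}",
        f"remote.s3.endpoint = {endpoint}",
        "",
        "[default]",
        # $_index_name is expanded by Splunk, not by Python.
        f"remotePath = volume:{volume}/$_index_name",
    ]
    return "\n".join(lines) + "\n"


# Hypothetical bucket and on-prem S3 endpoint, for illustration only.
conf_text = render_smartstore_conf(
    bucket="splunk-smartstore",
    endpoint="https://s3.example.internal",
)
print(conf_text)
```

Setting `remotePath` under `[default]` applies the remote volume to every index. Placing it in individual index stanzas instead would enable SmartStore per index.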

Splunk indexer infrastructure for classic
• Indexer storage for classic is SSD and/or NVMe based storage
• Indexer storage is sized for Hot + Warm buckets per storage retention policy
• Hot + Warm buckets need data durability or replication
• Default replication setting: x3
• Cold buckets on block / file storage
• Replication used for data durability
(Diagram: indexer with Hot bucket and Warm bucket on NVMe / flash / high performance disks; NAS / SAN / JBOD)
© 2017 Cloudian, Inc.

Indexer infrastructure for SmartStore
• Indexer storage for S2 is also SSD and/or NVMe based storage
• Indexer storage is sized ONLY for Hot buckets per storage retention policy
• Warm buckets are moved to remote S3 storage and held temporarily in the SmartStore cache
• Hot buckets are replicated on the indexers
• Default replication setting: x3
• Warm buckets are moved to an inexpensive object storage tier
• Inbuilt data protection with either replication or erasure coding
(Diagram: indexer with Hot bucket and SmartStore cache; Warm buckets on remote storage)

Key Benefits
• Storage tier is no longer tied to indexer hardware
• Separation of storage and compute
• Indexer failures are no longer tied to storage failures
• Local storage is now simply a search cache
• No longer need to size local storage for the full retention period
  o It used to store 90 days, 6 months, 3 yrs, etc.
• Just need enough local storage for search (1-7 days)
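The sizing benefit can be made concrete with rough arithmetic. The Python sketch below is deliberately crude and all of its figures are invented. It assumes the daily volume is the on-disk size after compression, that classic local storage holds the full retention period multiplied by the replication factor, and that the SmartStore cache only has to cover the search window. It ignores hot-bucket replication and cache headroom.

```python
def classic_local_storage_gb(daily_gb, retention_days, replication_factor=3):
    """Cluster-wide local storage when every retained bucket lives on indexers."""
    return daily_gb * retention_days * replication_factor


def smartstore_local_cache_gb(daily_gb, search_window_days):
    """Cluster-wide local cache when only recently searched data stays local."""
    return daily_gb * search_window_days


# Hypothetical deployment: 100 GB/day on disk, 90-day retention, 7-day search window.
classic_gb = classic_local_storage_gb(daily_gb=100, retention_days=90)
cache_gb = smartstore_local_cache_gb(daily_gb=100, search_window_days=7)
print(f"classic: {classic_gb} GB local, SmartStore cache: {cache_gb} GB local")
```

Under these assumptions the local footprint drops from 27,000 GB to 700 GB. The retained data does not disappear: it moves to the object store, whose own replication or erasure coding overhead has to be sized separately.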

• Data protection, geo-distribution
• Hybrid / multi-cloud enabled
• Granular management, bucket level
• Start small and grow
• 100% native S3 API. Highest compatibility
• Encryption, access control, WORM

SpunkersDC
