At Scale, Everything is Hard

Talk presented at dotScale in Paris. It's an exploration of what we've learned over the course of developing InfluxDB and scaling the company. I also go into the architecture of our upcoming 2.0 cloud offering, which is multi-tenant and built on top of Kubernetes.


Paul Dix

June 01, 2018

Transcript

  1. At Scale, Everything is Hard Paul Dix @pauldix paul@influxdata.com

  2. None
  3. Scale?

  4. Scale != count(servers)

  5. Scaling Throughput

  6. Scaling Total Data Size

  7. Scaling Development Teams

  8. Scaling Code Bases

  9. Scaling Feature Sets

  10. At Scale, Everything is Hard

  11. Time series data is the worst and best use case in distributed databases (dotScale 2015)

  12. High read & write throughput

  13. Large range scans

  14. Append/insert only

  15. Deletes against large ranges

  16. At Scale, Everything is Hard

  17. InfluxDB 0.9 to InfluxDB 2.0

  18. Monolith to Services

  19. Modern Containerized Data Platform Architecture

  20. Data Platform, not Database?

  21. Flashback to June 2015…

  22. None
  23. None
  24. None
  25. None
  26. None
  27. We’ve come a long way…

  28. Time Structured Merge Tree

  29. One-Dot-Oh

  30. Clustering

  31. Infrastructure software has come a long way…

  32. Containerization

  33. Kubernetes

  34. Declarative Infrastructure / Infrastructure as Code

  35. Lessons at Scale

  36. Single Tenant Inefficiencies

  37. Team Scaling: 12 -> 90

  38. Monolith Scaling: LOC 35k -> 280k

  39. At Scale, Monoliths are Hard

  40. Large Test Surface Area

  41. Slower Releases

  42. The more frequently you release code, the less risky each release is.

  43. Two-Dot-Oh

  44. Database designed for containers?

  45. Services based Database?

  46. Built on top of Kubernetes

  47. Multi-tenant

  48. Workload Isolation

  49. Architecture

  50. None
  51. None
  52. At Scale, Everything is Hard

  53. Single Server Monolith

  54. Architecture

  55. API

  56. UI

  57. Storage

  58. Query

  59. Processing, Monitoring & Alerting

  60. Collection & Scraping

  61. Deploy Services Independently

  62. Stateless Services

  63. Stateful Services

  64. Data has Gravity

  65. Auto-Scaling

  66. Singleton

  67. Decouple Query from Storage

  68. InfluxQL & TICKScript -> Flux https://github.com/influxdata/platform/query

  69. Flux (#fluxlang) is a lightweight language for working with data

  70. Push Down Processing

  71. Push Down Processing Flux Processor Data Node Data Node

  72. Push Down Processing Flux Processor Storage Node Storage Node

    from(db:"foo")
      |> range(start:-1h)
      |> filter(fn: (r) => r._measurement == "cpu" and r._field == "usage_system")
      |> sum()
      |> group()
      |> sort()
      |> limit(n:20)
  73. Push Down Processing Flux Processor Data Node Data Node

    from(db:"foo")
      |> range(start:-1h)
      |> filter(fn: (r) => r._measurement == "cpu" and r._field == "usage_system")
      |> sum()
      |> group()
      |> sort()
      |> limit(n:20)
  74. Push Down Processing Flux Processor Data Node Data Node Summary Ticks Back Up

  75. Push Down Processing Flux Processor Data Node Data Node

    from(db:"foo")
      |> range(start:-1h)
      |> filter(fn: (r) => r._measurement == "cpu" and r._field == "usage_system")
      |> sum()
      |> group()
      |> sort()
      |> limit(n:20)
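
A minimal sketch of the push-down idea in the slides above, using entirely hypothetical Go names (PushDownSpec, StorageNode, fluxProcessor are illustrations, not the actual platform API): the Flux processor ships the filter and aggregate portion of the plan to each data node, each node computes locally, and only small summaries tick back up to be combined.

    // Hypothetical sketch of push-down processing; none of these types exist
    // in the InfluxData platform code base under these names.
    package main

    import "fmt"

    // PushDownSpec describes the work a storage node can run locally:
    // the time range, the filter, and the aggregate from the Flux plan.
    type PushDownSpec struct {
        Measurement string
        Field       string
        RangeHours  int
        Aggregate   string // e.g. "sum"
    }

    // StorageNode stands in for a node that owns one shard of the data.
    type StorageNode struct {
        Name   string
        points []float64 // pretend these already match the spec's filter
    }

    // Execute runs the pushed-down aggregate locally and returns a single
    // summary value instead of streaming every raw point to the query tier.
    func (n *StorageNode) Execute(spec PushDownSpec) float64 {
        var sum float64
        for _, v := range n.points {
            sum += v
        }
        return sum
    }

    // fluxProcessor plays the query-tier role: fan the spec out, then combine
    // the per-node summaries (group, sort, and limit would happen here).
    func fluxProcessor(spec PushDownSpec, nodes []*StorageNode) float64 {
        var total float64
        for _, n := range nodes {
            total += n.Execute(spec)
        }
        return total
    }

    func main() {
        spec := PushDownSpec{Measurement: "cpu", Field: "usage_system", RangeHours: 1, Aggregate: "sum"}
        nodes := []*StorageNode{
            {Name: "data-node-1", points: []float64{1.5, 2.0}},
            {Name: "data-node-2", points: []float64{0.5}},
        }
        fmt.Println(fluxProcessor(spec, nodes)) // 4
    }
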
  76. Optimize RPC Make fast?

  77. At Scale, Marshaling is Slow

  78. Apache Arrow

  79. Zero-Copy, no marshaling overhead!

  80. In-memory columnar

  81. Sum 8,192 Values

    AVX2 using c2goasm:
      BenchmarkFloat64Funcs_Sum_8192-8   2000000    687 ns/op   95375.41 MB/s
      BenchmarkInt64Funcs_Sum_8192-8     2000000    719 ns/op   91061.06 MB/s
      BenchmarkUint64Funcs_Sum_8192-8    2000000    691 ns/op   94797.29 MB/s
    Pure Go:
      BenchmarkFloat64Funcs_Sum_8192-8    200000  10285 ns/op    6371.41 MB/s
      BenchmarkInt64Funcs_Sum_8192-8      500000   3892 ns/op   16837.37 MB/s
      BenchmarkUint64Funcs_Sum_8192-8     500000   3929 ns/op   16680.00 MB/s
  82. Sum 8,192 Values (same benchmark results as the previous slide)
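
A minimal sketch of the kind of code behind those numbers, assuming the Apache Arrow Go module from the go/arrow tree linked a few slides below; the exact import paths and signatures are my assumption and may have changed since this talk.

    package main

    import (
        "fmt"

        // Assumed import paths for the Arrow Go library at the time of this talk.
        "github.com/apache/arrow/go/arrow/array"
        "github.com/apache/arrow/go/arrow/math"
        "github.com/apache/arrow/go/arrow/memory"
    )

    func main() {
        // Build an in-memory columnar array of float64 values.
        b := array.NewFloat64Builder(memory.NewGoAllocator())
        defer b.Release()
        b.AppendValues([]float64{1, 2, 3, 4}, nil)

        values := b.NewFloat64Array()
        defer values.Release()

        // Float64Funcs.Sum is the routine the Float64Funcs_Sum benchmarks above
        // exercise; it can dispatch to an AVX2 kernel generated via c2goasm.
        fmt.Println(math.Float64.Sum(values)) // 10
    }
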
  83. At Scale, Data Layout in Memory Matters

  84. At Scale, CPU Instruction Set Capabilities Matter

  85. Follow Arrow Development https://github.com/apache/arrow/tree/master/go/arrow

  86. Follow Flux & Platform Development https://github.com/influxdata/platform

  87. At Scale, Everything is… Interesting

  88. At Scale, Everything is… Interesting

  89. Thank you. Paul Dix @pauldix