
[SF DevOps meetup 06/06/19] TiDB Operator

This deck was delivered at the SF DevOps meetup on June 6, 2019, to introduce TiDB, TiKV and how to run the various components of the TiDB platform on Kubernetes using the operator pattern.


Kevin Xu

June 06, 2019


Transcript

  1. A little about PingCAP
     • Founded in April 2015 by 3 infrastructure engineers
     • Created and maintains TiDB and TiKV
     • Offices throughout North America and China
  2. Mobike + TiDB
     • 200 million users
     • 200 cities
     • 9 million smart bikes
     • ~35 TB in TiDB
  3. Technical Inspiration
     • TiDB is a NewSQL database that speaks the MySQL protocol
     • It is not based on the MySQL source code
     • It is an ACID / strongly consistent database
     • The inspiration is Google Spanner + F1
     • It separates SQL processing and storage into independently scalable components
     • The SQL processing layer is stateless
     • It is designed for both Transactional and Analytical Processing (HTAP)
  4. Use Cases
     1. Approaching the maximum size for MySQL on a single server; debating whether or not to shard.
     2. Already sharded MySQL, but having a hard time doing analytics on up-to-date data.
  5. [Architecture diagram: stateless TiDB SQL nodes (TiDB Core) on top of a TiKV Cluster, with Regions 1-4 replicated across TiKV Nodes 1-3 ("L" marking each Region's leader replica), coordinated by a PD Cluster]
  6. TiDB HTAP: Row + Column Storage [Architecture diagram: the row-based TiKV Cluster (Regions across TiKV Nodes 1-3) extended with a column-based TiFlash Extension Cluster (TiFlash Nodes 1-2) and a Spark Cluster of TiSpark Workers, all serving TiDB]
  7. Operator History
     • Operator pattern pioneered by CoreOS... now Red Hat... now IBM
     • Operators introduced in 2016, the Operator Framework in 2018
       ◦ First two: etcd operator, Prometheus operator
     • TiDB Operator (2017) predated the Operator Framework
  8. Why Do We (As in TiDB) Care?
     • Manage multiple clusters (multi-tenancy)
     • Safe scaling (up or down, in or out)
     • Use different types of network or local storage (different performance)
     • Automatic monitoring
     • Rolling updates
     • Automatic failover
     • *Multi-Cloud* (as long as it has k8s)
  9. Why Should YOU Care?
     • Manages stateful applications:
       ◦ databases, caches, monitoring systems, etc.
     • Encodes application domain knowledge
       ◦ Extension of your SRE team
     • Kubernetes-enabled Hybrid / Multi-Cloud
     • Growing popularity in the database community:
       ◦ https://thenewstack.io/databases-operators-bring-stateful-workloads-to-kubernetes/
  10. Resources -- CRD
     • Custom Resource Definition (CRD):
       ◦ An application-specific YAML file
       ◦ End user writes the domain operation logic in the CRD
       ◦ Simple to implement and deploy
     • (There is another way):
       ◦ API Aggregation:
         ▪ More control, more powerful but...
         ▪ Hard to deploy, not well supported by k8s engines
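A CRD itself is just a short manifest that registers the new resource type with the Kubernetes API server. A minimal sketch, assuming a hypothetical `tidbclusters.pingcap.com` group and a deliberately open schema (this is illustrative, not the exact TiDB Operator definition):

```yaml
# Hypothetical CRD registering a TidbCluster resource type.
# Group, version, and schema are illustrative assumptions.
apiVersion: apiextensions.k8s.io/v1
kind: CustomResourceDefinition
metadata:
  name: tidbclusters.pingcap.com   # must be <plural>.<group>
spec:
  group: pingcap.com
  scope: Namespaced
  names:
    kind: TidbCluster
    plural: tidbclusters
    singular: tidbcluster
  versions:
    - name: v1alpha1
      served: true
      storage: true
      schema:
        openAPIV3Schema:
          type: object
          properties:
            spec:
              type: object
              # accept arbitrary spec fields for this sketch
              x-kubernetes-preserve-unknown-fields: true
```

Once applied, `kubectl get tidbclusters` works like any built-in resource, which is why CRDs are the "simple to implement and deploy" option compared to API Aggregation.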
  11. Cluster State -- StatefulSet
     StatefulSet...
     • Guarantees ordering and uniqueness of pods
       ◦ pd -> tikv -> tidb
     • Gives "sticky" identity -- network and storage
     • *No* interchangeable pods
       ◦ always maps the same volume to the same pod
     • Stable since Kubernetes 1.9
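The guarantees above show up directly in a StatefulSet manifest; names, image, and sizes below are placeholders, not TiDB Operator's actual manifests:

```yaml
# Minimal StatefulSet sketch for a TiKV-like component.
# Each pod gets a stable name (tikv-0, tikv-1, ...) and its own
# PersistentVolumeClaim, so the same volume always maps to the
# same pod across restarts and rescheduling.
apiVersion: apps/v1
kind: StatefulSet
metadata:
  name: tikv
spec:
  serviceName: tikv-peer        # headless Service providing sticky DNS identity
  replicas: 3
  selector:
    matchLabels:
      app: tikv
  template:
    metadata:
      labels:
        app: tikv
    spec:
      containers:
        - name: tikv
          image: pingcap/tikv:v3.0.0   # placeholder tag
          volumeMounts:
            - name: data
              mountPath: /var/lib/tikv
  volumeClaimTemplates:          # one PVC per pod, retained across restarts
    - metadata:
        name: data
      spec:
        accessModes: ["ReadWriteOnce"]
        resources:
          requests:
            storage: 100Gi
```

The `volumeClaimTemplates` section is what makes pods non-interchangeable: tikv-1 always reattaches to the data-tikv-1 claim, never to another pod's volume.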
  12. [Architecture diagram, repeated from slide 5: TiDB Core, TiKV Cluster with Region leaders, PD Cluster]
  13. How TiDB manages state -- Custom Controller [Diagram: the user provides a CRD with a Spec (component, image, replicas, ...); the Custom Controller drives the Cluster State to match and reports back a Status (image, replicas, state)]
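A sketch of that spec/status split: the user writes `spec` (desired state) and the custom controller writes back `status` (observed state) as it reconciles. Field names mirror the diagram, not the exact TiDB Operator schema:

```yaml
# Illustrative custom resource instance. The user authors spec;
# the controller continuously updates status to reflect reality.
apiVersion: pingcap.com/v1alpha1   # assumed group/version
kind: TidbCluster
metadata:
  name: demo
spec:                 # desired state (written by the user)
  component: tikv
  image: pingcap/tikv:v3.0.0
  replicas: 3
status:               # observed state (written by the controller)
  image: pingcap/tikv:v3.0.0
  replicas: 3
  state: Running
```

Reconciliation is simply the controller noticing any gap between the two halves (say, `spec.replicas: 5` vs `status.replicas: 3`) and acting to close it.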
  14. [Diagram: the TidbCluster CRD syncs to the TidbCluster Controller, which watches the API for change detection and reconciles the Spec (component, replicas, ...) into PD, TiKV, and TiDB StatefulSets on the TiDB nodes, each StatefulSet owning Pods with PVCs bound to PVs]
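One way to picture what that controller consumes: a single TidbCluster resource with one section per component, each reconciled into its own StatefulSet. The API version and field names below are illustrative approximations, not the exact TiDB Operator schema:

```yaml
# Illustrative TidbCluster resource. The controller reconciles each
# component section into its own StatefulSet, honoring the
# pd -> tikv -> tidb ordering from slide 11.
apiVersion: pingcap.com/v1alpha1   # assumed group/version
kind: TidbCluster
metadata:
  name: demo
spec:
  pd:
    replicas: 3
    storageClassName: local-storage   # placeholder storage class
  tikv:
    replicas: 3
    storageClassName: local-storage
  tidb:
    replicas: 2     # stateless SQL layer; no persistent volumes needed
```

Scaling, rolling updates, and failover then reduce to editing this one resource and letting the controller's watch/reconcile loop do the rest.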