The Kubernetes Storage Layer: Peeling The Onion Minus The Tears

Slide 1

Slide 1 text

No content

Slide 2

Slide 2 text

Madhav Jivrajani, VMware The Kubernetes Storage Layer: Peeling The Onion Minus The Tears

Slide 3

Slide 3 text

$ whoami ● Work @ VMware ● Do work in API Machinery, Scalability, Architecture and ContribEx ● TL for SIG ContribEx and GitHub Admin of the project

Slide 4

Slide 4 text

Before We Start…

Slide 5

Slide 5 text

🚨Help migrate Prow jobs to community clusters! See https://github.com/kubernetes/test-infra/issues/29722 for details.

Slide 6

Slide 6 text

Prelude A 50,000 ft. view of how the Kubernetes “machine” works.

Slide 7

Slide 7 text

No content

Slide 8

Slide 8 text

No content

Slide 9

Slide 9 text

No content

Slide 10

Slide 10 text

No content

Slide 11

Slide 11 text

No content

Slide 12

Slide 12 text

No content

Slide 13

Slide 13 text

No content

Slide 14

Slide 14 text

No content

Slide 15

Slide 15 text

No content

Slide 16

Slide 16 text

No content

Slide 17

Slide 17 text

No content

Slide 18

Slide 18 text

No content

Slide 19

Slide 19 text

No content

Slide 20

Slide 20 text

No content

Slide 21

Slide 21 text

No content

Slide 22

Slide 22 text

No content

Slide 23

Slide 23 text

No content

Slide 24

Slide 24 text

No content

Slide 25

Slide 25 text

No content

Slide 26

Slide 26 text

No content

Slide 27

Slide 27 text

No content

Slide 28

Slide 28 text

No content

Slide 29

Slide 29 text

List + Watch

Slide 30

Slide 30 text

List + Watch “Kubernetes is a declarative, event-driven system.”

Slide 31

Slide 31 text

List + Watch “Kubernetes is a declarative, event-driven system.”

Slide 32

Slide 32 text

List “Kubernetes is a declarative, event-driven system.”

Slide 33

Slide 33 text

List “Kubernetes is a declarative, event-driven system.” We specify intent. ❯ kubectl apply -f 3-replica-deployment.yaml

Slide 34

Slide 34 text

List “Kubernetes is a declarative, event- driven system.”

Slide 35

Slide 35 text

List “Kubernetes is a declarative, event- driven system.” ● We need to start somewhere, in order to take actions, we need to know what the “current state” looks like.

Slide 36

Slide 36 text

List “Kubernetes is a declarative, event- driven system.” ● We need to start somewhere, in order to take actions, we need to know what the “current state” looks like. ● To do this, we perform a LIST operation. ❯ kubectl get --raw '/api/v1/namespaces/default/pods' { "kind": "PodList", "apiVersion": "v1", "metadata": { "resourceVersion":"1452", ... }, "items": [...] // all pods }

Slide 37

Slide 37 text

List “Kubernetes is a declarative, event- driven system.” ● In order to get the “current state”, we perform a LIST operation. ● Responses can get huge, sometimes we paginate. ❯ kubectl get --raw '/api/v1/namespaces/default/pods?limit=100' { "kind": "PodList", "apiVersion": "v1", "metadata": { "resourceVersion":"1452", "continue": "ENCODED_CONTINUE_TOKEN", ... }, "items": [...] // pod0-pod99 }

Slide 38

Slide 38 text

List “Kubernetes is a declarative, event- driven system.” ● In order to get the “current state”, we perform a LIST operation. ● Responses can get huge, sometimes we paginate. ● We can continue doing this till we get the entire “current state” (full list). ❯ kubectl get --raw '/api/v1/namespaces/default/pods?limit=100&cont inue=ENCODED_CONTINUE_TOKEN' { "kind": "PodList", "apiVersion": "v1", "metadata": { "resourceVersion":"1452", "continue": "ENCODED_CONTINUE_TOKEN_2", ... }, "items": [...] // pod100-pod199 }

Slide 39

Slide 39 text

Watch “Kubernetes is a declarative, event- driven system.”

Slide 40

Slide 40 text

Watch “Kubernetes is a declarative, event- driven system.”

Slide 41

Slide 41 text

Watch “Kubernetes is a declarative, event- driven system.”

Slide 42

Slide 42 text

Watch “Kubernetes is a declarative, event- driven system.”

Slide 43

Slide 43 text

Watch “Kubernetes is a declarative, event- driven system.” https://www.mgasch.com/2018/08/k8sevents/

Slide 44

Slide 44 text

Watch “Kubernetes is a declarative, event- driven system.” ● I have my state of the world from LIST. Now I need to know as and when events happen that modify this state so that I can take corrective action. ❯ kubectl get --raw '/api/v1/namespaces/default/pods?limit=100&cont inue=ENCODED_CONTINUE_TOKEN_2' { "kind": "PodList", "apiVersion": "v1", "metadata": { "resourceVersion":"1452", "continue": "ENCODED_CONTINUE_TOKEN_2", ... }, "items": [...] // pod100-pod199 }

Slide 45

Slide 45 text

Slide 46

Slide 46 text

Watch ❯ kubectl get --raw '/api/v1/namespaces/default/pods? watch=1&resourceVersion=1452' { "type": "MODIFIED", "object": { "kind": "Pod", "apiVersion": "v1", "metadata": {"resourceVersion":"1650", ...}, ...} } ... { "type": "DELETED", "object": { "kind": "Pod", "apiVersion": "v1", "metadata": {"resourceVersion":"1734", ...}, ...} } “Kubernetes is a declarative, event- driven system.” ● I have my state of the world from LIST. Now I need to know as and when events happen that modify this state so that I can take corrective action. ● WATCH for changes. The API Server gives us a stream of notifications on a single connection that we can “react” to.

Slide 47

Slide 47 text

resourceVersion

Slide 48

Slide 48 text

resourceVersion ● Opaque string representing “internal version” of an object. ● One big, global, logical clock.

Slide 49

Slide 49 text

Slide 50

Slide 50 text

resourceVersion ● Opaque string representing “internal version” of an object. ● One big, global, logical clock. ● resourceVersion is backed by etcd’s store revisions* – which provide a global ordering. ● Increases monotonically whenever any change to the state of the world happens. ● Gives you a global order of events that happen in the system. ● Most importantly - they enable optimistic concurrency control.

Slide 51

Slide 51 text

resourceVersion https://sched.co/1R2m8

Slide 52

Slide 52 text

The Kubernetes Storage Layer - Past

Slide 53

Slide 53 text

The Kubernetes Storage Layer - Past

Slide 54

Slide 54 text

The Kubernetes Storage Layer - Past

Slide 55

Slide 55 text

The Kubernetes Storage Layer - Past

Slide 56

Slide 56 text

The Kubernetes Storage Layer - Past

Slide 57

Slide 57 text

The Kubernetes Storage Layer - Past

Slide 58

Slide 58 text

The Kubernetes Storage Layer - Past

Slide 59

Slide 59 text

The Kubernetes Storage Layer - Past

Slide 60

Slide 60 text

The Kubernetes Storage Layer - Past

Slide 61

Slide 61 text

The Kubernetes Storage Layer - Past

Slide 62

Slide 62 text

The Kubernetes Storage Layer - Past If you had a controller, more the replicas, lesser the scalability of etcd.