Kubernetes presents its own language that can be difficult to grok at first. This talk introduces the audience to some basic concepts around Kubernetes and then presents pods and pod types as an object hierarchy.
about: ◦ Why Kubernetes exists. ◦ How Kubernetes works. ◦ The challenges of container orchestration at scale. ◦ How you can use Kubernetes. • Most importantly, I want all of you to have a good time with some like-minded folks and learn something new in a fun way.
best software has a vision. The best software takes sides. When someone uses software, they're not just looking for features, they're looking for an approach. They're looking for a vision. Decide what your vision is and run with it.
actually a Greek term for “helmsman” or “ship’s pilot”. • Developed at Google. The third iteration of container management. ◦ Daddy was Omega. ◦ Grandaddy was Borg. • Kubernetes is not a PaaS, but you can build one with it. • Google says that Kubernetes is planet scale.
Master • 1,009 Nodes over 4 Regions • 8,072 Cores. The master used 24; minion nodes used the rest • 1,009 IP addresses • 1,009 routes set up by Kubernetes on Google Cloud Platform • 100,510 GB persistent disk used by the Minions and the Master • 380,020 GB SSD persistent disk. 20 GB for the master and 340 GB per Cassandra Pet. • 1,000 deployed instances of Cassandra
one or more containers. • The containers in a pod are scheduled on the same node. • Everything in Kubernetes is some flavor of pod or an extension of the pod spec. • Remember this for now, we’ll get back to it in a second. A pod is a collection of containers.
(boo*).

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: ""
  labels:
    name: ""
  namespace: ""
  annotations: []
  generateName: ""
spec:
  ? "// See 'The spec schema' for details."
  : ~
```

```json
{
  "kind": "Pod",
  "apiVersion": "v1",
  "metadata": {
    "name": "",
    "labels": {
      "name": ""
    },
    "generateName": "",
    "namespace": "",
    "annotations": []
  },
  "spec": {
    // See 'The spec schema' for details.
  }
}
```

*Font size 14 vs font size 10: YAML is the clear winner, especially in the context of Shannon’s information theory. The same density of information can be transmitted in fewer lines with YAML.
A label is a key-value pair that is assigned to objects in k8s. ◦ Pods, services, lots of things can have labels. • A selector is a way to filter for labels that match a certain criteria or logic. ◦ There are two types of selectors: ▪ Equality based ▪ Set based
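As a sketch of the two selector styles (label names here are hypothetical, not from the talk), equality-based selectors match exact key-value pairs, while set-based selectors use `In` / `NotIn` / `Exists` expressions:

```yaml
# Equality-based: match pods whose labels equal these pairs
# (the form used by Services and Replication Controllers).
selector:
  app: redis
  tier: backend
---
# Set-based: the form used by Replica Sets, Deployments, etc.
selector:
  matchLabels:
    app: redis
  matchExpressions:
  - key: environment
    operator: In
    values: [production, qa]
```

The same logic works from the CLI, e.g. `kubectl get pods -l 'environment in (production, qa)'`.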
ears of the operation. Handles requests from clients to make changes to state. • kube-scheduler: Scheduler for Pods. Probably the most advanced component. • kube-controller-manager: Handles control loop services in k8s. • *etcd: Used to store state. Node • kubelet: This is the k8s client service that runs on nodes. It functions as a job scheduler, taking calls from the master and ensuring state. • kube-proxy: Handles network routing for Services on each node, including load balancing and exposing containers externally. *etcd is used to handle state for the cluster, but is not exclusive to k8s.
a cluster, you get some built-in services. • Each one of these has its own endpoints and / or UIs. • They run on the master directly, though you could schedule them across the cluster or other masters. • To find the endpoints, type: kubectl cluster-info Heapster
[Diagram: cluster “basic-cluster-01”. The K8S Master runs the SkyDNS, ETCD, Kibana, Grafana, Elasticsearch, and Heapster pods. K8S Nodes 1 and 2 each run redis-django pods (a redis container plus a django container) alongside some-other pod.]
of a cluster. • It’s a virtual cluster in your cluster. • Each cluster can have multiple namespaces. • The root services have their own. • Namespaces can be network-isolated from each other and are (normally) used to house different environments on the same cluster.
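A namespace is one of the smallest manifests in k8s; a minimal sketch (the name `staging` is hypothetical):

```yaml
apiVersion: v1
kind: Namespace
metadata:
  name: staging
```

Create it with `kubectl create -f staging-namespace.yaml`, then target it with `kubectl --namespace=staging get pods`.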
many, many, many options. • It is a direct mapping of the API. • The commands are essentially shortcuts for interacting with the master server and figuring out what’s going on with your cluster. • This is probably why people have spent time building PaaS offerings on top of it. #bashishard
to fire up a pod. With kubectl you would: 1. Make a Pod request to the API server using a local pod definition file. 2. The API server saves the info for the pod in ETCD. 3. The scheduler finds the unscheduled pod and schedules it to a node. 4. Kubelet sees the pod scheduled and fires up docker. 5. Docker runs the container. The entire lifecycle state of the pod is stored in ETCD.
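The “local pod definition file” from step 1 might look like this minimal sketch (names and image are hypothetical):

```yaml
# redis-pod.yaml: a single-container pod definition.
apiVersion: v1
kind: Pod
metadata:
  name: redis-pod
  labels:
    name: redis-pod
spec:
  containers:
  - name: redis
    image: redis:3.2
    ports:
    - containerPort: 6379
```

Running `kubectl create -f redis-pod.yaml` is step 1; steps 2 through 5 happen automatically from there.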
makes calls against the kube-apiserver. • The Controller Manager, Scheduler Service, and ETCD can be spread across nodes based on cluster size. • All state about everything is stored in ETCD.
called kubelet. Think “cubed omelette”. • The kubelet process manages the Pods, including containers & volumes. • The kube-proxy service handles network routing and service exposure.
it easiest to think of everything as a variation of a Pod or another object. • Google has done a very good job of extending base objects to add flexibility or support new features. • This also means that the Pod spec is relatively stable given the massive list of features that lands every release.
• Replica Set → Pod Spec → Container
• Replication Controller → Pod Spec → Container
• Daemon Set → Pod Spec → Container
• Pet Set → Pod Spec → Container
• Deployment → Replica Set → Pod Spec → Container
• Service → Pod
• Ingress → Service
• Job → Pod Spec → Container
```yaml
spec:
  containers:
  - args:
    - ""
    command:
    - ""
    env:
    - name: ""
      value: ""
    image: ""
    imagePullPolicy: ""
    name: ""
    ports:
    - containerPort: 0
      name: ""
      protocol: ""
    resources:
      cpu: ""
      memory: ""
  restartPolicy: ""
  volumes:
  - emptyDir:
      medium: ""
    name: ""
    secret:
      secretName: ""
```

• The only required field is containers. ◦ And it requires two entries: ▪ name ▪ image • restartPolicy applies to all containers in a pod. • volumes are volumes (duh) that any container in a pod can mount. • The spec is very extensible by design.
by themselves; for that you need a pod. • Pods are just collections of containers that share a few things: ◦ Access to volumes. ◦ Networking. ◦ Co-location. • Pods can be run by themselves, but have no guarantee to restart, stay running, scale, or do anything useful, really.
to an external source. • With Pods, a virtual endpoint is created and then routed to via kube-proxy. • For non-pod services, a virtual IP in the cluster is used to route externally.
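A minimal Service sketch (names and port are hypothetical) that routes to any pod carrying a matching label:

```yaml
apiVersion: v1
kind: Service
metadata:
  name: redis-svc
spec:
  selector:
    name: redis-pod   # equality-based label selector
  ports:
  - port: 6379        # port the service exposes
    targetPort: 6379  # port on the pod's container
```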
sits at the boundary of the cluster and routes requests to Services. • One Ingress Controller can handle multiple domains. • Each route can point to a different Service. • Relies on the creation of an Ingress Controller in the cluster (another service that is not enabled by default).
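A sketch of an Ingress routing one host to a Service (host, names, and port are hypothetical; the apiVersion shown is the one from the k8s 1.3 era and has moved in later releases):

```yaml
apiVersion: extensions/v1beta1
kind: Ingress
metadata:
  name: web-ingress
spec:
  rules:
  - host: app.example.com
    http:
      paths:
      - path: /
        backend:
          serviceName: web-svc   # an existing Service in the cluster
          servicePort: 80
```

Remember: this object does nothing until an Ingress Controller is running in the cluster.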
that a copy of each Pod runs on each node. • This is commonly used to make sure side-car containers are running across the cluster. • If new nodes come up, they’ll get a copy of the daemon set. • Daemon sets don’t have scaling rules.
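A Daemon Set sketch for the classic side-car case, a per-node log collector (names and image are hypothetical; apiVersion is the 1.3-era one):

```yaml
apiVersion: extensions/v1beta1
kind: DaemonSet
metadata:
  name: log-collector
spec:
  template:            # a normal pod spec; one copy lands on every node
    metadata:
      labels:
        name: log-collector
    spec:
      containers:
      - name: fluentd
        image: fluentd:v0.14
```

Note there is no `replicas` field: the node count is the scaling rule.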
Sets allow you to create complex microservices across the cluster. • They have the ability to set dependencies on other containers. • They require: ◦ A stable hostname, available in DNS ◦ An ordinal index ◦ Stable storage, linked to the ordinal and hostname • It’s for launching a cluster in your cluster.
way to run Pods. • You set a number of pods to run, and the Replication Controller made sure that number was running across the cluster. • Rolling updates could be performed by starting a new Replication Controller and scaling up.
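For reference, a Replication Controller wraps an ordinary pod template (names and image are hypothetical):

```yaml
apiVersion: v1
kind: ReplicationController
metadata:
  name: redis-rc
spec:
  replicas: 3          # keep exactly three copies running
  selector:
    name: redis-pod    # equality-based selector only
  template:            # the pod spec to replicate
    metadata:
      labels:
        name: redis-pod
    spec:
      containers:
      - name: redis
        image: redis:3.2
```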
from the Replication Controller because it can be updated. • If you update the Replica Set template, you can fire an update and automatically roll changes. • Rollbacks are also built in. • These are not designed to be used directly. For that you need ...
the running state of Pods and Replica Sets. • In k8s 1.3 it is the primary object you should be manipulating. • Deployments have: ◦ History. ◦ Rolling updates. ◦ Pausing updates. ◦ Rollbacks.
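A Deployment sketch (names and image hypothetical; the apiVersion shown is the 1.3-era one). It looks almost identical to a Replication Controller, but the machinery behind it manages Replica Sets for you:

```yaml
apiVersion: extensions/v1beta1
kind: Deployment
metadata:
  name: redis-deploy
spec:
  replicas: 3
  template:            # changing this template triggers a rolling update
    metadata:
      labels:
        name: redis-pod
    spec:
      containers:
      - name: redis
        image: redis:3.2
```

The history, pause, and rollback features map to kubectl subcommands: `kubectl rollout history deployment/redis-deploy`, `kubectl rollout pause deployment/redis-deploy`, and `kubectl rollout undo deployment/redis-deploy`.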
secret store that is namespaced and uses labels to control pod read access. • Network Policies: ◦ You can use labels to define whitelist rules between pods. • Persistent Volumes: ◦ These live outside of normal pod volumes and can be used for shared storage for things like databases. Yes, databases in containers.
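A Network Policy sketch using labels as the whitelist, allowing only django pods to reach redis pods (names are hypothetical; the API group was `extensions/v1beta1` in this era and has since moved):

```yaml
apiVersion: extensions/v1beta1
kind: NetworkPolicy
metadata:
  name: django-to-redis
spec:
  podSelector:           # the pods this policy protects
    matchLabels:
      name: redis-pod
  ingress:
  - from:
    - podSelector:       # the pods allowed to connect in
        matchLabels:
          name: django
```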
switch and go 100% container by the end of Q3. • We have an existing solution in place that uses Troposphere and ECS. • Looking 12 months out we will outgrow this solution. • I am betting on Kubernetes.
Our “tool” that spits out CloudFormation templates creates: ◦ A CloudFormation stack that spawns: ▪ A 45 node ECS cluster that has: • 120 different ECS tasks running on it ◦ With ELBs ▪ With Route53 entries • Several RDS databases, an ElasticSearch Cluster, and a MemCache Cluster that the containers talk to ◦ And the template it spits out is actually a main template with four sub-templates, because more than 4,000 lines makes Werner Vogels’ head a’splode.
Engineer @Civitas Learning • Believer in the healing power of DevOps • I don’t work for Google • I don’t believe in silver bullets • I do believe that if you’re thinking about cluster management for containers you need to find the right tool for you