Auto Scaling Kubernetes Clusters on OpenStack

One of the key features of Kubernetes is making it as easy as possible to scale your application workloads up and down. In this talk you will learn how to set up and use the horizontal pod autoscaler to dynamically scale pods up and down in complex setups. We are also excited to show you how we built a node autoscaler for Kubernetes on OpenStack, which adds worker nodes to running clusters before the horizontal pod autoscaler runs out of resources. We will also give you an overview of the current state of the vertical pod autoscaler in Kubernetes. When configured, it sets the CPU and memory resource requests automatically based on usage, allowing proper scheduling onto nodes so that the appropriate amount of resources is available for each pod.

Bastian Hofmann

November 13, 2018

Transcript

  1. @BastianHofmann Auto Scaling Kubernetes Clusters On OpenStack Bastian Hofmann Simon Pearce

  2. None
  3. Container orchestration platform

  4. Easy Scaling of applications

  5. Why do we want to scale?

  6. Handle increased load

  7. Only pay for what you actually need

  8. Save the environment

  9. How can applications be scaled?

  10. Horizontal Scaling

  11. Increase or decrease the number of instances or nodes

  12. Vertical Scaling

  13. Increase or decrease CPU and memory usage/capacity of one instance or node

  14. Bare metal approach

  15. Order a new server

  16. Put the server into the datacenter

  17. Install the Operating System

  18. Provision the server with necessary dependencies

  19. Deploy services to the new server

  20. Reconfigure the load balancer

  21. A lot of steps

  22. This is way too slow

  23. Cloud provider approach

  24. Create a new VM from an OS image

  25. Provision the server with necessary dependencies

  26. Deploy services to the new server

  27. Reconfigure the load balancer

  28. Still a lot of steps

  29. AutoScaling Groups

  30. Proprietary APIs for every Cloud Provider

  31. Kubernetes makes this easier

  32. Standardized APIs

  33. How does Kubernetes work?

  34. Container • A container runs a docker image • Only 1 process can run inside of a container

  35. Pod • A group of 1 or more containers • Shared network • Shared storage volumes

  36. php-fpm Nginx Filebeat

  37. kind: Deployment
      apiVersion: extensions/v1beta1
      metadata:
        name: hello-world
      spec:
        template:
          spec:
            containers:
            - name: hello-world
              image: nginxdemos/hello:0.2
              ports:
              - containerPort: 80

  38. Horizontal Scaling

  39. Replica Set • Defines and manages how many instances of a pod should run

  40. kind: Deployment
      apiVersion: extensions/v1beta1
      metadata:
        name: hello-world
      spec:
        replicas: 3
        template:
          spec:
            containers:
            - name: hello-world
              image: nginxdemos/hello:0.2
              ports:
              - containerPort: 80

  41. Vertical Scaling

  42. Container

  43. CPU and Memory requests and limits

  44. kind: Deployment
      ...
      containers:
      - name: hello-world
        image: nginxdemos/hello:0.2
        resources:
          requests:
            cpu: 100m
            memory: 256Mi
          limits:
            cpu: 100m
            memory: 256Mi
      ...

  45. "Requests" are used by Kubernetes for scheduling pods on nodes

  46. "Limits" limit the container to not use more CPU and

    memory
  47. You can change these values manually

  48. And automatically
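
Setting requests automatically is what the vertical pod autoscaler mentioned in the abstract does; it ships as part of the kubernetes/autoscaler project. A minimal sketch of its VerticalPodAutoscaler resource, assuming the VPA components are installed in the cluster (the apiVersion differs between releases, so treat it as a placeholder):

      apiVersion: autoscaling.k8s.io/v1
      kind: VerticalPodAutoscaler
      metadata:
        name: hello-world
      spec:
        targetRef:
          apiVersion: extensions/v1beta1
          kind: Deployment
          name: hello-world
        updatePolicy:
          updateMode: "Auto"   # "Auto" lets the VPA evict pods and recreate them with updated requests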

  49. Don't get up at night

  50. Focus on what is important

  51. Let's show this live

  52. Preparations for the demos

  53. We need a cluster

  54. Do-it-yourself vs. Managed Kubernetes

  55. Setting up and maintaining Kubernetes is hard

  56. Managed Kubernetes

  57. Google GKE

  58. None
  59. SysEleven MetaKube

  60. Easy upgrades

  61. Easy scaling

  62. Load Balancing

  63. Distributed Persistent Storage

  64. Backups

  65. Premium support

  66. Monitoring

  67. You can focus on what is important

  68. None
  69. None
  70. None
  71. None
  72. What if we need to scale a pod?

  73. Manual horizontal scaling

  74. Demo

  75. Create a Deployment

  76. kind: Deployment
      apiVersion: extensions/v1beta1
      metadata:
        name: hello-world
      spec:
        replicas: 1
        template:
          spec:
            containers:
            - name: hello-world
              image: nginxdemos/hello:0.2
              ports:
              - containerPort: 80

  77. $ kubectl apply -f deployment.yaml

  78. $ kubectl get pods
      NAME                          READY   STATUS    RESTARTS   AGE
      hello-world-fc5fd8f57-dmfjt   1/1     Running   0          26h

  79. Create a LoadBalancer

  80. $ kubectl expose deployment hello-world --name=hello-world-svc --port=80 --target-port=80 --type=LoadBalancer
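
The kubectl expose command above can also be written as a Service manifest. A minimal sketch; the selector label app: hello-world is an assumption and has to match the labels on the Deployment's pod template:

      apiVersion: v1
      kind: Service
      metadata:
        name: hello-world-svc
      spec:
        type: LoadBalancer     # the cloud provider provisions an external load balancer
        selector:
          app: hello-world     # assumed pod label
        ports:
        - port: 80
          targetPort: 80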

  81. $ kubectl get service hello-world-svc
      NAME              TYPE           CLUSTER-IP     EXTERNAL-IP       PORT(S)        AGE
      hello-world-svc   LoadBalancer   10.10.10.102   195.192.xxx.xxx   80:31750/TCP   37s

  82. Scale the deployment

  83. $ kubectl scale deployment/hello-world --replicas 15

  84. $ kubectl get pods
      NAME                          READY   STATUS    RESTARTS   AGE
      hello-world-fc5fd8f57-dmfjt   1/1     Running   0          26h
      hello-world-fc5fd8f57-db42a   1/1     Running   0          26h
      hello-world-fc5fd8f57-u9htw   1/1     Running   0          26h
      hello-world-fc5fd8f57-btw4h   1/1     Running   0          26h
      hello-world-fc5fd8f57-t4qn5   1/1     Running   0          26h
      hello-world-fc5fd8f57-nw5yj   1/1     Running   0          26h
      hello-world-fc5fd8f57-ny53n   1/1     Running   0          26h
      hello-world-fc5fd8f57-klxw5   1/1     Running   0          26h
      hello-world-fc5fd8f57-k0t5s   1/1     Running   0          26h
      hello-world-fc5fd8f57-653na   1/1     Running   0          26h
      hello-world-fc5fd8f57-xfis4   1/1     Running   0          26h
      hello-world-fc5fd8f57-klds7   1/1     Running   0          26h
      hello-world-fc5fd8f57-babre   1/1     Running   0          26h
      hello-world-fc5fd8f57-aj5et   1/1     Running   0          26h
      hello-world-fc5fd8f57-q5aha   1/1     Running   0          26h
      hello-world-fc5fd8f57-au5a5   1/1     Running   0          26h

  85. kind: Deployment
      apiVersion: extensions/v1beta1
      metadata:
        name: hello-world
      spec:
        replicas: 15
        template:
          spec:
            containers:
            - name: hello-world
              image: nginxdemos/hello:0.2
              ports:
              - containerPort: 80

  86. What if we need to scale the number of pods automatically?

  87. Horizontal Pod Auto Scaling

  88. Demo

  89. Cluster needs metrics-server
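
The horizontal pod autoscaler reads pod CPU and memory usage from the resource metrics API, which metrics-server provides. A quick sanity check, assuming a default installation that registers the v1beta1.metrics.k8s.io API service:

      $ kubectl get apiservice v1beta1.metrics.k8s.io
      $ kubectl top nodes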

  90. Create Horizontal Pod Autoscaler

  91. $ kubectl top pods
      NAME                          CPU(cores)   MEMORY(bytes)
      hello-world-fc5fd8f57-dmfjt   0m           1Mi
      hello-world-fc5fd8f57-ntasr   0m           1Mi

  92. $ kubectl autoscale deployment hello-world --min=1 --max=6 --cpu-percent=5
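
The same autoscaler can be created declaratively instead of with kubectl autoscale. A minimal sketch using the autoscaling/v1 API, which only supports CPU-based scaling; the values mirror the command above:

      apiVersion: autoscaling/v1
      kind: HorizontalPodAutoscaler
      metadata:
        name: hello-world
      spec:
        scaleTargetRef:
          apiVersion: extensions/v1beta1
          kind: Deployment
          name: hello-world
        minReplicas: 1
        maxReplicas: 6
        targetCPUUtilizationPercentage: 5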

  93. $ kubectl get horizontalpodautoscaler hello-world
      NAME          REFERENCE                TARGETS        MINPODS   MAXPODS   REPLICAS
      hello-world   Deployment/hello-world   <unknown>/5%   1         6         2

  94. Increase the load on the containers by sending lots of requests

  95. $ ab -c 900 -n 15000 http://195.192.129.xyz/

  96. How does the Horizontal Pod Autoscaler work?

  97. See what the autoscaler is doing

  98. $ kubectl describe horizontalpodautoscaler hello-world
      ...
      Events:
        Type    Reason             Age                  From                       Message
        ----    ------             ----                 ----                       -------
        Normal  SuccessfulRescale  15m (x222 over 71m)  horizontal-pod-autoscaler  New size: 6; reason: Current number of replicas above Spec.MaxReplicas

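In short, the autoscaler periodically compares the observed metric with the target, here CPU usage as a percentage of the pods' CPU requests, and sets

      desiredReplicas = ceil(currentReplicas * currentMetricValue / targetMetricValue)

clamped to the configured minimum and maximum. This is also why CPU requests have to be set on the containers for percentage-based CPU scaling to work.
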
  99. What if a process needs more resources?

  100. Manual vertical scaling

  101. Demo

  102. Change pod resource requests and limits

  103. $ kubectl edit deployment hello-world

  104. kind: Deployment
       ...
       containers:
       - name: hello-world
         image: nginxdemos/hello:0.2
         resources:
           requests:
             cpu: 100m
             memory: 256Mi
           limits:
             cpu: 100m
             memory: 256Mi
       ...

  105. Pods are re-scheduled on the cluster if necessary

  106. If there are not enough resources, Pods remain pending

  107. $ kubectl get pods
       NAME                          READY   STATUS    RESTARTS   AGE
       hello-world-fc5fd8f57-dmfjt   0/1     Pending   0          26h
       hello-world-fc5fd8f57-db42a   0/1     Pending   0          26h
       hello-world-fc5fd8f57-u9htw   0/1     Pending   0          26h
       hello-world-fc5fd8f57-btw4h   0/1     Pending   0          26h
       hello-world-fc5fd8f57-t4qn5   1/1     Running   0          26h
       hello-world-fc5fd8f57-nw5yj   1/1     Running   0          26h
       hello-world-fc5fd8f57-ny53n   1/1     Running   0          26h
       hello-world-fc5fd8f57-klxw5   1/1     Running   0          26h
       hello-world-fc5fd8f57-k0t5s   1/1     Running   0          26h
       hello-world-fc5fd8f57-653na   1/1     Running   0          26h
       hello-world-fc5fd8f57-xfis4   1/1     Running   0          26h
       hello-world-fc5fd8f57-klds7   1/1     Running   0          26h
       hello-world-fc5fd8f57-babre   1/1     Running   0          26h
       hello-world-fc5fd8f57-aj5et   1/1     Running   0          26h
       hello-world-fc5fd8f57-q5aha   1/1     Running   0          26h
       hello-world-fc5fd8f57-au5a5   1/1     Running   0          26h

  108. $ kubectl describe pod hello-world-fc5fd8f57-dmfjt
       Events:
         Type     Reason            Age               From               Message
         ----     ------            ----              ----               -------
         Warning  FailedScheduling  2s (x5 over 10s)  default-scheduler  0/3 nodes are available: 3 Insufficient cpu.

  109. What if we need more nodes to schedule additional pods?

  110. Node Scaling

  111. Manually adding more VMs to the cluster

  112. Cloud provider dependent

  113. SysEleven MetaKube

  114. None
  115. Cluster Management API https://github.com/kubernetes-sigs/cluster-api

  116. Kubermatic Machine Controller https://github.com/kubermatic/machine-controller

  117. [Diagram: a MachineDeployment performs rolling updates of MachineSets, a MachineSet ensures the replica count of its Machines, and the MachineController listens for Machines and creates the corresponding VMs, which join the cluster as Nodes]

  118. What happens if you have to add nodes outside of working hours?

  119. Node Auto Scaling

  120. SysEleven MetaKube

  121. Cluster Autoscaler https://github.com/kubernetes/autoscaler

  122. Cluster Management API https://github.com/kubernetes-sigs/cluster-api

  123. Demo

  124. $ kubectl get nodes
       NAME                          STATUS   ROLES    AGE    VERSION
       kubermatic-fhgbvx65xg-7flj7   Ready    <none>   7d1h   v1.12.2
       kubermatic-fhgbvx65xg-hmgd4   Ready    <none>   7d2h   v1.12.2
       kubermatic-fhgbvx65xg-q287t   Ready    <none>   7d2h   v1.12.2

  125. $ kubectl get machines -n kube-system
       NAME                                  AGE
       machine-kubermatic-fhgbvx65xg-7flj7   7d
       machine-kubermatic-fhgbvx65xg-hmgd4   7d
       machine-kubermatic-fhgbvx65xg-q287t   7d

  126. apiVersion: cluster.k8s.io/v1alpha1
       kind: MachineDeployment
       metadata:
         annotations:
           cluster-autoscaler/minsize: 1
           cluster-autoscaler/maxsize: 15
         name: scalable-machine-deployment
         namespace: kube-system
       spec:
         replicas: 1
         ...

  127. ...
       cloudProvider: openstack
       cloudProviderSpec:
         availabilityZone: dbl
         flavor: m1-small
         floatingIpPool: ext-net
         identityEndpoint: "https://cloud.sys11.net/v3"
         image: "Ubuntu 18.04"
         network: kubermatic-c123
         region: dbl
         securityGroups:
         - kubermatic-c123
       operatingSystem: ubuntu
       operatingSystemSpec:

  128. ...
       sshPublicKeys:
       - "..."
       ...
       versions:
         kubelet: "v1.12.2"

  129. $ kubectl get machines -n kube-system
       NAME                                           AGE
       machine-kubermatic-fhgbvx65xg-7flj7            7d
       machine-kubermatic-fhgbvx65xg-hmgd4            7d
       machine-kubermatic-fhgbvx65xg-q287t            7d
       scalable-machine-deployment-hueab94ghq-abiur   3m
       scalable-machine-deployment-hueab94ghq-4uhva   3m
       scalable-machine-deployment-hueab94ghq-vhues   3m

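Before enabling the cluster-autoscaler, the same MachineDeployment can also be scaled by hand, for example by patching its replica count (a sketch; resource name and namespace are taken from the manifest above):

      $ kubectl -n kube-system patch machinedeployment scalable-machine-deployment \
          --type merge -p '{"spec":{"replicas":3}}'
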
  130. Summary

  131. With auto scaling you can ...

  132. Save valuable resources

  133. Save the environment

  134. Kubernetes makes auto scaling a lot easier

  135. And you don't have to get up at night

  136. Join us in the SysEleven Lounge
       b.hofmann@syseleven.de
       s.pearce@syseleven.de
       metakube@syseleven.de
       https://twitter.com/BastianHofmann
       http://speakerdeck.com/u/bastianhofmann