ChaDevOps: Deploying Self Healing Services with Kubernetes

Deploying Self Healing Services with Kubernetes
Rob Scott | @robertjscott
ChaDevOps, June 20, 2017

@spire spire.me

Remember This? February 28, 2017

All Spire systems were still up

It’s never that simple

All Spire systems were still up

Our Systems Before Kubernetes

The core services powering Spire
HTTP services and background services:
• Website
• API
• Scheduler
• Background Processing
• Notifications
• Management Portal

What it all looks like in Kubernetes
[Diagram: four nodes (Node 1 to Node 4), each running a mix of Notifications, Background Processing, API, Management Portal, Website, and Scheduler pods, labeled QA, STAGING, or DEMO.]

What if a Node dies?
[Diagram: the same four nodes (Node 1 to Node 4), each running a mix of Notifications, Background Processing, API, Management Portal, Website, and Scheduler pods, labeled QA, STAGING, or DEMO.]

After redistribution
[Diagram: only Node 1, Node 2, and Node 3 are labeled; the pods that had been on Node 4 are still shown, no longer under a Node 4 label.]

Kuberwhat?

Initial Release: July 21, 2015
Google partnered with the Linux Foundation to form the Cloud Native Computing Foundation (CNCF) to govern Kubernetes.

Container Orchestration Tools SWARM

Container Orchestration Trends

Demo
Everything you’ll need to deploy your own self healing services with Kubernetes.

Namespace Kubernetes Foundation

apiVersion: v1
kind: Namespace
metadata:
  name: self-healing-k8s-demo

Service Kubernetes Foundation

apiVersion: v1
kind: Service
metadata:
  name: self-healing-k8s-demo
spec:
  type: LoadBalancer
  selector:
    app: self-healing-k8s-demo
  ports:
    - protocol: TCP
      port: 80
      targetPort: 3000

Pod Kubernetes Foundation

apiVersion: v1
kind: Pod
metadata:
  name: demo-pod
  labels:
    app: self-healing-k8s-demo
spec:
  containers:
    - name: demo-http-server
      image: quay.io/robertjscott/demo-http-server:0.1.1

Deployment Kubernetes Foundation

apiVersion: extensions/v1beta1
kind: Deployment
metadata:
  name: demo-deployment
spec:
  replicas: 3
  template:
    metadata:
      labels:
        app: self-healing-k8s-demo
    spec:
      containers:
        - name: demo-http-server
          image: quay.io/robertjscott/demo-http-server:0.1.1
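The manifest above uses extensions/v1beta1, which was current when this talk was given. That API group stopped serving Deployments in Kubernetes 1.16. A minimal sketch of the same Deployment under apps/v1, assuming a newer cluster; the only structural addition is spec.selector, which apps/v1 requires and which must match the pod template labels:

```yaml
# Sketch only: the slide's Deployment rewritten for apps/v1 (assumes Kubernetes 1.16+).
apiVersion: apps/v1
kind: Deployment
metadata:
  name: demo-deployment
spec:
  replicas: 3
  selector:
    matchLabels:
      app: self-healing-k8s-demo   # must match template.metadata.labels
  template:
    metadata:
      labels:
        app: self-healing-k8s-demo
    spec:
      containers:
        - name: demo-http-server
          image: quay.io/robertjscott/demo-http-server:0.1.1
```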

Example: Bad Code Example

Liveness Probes Key Concept

spec:
  containers:
    - name: demo-http-server
      image: quay.io/robertjscott/demo-http-server:0.1.1
      livenessProbe:
        httpGet:
          path: /alive
          port: 3000
        periodSeconds: 5
        timeoutSeconds: 1
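How quickly a failing container is restarted also depends on fields the slide leaves at their defaults. A sketch with those fields written out, assuming the documented Kubernetes defaults (failureThreshold: 3, successThreshold: 1): with a 5 second period, the container is restarted after three consecutive failed probes, so roughly 10 to 15 seconds after it stops answering /alive.

```yaml
# Sketch: the slide's liveness probe with the default thresholds made explicit.
livenessProbe:
  httpGet:
    path: /alive
    port: 3000
  periodSeconds: 5       # probe every 5 seconds
  timeoutSeconds: 1      # a probe taking longer than 1 second counts as a failure
  failureThreshold: 3    # Kubernetes default: restart after 3 consecutive failures
  successThreshold: 1    # Kubernetes default; must be 1 for liveness probes
```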

Example: Slow Server Example

containers:
  - name: demo-http-server
    image: quay.io/robertjscott/demo-http-server:0.1.1
    livenessProbe:
      httpGet:
        path: /alive
        port: 3000
      periodSeconds: 5
      timeoutSeconds: 1
      initialDelaySeconds: 45
    env:
      - name: STARTUP_DELAY_SECONDS
        value: '40'
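A fixed initialDelaySeconds has to be tuned to the slowest expected startup. Newer Kubernetes releases (enabled by default since 1.18) also offer a startupProbe, which holds off liveness checks until the application first responds. A hedged sketch, not part of the original talk, reusing the /alive endpoint; the failureThreshold of 12 is an assumed value that gives the 40 second startup up to 60 seconds:

```yaml
# Sketch: startupProbe as an alternative to initialDelaySeconds (assumes Kubernetes 1.18+).
containers:
  - name: demo-http-server
    image: quay.io/robertjscott/demo-http-server:0.1.1
    startupProbe:
      httpGet:
        path: /alive
        port: 3000
      periodSeconds: 5
      failureThreshold: 12   # assumed: 12 x 5s = up to 60s allowed for startup
    livenessProbe:           # only starts running once the startupProbe has succeeded
      httpGet:
        path: /alive
        port: 3000
      periodSeconds: 5
      timeoutSeconds: 1
    env:
      - name: STARTUP_DELAY_SECONDS
        value: '40'
```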

Readiness Probes Key Concept

containers:
  - name: demo-http-server
    image: quay.io/robertjscott/demo-http-server:0.1.1
    livenessProbe:
      httpGet:
        path: /alive
        port: 3000
      periodSeconds: 5
      timeoutSeconds: 1
      initialDelaySeconds: 45
    readinessProbe:
      httpGet:
        path: /ready
        port: 3000
      periodSeconds: 5
      timeoutSeconds: 1

Example: Mayhem Example

apiVersion: extensions/v1beta1
kind: Deployment
metadata:
  name: mayhem-deployment
spec:
  replicas: 5
  template:
    metadata:
      labels:
        app: mayhem
    spec:
      containers:
        - name: mayhem
          image: quay.io/robertjscott/mayhem:0.1.0

Resource Limits Key Concept

spec:
  containers:
    - name: mayhem
      image: quay.io/robertjscott/mayhem:0.1.0
      resources:
        requests:
          memory: 64Mi
          cpu: 125m
        limits:
          memory: 64Mi
          cpu: 125m

Liveness Probes
When these probes fail, Kubernetes attempts to restart the container.

Readiness Probes
Kubernetes does not send traffic to the container until these probes succeed.

Resource Limits
Without enforcing proper resource limits, a single rogue container can take down a node.
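One way to enforce limits rather than rely on every manifest setting them is a LimitRange, which applies defaults to containers that omit their own. A minimal sketch for the demo namespace, not taken from the talk; the object name is hypothetical and the values simply mirror the mayhem example:

```yaml
# Sketch: default requests and limits for containers in the demo namespace.
apiVersion: v1
kind: LimitRange
metadata:
  name: demo-container-defaults   # hypothetical name
  namespace: self-healing-k8s-demo
spec:
  limits:
    - type: Container
      defaultRequest:             # applied when a container sets no requests
        memory: 64Mi
        cpu: 125m
      default:                    # applied when a container sets no limits
        memory: 64Mi
        cpu: 125m
```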

Affinity and Anti-Affinity
Proper configuration can ensure your pods are deployed across availability zones or regions.

nodeAffinity:
  requiredDuringSchedulingIgnoredDuringExecution:
    nodeSelectorTerms:
      - matchExpressions:
          - key: failure-domain.beta.kubernetes.io/zone
            operator: In
            values:
              - us-east-1c
              - us-east-1d

preferredDuringSchedulingIgnoredDuringExecution:
  - weight: 1
    preference:
      matchExpressions:
        - key: beta.kubernetes.io/instance-type
          operator: In
          values:
            - m4.large
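The fragments above show node affinity only, and without their surrounding context. A sketch of where they sit in a pod spec, together with a pod anti-affinity rule that asks the scheduler to spread replicas across zones. The anti-affinity block and its weight are illustrative assumptions, not from the slides; newer clusters use the topology.kubernetes.io/zone label in place of the beta label shown here.

```yaml
# Sketch: affinity rules in context (the podAntiAffinity section is an assumed addition).
spec:
  affinity:
    nodeAffinity:
      requiredDuringSchedulingIgnoredDuringExecution:
        nodeSelectorTerms:
          - matchExpressions:
              - key: failure-domain.beta.kubernetes.io/zone
                operator: In
                values:
                  - us-east-1c
                  - us-east-1d
    podAntiAffinity:
      preferredDuringSchedulingIgnoredDuringExecution:
        - weight: 100                      # assumed weight
          podAffinityTerm:
            labelSelector:
              matchLabels:
                app: self-healing-k8s-demo
            # prefer zones that do not already run a pod with this label
            topologyKey: failure-domain.beta.kubernetes.io/zone
  containers:
    - name: demo-http-server
      image: quay.io/robertjscott/demo-http-server:0.1.1
```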

Where to go from here
• The Children's Illustrated Guide to Kubernetes
• Quickstart for Google Container Engine
• Setting up an HA Kubernetes Cluster in AWS with private topology with Kops 1.5.1
• KubeCon Videos

With proper configuration, Kubernetes services can heal themselves
@robertjscott | robertjscott.ca
