Slide 1

Slide 1 text

Monitoring Containers Correctly
Michael Kehoe
Staff Site Reliability Engineer
https://github.com/michael-kehoe/container-monitoring-workshop

Slide 2

Slide 2 text

Getting Started
• Set up your workshop platform:
  • https://app.strigo.io/event/QXDpmTiRAufQ4LBis
  • Token: F7C7
• Background slides: https://bit.ly/2NcEBQN
• Code repo: https://github.com/michael-kehoe/container-monitoring-workshop
• Please let me know ASAP if you’re having problems

Slide 3

Slide 3 text

Today’s agenda
1. Introductions
2. Container Primitives
3. What we’ll monitor
4. Cgroup interface file formats
5. Exercises

Slide 4

Slide 4 text

Today’s agenda: Exercises
• 100: CPU Basics
• 101: CPU Enhanced
• 102: CPU Advanced
• 200: Memory Basics
• 201: Memory Enhanced
• 300: IO Basics
• 400: PID

Slide 5

Slide 5 text

$ WHOAMI: Michael Kehoe
• Staff Site Reliability Engineer @ LinkedIn
• Production-SRE Team
• Funny accent = Australian + 4 years American
• Worked on:
  • Networks
  • Micro-services
  • Traffic Engineering
  • Databases

Slide 6

Slide 6 text

$ WHOAMI: Production-SRE Team @ LinkedIn
• Disaster Recovery – Planning & Automation
• Incident Response – Process & Automation
• Visibility Engineering – Making use of operational data
• Reliability Principles – Defining best practice & automating it

Slide 7

Slide 7 text

Container Primitives

Slide 8

Slide 8 text

Containers
• cgroups: limiting the resources that can be used by a process or set of processes
• Namespaces: isolating filesystem resources
• Copy on Write: implicit sharing or shadowing
• Linux Security Modules: locking down container privileges

Slide 9

Slide 9 text

Cgroup
• Abbreviation for ‘Control Groups’
• Provides:
  • Resource Limiting
  • Prioritization
  • Accounting
  • Control

Slide 10

Slide 10 text

What we’ll monitor

Slide 11

Slide 11 text

What we’ll monitor: CPU
• 100: Basic cgroup CPU utilization
• 101: Enhanced cgroup CPU utilization (with percentiles)
• 102: cgroup throttles

Slide 12

Slide 12 text

What we’ll monitor: MEMORY
• 200: Memory Basics
  • Cgroup utilization
• 201: Enhanced Memory Metrics

Slide 13

Slide 13 text

What we’ll monitor: DISK/NETWORK
• 300: Disk IO Monitoring

Slide 14

Slide 14 text

What we’ll monitor: PID
• 400: PID Utilization

Slide 15

Slide 15 text

Cgroup interface file formats

Slide 16

Slide 16 text

Cgroup interface file formats
https://www.kernel.org/doc/Documentation/cgroup-v2.txt
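The linked cgroup v2 document describes a handful of interface file layouts, including flat keyed (one `KEY VALUE` pair per line, as in cpu.stat) and nested keyed (`KEY SUB_KEY=VALUE ...` per line, as in io.stat). A minimal parsing sketch for those two layouts follows. The function names are illustrative and not from the workshop repo, and the flat-keyed parser assumes every value is an integer.

```python
def parse_flat_keyed(text: str) -> dict:
    """Parse a flat-keyed file: one 'KEY VALUE' pair per line."""
    result = {}
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        key, value = line.split()
        result[key] = int(value)  # assumption: values are integers
    return result


def parse_nested_keyed(text: str) -> dict:
    """Parse a nested-keyed file: 'KEY SUB=VAL SUB=VAL ...' per line."""
    result = {}
    for line in text.splitlines():
        parts = line.split()
        if not parts:
            continue
        key, pairs = parts[0], parts[1:]
        # Sub-values are kept as strings; callers convert as needed.
        result[key] = dict(pair.split("=", 1) for pair in pairs)
    return result


if __name__ == "__main__":
    print(parse_flat_keyed("usage_usec 1200\nuser_usec 800\nsystem_usec 400\n"))
    print(parse_nested_keyed("8:0 rbytes=4096 wbytes=0 rios=1 wios=0\n"))
```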

Slide 17

Slide 17 text

Exercises

Slide 18

Slide 18 text

100: CPU Monitoring
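One possible approach to basic CPU utilization, assuming cgroup v2: sample the cumulative `usage_usec` counter in cpu.stat twice and divide the delta by the elapsed time. The cgroup directory passed to `sample_cpu_percent` is a hypothetical path, and the function names are illustrative rather than taken from the workshop repo. On cgroup v1 the equivalent counter would be cpuacct.usage, in nanoseconds.

```python
import time
from pathlib import Path


def read_usage_usec(cpu_stat_text: str) -> int:
    """Extract the cumulative usage_usec counter from cpu.stat contents."""
    for line in cpu_stat_text.splitlines():
        key, _, value = line.partition(" ")
        if key == "usage_usec":
            return int(value)
    raise ValueError("usage_usec not found in cpu.stat")


def cpu_percent(prev_usec: int, cur_usec: int, interval_s: float) -> float:
    """CPU time used over the interval, where 100.0 means one full CPU."""
    if interval_s <= 0:
        raise ValueError("interval_s must be positive")
    return (cur_usec - prev_usec) / (interval_s * 1_000_000) * 100.0


def sample_cpu_percent(cgroup_dir: str, interval_s: float = 1.0) -> float:
    """Take two samples of cpu.stat and return utilization between them."""
    stat = Path(cgroup_dir) / "cpu.stat"  # cgroup_dir is a hypothetical path
    before = read_usage_usec(stat.read_text())
    time.sleep(interval_s)
    after = read_usage_usec(stat.read_text())
    return cpu_percent(before, after, interval_s)
```

A cgroup running on several cores can report more than 100 with this definition.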

Slide 19

Slide 19 text

101: Enhanced CPU Monitoring

Slide 20

Slide 20 text

Enhanced CPU Monitoring
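Percentiles over a window of utilization samples could be computed as sketched below. This uses the nearest-rank method, which is a choice made for illustration; the workshop exercise may define percentiles differently.

```python
import math


def percentile(samples, p):
    """Nearest-rank percentile: smallest sample with at least p% of data at or below it."""
    if not samples:
        raise ValueError("samples must not be empty")
    if not 0 < p <= 100:
        raise ValueError("p must be in (0, 100]")
    ordered = sorted(samples)
    # Multiply before dividing so whole-number ranks stay exact.
    rank = math.ceil(p * len(ordered) / 100)
    return ordered[rank - 1]


if __name__ == "__main__":
    cpu_samples = [12.0, 15.5, 11.0, 80.0, 14.2, 13.9, 95.0, 12.7, 13.1, 14.8]
    for p in (50, 95, 99):
        print(f"p{p}: {percentile(cpu_samples, p)}")
```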

Slide 21

Slide 21 text

102: CPU Advanced Monitoring

Slide 22

Slide 22 text

Advanced CPU Monitoring
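For throttling, cgroup v2 exposes `nr_periods`, `nr_throttled`, and `throttled_usec` in cpu.stat when a CPU limit is set. A rough sketch of turning two samples into a throttle ratio follows; the function name and the returned dictionary keys are hypothetical.

```python
def throttle_stats(prev: dict, cur: dict) -> dict:
    """Compare two parsed cpu.stat samples and summarize throttling between them."""
    periods = cur["nr_periods"] - prev["nr_periods"]
    throttled = cur["nr_throttled"] - prev["nr_throttled"]
    throttled_usec = cur["throttled_usec"] - prev["throttled_usec"]
    # Fraction of enforcement periods in which the cgroup hit its quota.
    ratio = throttled / periods if periods > 0 else 0.0
    return {
        "periods": periods,
        "throttled_periods": throttled,
        "throttled_usec": throttled_usec,
        "throttle_ratio": ratio,
    }


if __name__ == "__main__":
    before = {"nr_periods": 100, "nr_throttled": 10, "throttled_usec": 5000}
    after = {"nr_periods": 200, "nr_throttled": 35, "throttled_usec": 25000}
    print(throttle_stats(before, after))
```

A rising ratio alongside modest CPU utilization can indicate a quota that is too tight for bursty work.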

Slide 23

Slide 23 text

200: Memory Basics
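A minimal sketch of cgroup memory utilization, assuming cgroup v2, where memory.current holds usage in bytes and memory.max holds either a byte limit or the literal string `max` (no limit). The function name is illustrative.

```python
from typing import Optional


def memory_utilization(current_text: str, max_text: str) -> Optional[float]:
    """Return memory usage as a percentage of the limit, or None if unlimited."""
    limit = max_text.strip()
    if limit == "max":
        return None  # no limit configured, so a percentage is undefined
    limit_bytes = int(limit)
    if limit_bytes == 0:
        return None
    return int(current_text.strip()) / limit_bytes * 100.0


if __name__ == "__main__":
    print(memory_utilization("268435456\n", "1073741824\n"))
    print(memory_utilization("268435456\n", "max\n"))
```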

Slide 24

Slide 24 text

201: Memory Enhanced
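One enhanced metric that some container monitoring tools report is a "working set": current usage minus page cache the kernel could reclaim easily. The sketch below approximates it as memory.current minus the `inactive_file` entry of memory.stat under cgroup v2. Treat the formula as an assumption chosen for illustration, not necessarily the one used in the exercise.

```python
def working_set_bytes(current_bytes: int, memory_stat_text: str) -> int:
    """Approximate working set: usage minus inactive file-backed pages."""
    inactive_file = 0
    for line in memory_stat_text.splitlines():
        key, _, value = line.partition(" ")
        if key == "inactive_file":
            inactive_file = int(value)
            break
    # Clamp at zero in case the two files were read at slightly different times.
    return max(current_bytes - inactive_file, 0)


if __name__ == "__main__":
    stat = "anon 600\nfile 400\ninactive_file 300\nactive_file 100\n"
    print(working_set_bytes(1000, stat))
```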

Slide 25

Slide 25 text

300: Disk IO Basics
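Disk IO counters in cgroup v2 live in io.stat, one line per device keyed by `major:minor`, with cumulative fields such as `rbytes`, `wbytes`, `rios`, and `wios`. A sketch of converting two samples into per-second rates follows; the function names are hypothetical.

```python
def parse_io_stat(text: str) -> dict:
    """Parse io.stat into {device: {counter: int}}."""
    devices = {}
    for line in text.splitlines():
        parts = line.split()
        if not parts:
            continue
        pairs = (item.split("=", 1) for item in parts[1:])
        devices[parts[0]] = {key: int(value) for key, value in pairs}
    return devices


def io_rates(prev: dict, cur: dict, interval_s: float) -> dict:
    """Per-second rate of each counter for devices present in both samples."""
    if interval_s <= 0:
        raise ValueError("interval_s must be positive")
    rates = {}
    for device, counters in cur.items():
        if device not in prev:
            continue  # a new device has no baseline yet
        before = prev[device]
        rates[device] = {
            key: (value - before.get(key, 0)) / interval_s
            for key, value in counters.items()
        }
    return rates


if __name__ == "__main__":
    first = parse_io_stat("8:0 rbytes=1000 wbytes=2000 rios=10 wios=20\n")
    second = parse_io_stat("8:0 rbytes=5000 wbytes=2000 rios=14 wios=20\n")
    print(io_rates(first, second, 2.0))
```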

Slide 26

Slide 26 text

400: PID Monitoring
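PID utilization can be sketched the same way as memory, assuming the cgroup v2 pids controller: pids.current holds the number of processes in the cgroup and pids.max holds either a number or the literal `max`. The function name is illustrative.

```python
from typing import Optional


def pid_utilization(current_text: str, max_text: str) -> Optional[float]:
    """Return PID usage as a percentage of pids.max, or None if unlimited."""
    limit = max_text.strip()
    if limit == "max":
        return None  # no PID limit configured
    limit_count = int(limit)
    if limit_count == 0:
        return None
    return int(current_text.strip()) / limit_count * 100.0


if __name__ == "__main__":
    print(pid_utilization("50\n", "200\n"))
    print(pid_utilization("50\n", "max\n"))
```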
