D2-2 Jorge Salamero Sanz - You’re Monitoring Containers Wrong

Jorge Salamero Sanz @bencerillo @sysdig You are monitoring Docker wrong

@bencerillo / @sysdig How to Monitor Microservices on Docker? Apps
Infra Health Checks JVM/JMX Custom metrics Metrics Processing Unicorns, rainbows And cute dashboards

@bencerillo / @sysdig

@bencerillo / @sysdig % whoami Jorge Salamero Sanz <[email protected]> •
Working on OSS last 12 years • Working on monitoring last 3 years • Containers gamer @sysdig @bencerillo @sysdig

@bencerillo / @sysdig Agenda • Challenges of container infrastructures •
Traditional monitoring limitations • Best practices monitoring Microservices • Sysdig, container native monitoring & troubleshooting

@bencerillo / @sysdig Breaking traditional model Microservices and containers break
traditional monitoring and troubleshooting models

Traditional deployment Full host OS kernel systemd syslogd App services
MySQL Nginx OpenSSL Java App A App B App C Ops Devs

@bencerillo / @sysdig Application workflow Database App Cache backend middleware
frontend

@bencerillo / @sysdig The Docker revolution Servers Virtual Machines Containers
Unit: machine Orchestration: no Architecture: monolithic Unit: machine Orchestration: external Architecture: monolithic Unit: (micro)service Orchestration: native Architecture: distributed

Containerized deployment Full host OS kernel + Docker MySQL App
A Ops DevOps Nginx + OpenSSL App B Java 8.0 build XXX App C

@bencerillo / @sysdig … but in reality Database App Cache/Frontend
Computing node Computing node Computing node Computing node Computing node Computing node

@bencerillo / @sysdig Application workflow Database App Cache backend middleware
frontend

@bencerillo / @sysdig New organization structures Database Team App Developers
PaaS Team

@bencerillo / @sysdig Container monitoring New challenges: 1. How do
we get the metrics? 2. How do we shape all this amount of metrics? 3. Analysis and troubleshooting 4. Teams on Microservices infrastructure

@bencerillo / @sysdig 1. Metric collection • We Ǻ containers,
because: – are simple – are small – are isolated – less dependencies • … but they are an opaque blackbox

@bencerillo / @sysdig “Workarounds” Agent in the Docker container Agent
in the Kubernetes pod Export metrics through an external agent App Agent App Agent App Agent App App App 1. Complex instrumentation (x2 because just the monitoring) plus service monitoring configuration 2. Limited and pre-established metric collection (Docker API, etc)

@bencerillo / @sysdig Kernel instrumentation Kernel Docker Container 1 Container
2 Container 3 App App rkt LXC Sysdig Docker

@bencerillo / @sysdig Why this is cool? • Just one
instrumentation per host: – spawning or destroying a container is instrumentation-less • Full visibility: all the system calls: – automatic service discovery – all metrics collection (no filtering) – application monitoring without instrumentation (magic of decoding protocols)

@bencerillo / @sysdig Response time and HTTP monitoring

@bencerillo / @sysdig Remember... but in reality: Database App Cache/Frontend
Computing node Computing node Computing node Computing node Computing node Computing node

@bencerillo / @sysdig 2. Information aggregation • Infrastructure monitoring should
be transparent and automatic (no instrumentation no configuration) • You should handle your custom/biz metrics • All metrics should be tagged automatically • All metrics should be aggregated and segmented on a service level basis

@bencerillo / @sysdig Node topology

@bencerillo / @sysdig Service topology

@bencerillo / @sysdig Topology drill down

@bencerillo / @sysdig Orchestration platforms knows already...

@bencerillo @sysdig • New layers • Services / containers not
coupled with nodes • Number of instances can dynamically change • Killing a process most probably it’s fine • Understanding Kubernetes metadata is an absolute requirement • Tagging is cool, autodiscovery is better How Kubernetes changes monitoring?

@bencerillo @sysdig • Application • Services • Kubernetes deployment •
Kubernetes internals • Host / node The 5 layers to monitor in Kubernetes

@bencerillo / @sysdig Real example https://github.com/kubernetes/kubernetes/issues/1405 1

@bencerillo / @sysdig What if…?

@bencerillo / @sysdig 3. Analysis & troubleshooting • Imagine: strace
+ wireshark + htop + lsof + iostat + vmstat + * • Not available on containers, don’t understand namespaces • Metrics and logs can bite your in the ass, system calls have all the truth • Infrastructure gets more complex and volatile

@bencerillo / @sysdig

@bencerillo / @sysdig csysdig drill down in Kubernetes

@bencerillo / @sysdig Teams on Microservices Francesc Zacarias, SRE @
Spotify

@bencerillo / @sysdig 4. Teams by service • Tags/Metadata from
the orchestration platform, eg Kubernetes: – namespaces (dev, prod) – services, deployments, RCs, pods – custom tags • ACLs out of the box (dashboards, alerts, etc) on multi-tenant/PaaS scenarios

@bencerillo / @sysdig Sysdig • 100% open-source • 1M+ downloads
• Host analysis • sysdig.org • SaaS & on-prem • 200+ customers • Cluster analysis: Kubernetes, Openshift, Swarm, Mesos, etc. • Dashboards, alerts, events, teams • sysdig.com

Danke! @bencerillo @sysdig www.sysdig.org www.sysdig.com

D2-2 Jorge Salamero Sanz - You’re Monitoring Co...

D2-2 Jorge Salamero Sanz - You’re Monitoring Containers Wrong

More Decks by DevOpsDays Zurich

Other Decks in Technology

Featured

Transcript