Upgrade to Pro — share decks privately, control downloads, hide ads and more …

DevOpsDays Cuba 2016: Modern Monitoring

DevOpsDays Cuba 2016: Modern Monitoring

Author: Bridget Kromhout
Summary:

DevOpsDays Cuba

October 19, 2016
Tweet

More Decks by DevOpsDays Cuba

Other Decks in Technology

Transcript

  1. @bridgetkromhout “…measuring value, throughput, and performance… revenue rather than cost”

    The Art of Monitoring (2016) James Turnbull artofmonitoring.com
  2. @bridgetkromhout Why monitor? Two customers of monitoring with different needs.

    The Art of Monitoring (2016) James Turnbull artofmonitoring.com
  3. @bridgetkromhout Why monitor? The business: UX data for product &

    engineering Measure value delivered Two customers of monitoring with different needs. The Art of Monitoring (2016) James Turnbull artofmonitoring.com
  4. @bridgetkromhout Why monitor? The business: UX data for product &

    engineering Measure value delivered Information Technology: Visibility into state and failures Product & engineering decisions Measure success of projects Two customers of monitoring with different needs. The Art of Monitoring (2016) James Turnbull artofmonitoring.com
  5. @bridgetkromhout Method Type Focus Manual Checklists, simple scripts “Tribal knowledge”

    of things broken in the past Minimizing downtime, managing assets Reactive disk, CPU, memory checks Thresholds, alerting; updated after incidents Availability, assets, some customer experience Proactive Automatic; required for deployment Alerting includes context, automated remediation Application performance, business outcomes Monitoring Maturity Model The Art of Monitoring (2016) - James Turnbull - artofmonitoring.com
  6. @bridgetkromhout Better notifications Actionable Provide necessary context Prevent alert fatigue

    The Art of Monitoring (2016) - James Turnbull - artofmonitoring.com (in the brave new cloudy-with-a-chance-of-containers world)
  7. @bridgetkromhout 1979 (Version 7)
 1982 (BSD) 2004 2000 chroot FreeBSD

    jails Solaris Zones LXC 2008 A Brief History of Containers, Part 1 (before docker era)
  8. @bridgetkromhout A Brief History of Containers, Part 2 (docker common

    era) 2011 Cloud Foundry 2013 Docker 2014 Rocket (later rkt) 2015 Open Container Initiative
  9. @bridgetkromhout • cloud-based infrastructure • static checks or thresholds no

    longer scale • manual configuration no longer scales “cattle, not pets”
  10. @bridgetkromhout Events - changes in your environment Logs - diagnosis

    & troubleshooting Metrics - seeing state in real time for anomaly detection & pattern analysis
  11. @bridgetkromhout The Art of Monitoring (2016) James Turnbull artofmonitoring.com Open

    source & SaaS • choose TCP over UDP • configurable granularity • “push” vs “pull”
  12. @bridgetkromhout “Almost every task run under Borg contains a built-in

    HTTP server that publishes information about the health of the task and thousands of performance metrics” Large-scale cluster management at Google with Borg - Verma et al. 2015 “Almost every task run under Borg contains a built-in HTTP server that publishes information about the health of the task and thousands of performance metrics”
  13. @bridgetkromhout Security Pros & Cons Containers limit attack surface &

    emitters don’t need ports open but… Microservices move IPC to network transactions & complexity is distributed