Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Monitoring with Sensu Go

portertech
November 06, 2018

Monitoring with Sensu Go

Applications are complex systems. Their many moving parts, component and dependency services, may span any number of infrastructure technologies and platforms, from bare metal to serverless. As the number of services increases, teams responsible for them will naturally develop their own preferences, such as how they instrument their code or how and when they receive alerts. Sean will demonstrate how Sensu Go is designed to monitor these ever changing heterogeneous environments. Sensu Go is the next release of the open source monitoring framework, rewritten in Go, with new capabilities and reduced operational overhead. Sean will go over various patterns of data collection, including scraping Prometheus metrics, and show how Sensu enables self-service monitoring and alerting for service owners.

portertech

November 06, 2018
Tweet

More Decks by portertech

Other Decks in Programming

Transcript

  1. 32%

  2. check_mysql -H localhost -P 3360 Uptime: 798 Threads: 1 Questions:

    5 Slow queries: 0 Opens: 107 Flush tables: 1 Open tables: 26 Queries per second avg: 0.006|Connections=9c;;; Open_files=6;;; Open_tables=27;;; Qcache_free_memory=16760152;;; Qcache_hits=0c;;; Qcache_inserts=0c;;; Qcache_lowmem_prunes=0c;;; Qcache_not_cached=1c;;; Qcache_queries_in_cache=0;;; Queries=6c;;; Questions=4c;;; Table_locks_waited=0c;;; Threads_connected=1;;; Threads_running=1;;; Uptime=798c;;; Exit 0 (OK) Service Checks
  3. check_mysql -H localhost -P 3360 Can't connect to MySQL server

    on 'localhost' Exit 2 (CRITICAL) Service Checks
  4. { timestamp: 1516663186, entity: { … }, check: { command:

    “check_mysql -H ...” output: “Can’t connect ... ”, status: 2, … }, metrics: { ... } }
  5. check_mysql -H localhost -P 3360 Uptime: 798 Threads: 1 Questions:

    5 Slow queries: 0 Opens: 107 Flush tables: 1 Open tables: 26 Queries per second avg: 0.006|Connections=9c;;; Open_files=6;;; Open_tables=27;;; Qcache_free_memory=16760152;;; Qcache_hits=0c;;; Qcache_inserts=0c;;; Qcache_lowmem_prunes=0c;;; Qcache_not_cached=1c;;; Qcache_queries_in_cache=0;;; Queries=6c;;; Questions=4c;;; Table_locks_waited=0c;;; Threads_connected=1;;; Threads_running=1;;; Uptime=798c;;; Exit 0 (OK) Service Checks
  6. { timestamp: 1516663186, entity: { … }, check: { …

    }, metrics: { handlers: [influxdb], points: [{ name: mysql.connections, value: 9, tags: [ … ] }] } }
  7. • REST API (Agent & Backend) • Entity management •

    External checks • Metrics Events API
  8. { timestamp: 1516663186, entity: { name: leviathan, class: application, tags:

    [ … ], ... }, check: { … }, metrics: { ... } }
  9. { timestamp: 1516663186, entity: { … }, check: { output:

    “Backup failed ... ”, status: 2, ttl: 6h, … }, metrics: { ... } }
  10. { timestamp: 1516663186, entity: { … }, check: { …

    }, metrics: { handlers: [influxdb], points: [{ name: mysql.connections, value: 9, tags: [ … ] }] } }
  11. • Agent listeners (TCP & UDP) • Stats aggregation •

    Gauges, counters, etc. • Protocol enhancements (tags) StatsD
  12. { timestamp: 1516663186, entity: { … }, check: { …

    }, metrics: { handlers: [influxdb], points: [{ name: http.requests, value: 42, tags: [{ name: app, value: store }] }] } }