Upgrade to Pro — share decks privately, control downloads, hide ads and more …

TICK_Monitoring

praveen
February 15, 2017
240

 TICK_Monitoring

praveen

February 15, 2017
Tweet

Transcript

  1. Monitoring ? • Collecting, Processing, aggregating and displaying real-time quantitative

    data about a system, such as query counts and types, error counts and types, processing times and server life times.
  2. Why Monitoring ? • Know when things go wrong ◦

    To call human to prevent a business level issue • Be able to debug issues • Trends to see changes over time and drive technical and business decision. • To feed into other systems(Ex Automation, Security) • Alerting • Building dashboard
  3. First Principle of monitoring • Collecting the right data ▪

    Receive meaningful info ▪ Quickly investigate and get to the bottom of perf issues
  4. Monitoring Architectures • Services push metrics to the monitoring system

    • Monitoring system pulls metrics from services
  5. Overview of TICK stack • Quickly describe what time series

    data is • Describe what influxDB Data model. • Dive into internal component.
  6. Time series data is... A time series is a sequence

    of data points, typically consisting of successive measurements made from the same source over a time interval.
  7. Why can’t I just use any tradition database ? •

    Yes, you can. • But by this you will end up creating time-series database not solution to a monitoring problem.
  8. InfluxDB Basics • How do we represent points textually ?

    • Using Line protocol • Measurement Tagset Fieldset Timestamp ◦ cpu , host=H1 values=20 1445555009
  9. Telegraf • Example of Telgraf plugins • How to use

    Telegraf • Input and Output plugin architectures
  10. Telegraf Telegraf is an agent written in Go for collecting

    metrics from local and remote sources. - Designed for minimal footprint - Ingests metrics from - The host system - Common services - Third party API’s - Custom end-points - Write multiple output at the same time.
  11. What all things can be done by Telegraf • Inputs

    ◦ Gather local system metrics ◦ Status checks on processes and services ◦ Collect data from remote API’s over HTTP ◦ Test HTTP responsiveness ◦ Parse log file using patterns to collect metrics ◦ Run custom scripts in regular intervals • Outputs ◦ Converts metrics formats ◦ Buffer metrics ◦ Reroutes metrics ◦ Batches metrics
  12. Telegraf Plugins • Input plugins ◦ Services that Telegraf can

    collect data from ◦ | cpu, mem, disk, diskio, docker | • Output Plugins ◦ Services that Telegraf can write data to ◦ | InfluxDB, Graphite, Kafka, Datadog | • Service Plugins ◦ Services can that push data to Telegraf ◦ | TCP, UDP, statsd, kafka_consumer |