Monitoring

MONITORING Applications & Infrastructure

Sean Porter @PorterTech

FOCUS • MTTD - Mean Time To Detect • MTTR
- Mean Time To Repair

GOAL • MTTD - Mean Time To Detect • MTTR
- Mean Time To Repair REDUCE!

Let’s start with an application.

API GET /ping POST /contacts GET /contacts/:id PUT /contacts/:id DELETE
/contacts/:id

“You can't manage what you haven't measured”

Gather the data that we as developers & operators care
about.

EMIT & EXPOSE Instrumentation

EMIT & EXPOSE Instrumentation Log

EMIT & EXPOSE Instrumentation Log Storage

EMIT & EXPOSE Instrumentation Log Storage GET /stats

EMIT & EXPOSE Instrumentation Log Storage GET /stats Process Title
...

(fn [request] (let [start (System/currentTimeMillis) response (handler request) ﬁnish (System/currentTimeMillis)
time (- ﬁnish start)] ...

A few great libraries you should read. Metrics (JAVA) codahale/metrics
Metriks (Ruby) eric/metriks Folsom (Erlang) boundary/folsom

Let’s talk about Logs...

LOGS • Already being produced. • A log is a
stream of events. • Full of performance & usage indicators.

LOGS METRICS! • Already being produced. • A log is
a stream of events. • Full of performance & usage indicators.

“request :get /ping 200 (2ms)” { “request_method”: “get”, “request_uri”: “/ping”,
“response_status”: 200, “response_time”: 2 } OR

Parsing logs requires effort, let’s send metrics elsewhere.

STORAGE sock = TCPSocket.new(host, port) sock.puts “name value #{Time.now.to_i}” sock.close

Let’s get back to the application.

There is more to it ...

HAProxy There is a load balancer. One or more instances
of the application.

HAProxy There is a load balancer. One or more instances
of the application. MEMORY CPU DISK NETWORK

Know your application dependencies and understand their relationships.

Monitor all the way down to the resources they consume.

HAProxy MEMORY CPU DISK NETWORK /ping HAProxy

Think Unix toolchain.

SENSU “simple, malleable, and scalable” Nagios replacement.

SENSU • JSON conﬁguration. • Uses the Nagios check spec.
• Clients self-register. • Easy to scale out. sensu/sensu

LOGSTASH “collect logs, parse them, and store them for later
use”

LOGSTASH INPUTS File Syslog AMQP 0MQ ... FILTERS Grep Grok
Multiline Mutate ... OUTPUTS ES Graphite AMQP Nagios ... logstash/logstash

GRAPHITE “scalable realtime graphing” name value timestamp

• drawAsInﬁnite() • highestCurrent() • mostDeviant() • hitcount() • threshold()
GRAPHITE Many powerful functions() to analyze data. • derivative() • summarize() • sumSeries() • movingAverage() • holtWintersForecast()

Final words.

Sean Porter @PorterTech THANK YOU

Monitoring

Monitoring

More Decks by portertech

Other Decks in Programming

Featured

Transcript