"With microservices every outage is like a murder mystery" is a common complaint. But it doesn’t have to be! This talk gives an overview on how to monitor distributed applications and how common monitoring APIs can help with this effort. We dive into:
* Logs: Why structured logs should be the new norm and how to centralize them.
* Metrics: This includes both application metrics (exposed through REST APIs or JMX, for example) and system metrics, which resemble the classic ‘top’ output.
* Traces: The OpenTracing project is basically the gold standard of tracing at the moment. We use Sleuth to show how to trace requests through a distributed system and Zipkin to display how long each call takes.
This combination of logs, metrics, and traces is also called the holy trinity of monitoring.