CloudNative Days Spring 2021 Online
Istioと各種監視系OSS(Prometheus, Loki, Jaeger, Grafana)を連携させたObservability基盤を構築し、運用する中で得た知見を紹介します。
References
John Porcaro, "Observability (re)defined", 2019
Betsy Beyer, Chris Jones, Niall Richard Murphy, Jennifer Petoff, "Site Reliability Engineering", O'Reilly Media, Inc., 2017
Mike Julian, 松浦 隼人, "入門 監視 ―モダンなモニタリングのためのデザインパターン", O'Reilly Media, Inc., 2019
Cindy Sridharan, "Distributed Systems Observability", O'Reilly Media, Inc., 2018
Charity Majors, Liz Fong-Jones, George Miranda, "Observability Engineering", O'Reilly Media, Inc., 2022(Early Release)
Cindy Sridharan, "Monitoring in the time of Cloud Native", 2017
Istio Authors, "Istio", 2021 (accessed 2021-03)
Megan O’Keefe, "Istio by Example!", 2021 (accessed 2021-03)
"kube-prometheus", prometheus-operator, 2021 (accessed 2021-03)
"grafana-operator", integr8ly, 2021 (accessed 2021-03)
"Tempo Documentation", Grafana Labs, 2021 (accessed 2021-03)
"jaeger-operator", jaegertracing, 2021 (accessed 2021-03)
"Installation Guide", Kiali, 2021 (accessed 2021-03)
"operator-lifecycle-manager", operator-framework, 2021 (accessed 2021-03)
Benjamin H. Sigelman, etc., "Dapper, a Large-Scale Distributed Systems Tracing Infrastructure", Google Technical Report (2010)