Sloth の設定例 (1/2)
6
https://github.com/slok/sloth/blob/v0.10.0/examples/getting-started.yml
version: "prometheus/v1"
service: "myservice"
labels:
owner: "myteam"
repo: "myorg/myservice"
tier: "2"
slos:
- name: "requests-availability"
objective: 99.9
description: "Common SLO based on availability for HTTP request responses."
sli:
events:
error_query: sum(rate(http_request_duration_seconds_count{job="myservice",code=~"(5..|429)"}[{{.window}}]))
total_query: sum(rate(http_request_duration_seconds_count{job="myservice"}[{{.window}}]))
(snip…)
{{.window}} にはデフォルトで
5m, 30m, 1h, 2h, 6h, 1d, 3d, 30d が入る
生成される Rules に付与するラベルを指定
目標値を指定
total, error 条件となるクエリを定義