Agenda
● Motivation
● Non-goal
● How to respond to increase
● How to scale out
● Configuration
● Browse and Alerts
● Issue
Slide 4
Motivation
Slide 5
Address growing data
Slide 6
Non-goal
Slide 7
× Highly available
○ Data redundancy
× Long-term storage
Slide 8
How to respond
to an increase in
memory and storage usage
Slide 9
● Reduce data retention
● Lengthen scrape interval
● Reduce unnecessary metrics
● Scale up
● Scale out
● Remote Write/Storage
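Several of the options above map onto ordinary Prometheus settings. A minimal sketch, assuming illustrative values: the 60s interval, the 7d retention, and the remote endpoint URL are placeholders, not recommendations from this talk.

```yaml
# prometheus.yml (illustrative values only)
global:
  scrape_interval: 60s   # a longer interval stores fewer samples per series

# Retention is set with a command-line flag, not in this file:
#   prometheus --storage.tsdb.retention.time=7d

remote_write:
  - url: https://remote-storage.example.com/api/v1/write   # hypothetical endpoint
```

Reducing unnecessary metrics is done with relabeling rules on each scrape job, so it is not shown here.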
Slide 10
How to scale out
without remote storage
Slide 11
Prometheus is easy to
scale out
[Diagram: four Pods and two Prometheus instances]
Slide 12
Configuration
Slide 13
A. Per scrape rule
● For popular settings in Kubernetes
○ Container Metrics (cAdvisor)
○ Node Metrics
○ Application Metrics
[Diagram: cAdvisor, Node, Application]
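One way to read "per scrape rule" is that each Prometheus instance gets its own subset of scrape jobs. A rough sketch, assuming two instances with hypothetical file names, a node exporter exposed through a Service named `node-exporter`, and in-cluster service account credentials; the authentication details depend on the cluster.

```yaml
# prometheus-infra.yml (hypothetical file): container and node metrics
scrape_configs:
  - job_name: 'cadvisor'
    scheme: https
    metrics_path: /metrics/cadvisor      # cAdvisor metrics served by the kubelet
    authorization:
      credentials_file: /var/run/secrets/kubernetes.io/serviceaccount/token
    tls_config:
      ca_file: /var/run/secrets/kubernetes.io/serviceaccount/ca.crt
    kubernetes_sd_configs:
      - role: node
  - job_name: 'node'
    kubernetes_sd_configs:
      - role: endpoints
    relabel_configs:
      - source_labels: [__meta_kubernetes_service_name]
        action: keep
        regex: 'node-exporter'           # assumed Service name
---
# prometheus-apps.yml (hypothetical file): application metrics only
scrape_configs:
  - job_name: 'apps'
    kubernetes_sd_configs:
      - role: endpoints
    relabel_configs:
      - source_labels: [__meta_kubernetes_service_annotation_app_xxx_scrape]
        action: keep
        regex: 'true'
```

The `---` separator only marks the boundary between the two files; each instance loads a single document.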
Slide 14
A. Per scrape rule
● Application Metrics can be easily split
into multiple scrape rules
[Diagram: App A, App B]
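- job_name: 'app-xxx'
  kubernetes_sd_configs:
    - role: endpoints
  relabel_configs:
    # Keep only targets whose Service annotation has the value "true".
    # Without a regex, the default (.*) also matches an empty value and keeps every target.
    - source_labels: [__meta_kubernetes_service_annotation_app_xxx_scrape]
      action: keep
      regex: 'true'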
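For the keep rule to select a target, the Service behind it has to carry the matching annotation. Prometheus exposes each Service annotation as a `__meta_kubernetes_service_annotation_<name>` label, with characters that are not valid in label names replaced by underscores. A sketch, assuming a hypothetical annotation key `app-xxx/scrape`, a Service named `app-a`, and the common convention of setting the value to "true":

```yaml
apiVersion: v1
kind: Service
metadata:
  name: app-a                     # hypothetical Service
  annotations:
    app-xxx/scrape: "true"        # surfaces as ..._annotation_app_xxx_scrape
spec:
  selector:
    app: app-a
  ports:
    - name: metrics
      port: 8080                  # assumed metrics port
```

Giving each application group its own annotation key lets each scrape job, and therefore each Prometheus instance, pick up only its own Services.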
Slide 15
B. Per metric (not per time series)
● Same scrape target, but decide which
metrics to keep by metric name
metric_relabel_configs:
  # Relabel regexes are fully anchored, so the pattern must cover the whole metric name.
  - source_labels: [__name__]
    action: drop
    regex: 'container_fs_.*'
[Diagram: cAdvisor shown twice]
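A per-metric split can be sketched as two instances that scrape the same cAdvisor target and keep complementary sets of metric names. This assumes two hypothetical instances and uses the `container_fs_` prefix as the dividing line; authentication and TLS settings are omitted for brevity.

```yaml
# Instance 1 (hypothetical): all cAdvisor metrics except filesystem metrics
scrape_configs:
  - job_name: 'cadvisor'
    metrics_path: /metrics/cadvisor
    kubernetes_sd_configs:
      - role: node
    metric_relabel_configs:
      - source_labels: [__name__]
        action: drop
        regex: 'container_fs_.*'
---
# Instance 2 (hypothetical): only the filesystem metrics
scrape_configs:
  - job_name: 'cadvisor'
    metrics_path: /metrics/cadvisor
    kubernetes_sd_configs:
      - role: node
    metric_relabel_configs:
      - source_labels: [__name__]
        action: keep
        regex: 'container_fs_.*'
```

Both instances still fetch the full response from the target, because `metric_relabel_configs` is applied after the scrape and before ingestion. The saving is in memory and storage, not in scrape traffic.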