- 'afs: Lost contact with file server @IPvANY:appacct.server_ip@ in cell @HOSTNAME:appacct.cell@)' matching rule afs: Lost contact with file server \ 10.0.104.125 in cell my.cell warning event id: b82b8d55-6060-4857-bc2e-d3ce5f4fd082 context_id: 'openafs-${appacct.cell}-${appacct.server_ip}' context_scope: program patterns: - 'afs: file server @IPvANY:appacct.server_ip@ in cell @HOSTNAME:appacct.cell@ is back up' matching rule afs: file server 10.0.104.125 in cell \ my.cell is back up self-heal event 7 . 5
HOST_FROM: '${afs.server_ip}' PROGRAM: 'openafs-lostcontact/${afs.cell}-${afs.server_ip}' state: warning rule: 34a012fc-f964-4b85-a5cb-066ca2efa54b trigger: timeout Correlation afs: Lost contact with file server \ 10.0.104.125 in cell my.cell warning event afs: file server 10.0.104.125 in cell \ my.cell is back up self-heal event + no timeout generated event ↓ 7 . 7
round- robin archives online time-based downsampling write data to Elasticsearch rotate time-based indices using ES aliases transparent access to the data save disk space and keep queries fast ~ ILM for spectrum scale ~ continuous queries for influxdb samplerr 7 . 12
1 0 6 4 … t + 3 0 0 5 samplerr 20s avg 60s min avg max 5m min avg max 1h min avg max cust 100GB delete after 1day 100GB delete after 2 days 100GB delete after 1 week 100GB delete after 10 years