Slide 17
Slide 17 text
What else to track to understand causes?
Servers: CPU load, RAM usage, IO, average load, storage usage, …
Web: request count, processing time, queuing time, …
Queued jobs: jobs count, processing times, error rates, …
Databases: queries count, processing time, returned rows count, …
External APIs: requests count, processing times, error rates, …
Network: latency, DNS lookup time, throughput, package drop rates, …
Custom things: different metrics for every integration or whatever
Business metrics: number of users, products, orders, , …