Slide 4
Systematic performance troubleshooting
For a systematic, deterministic troubleshooting drilldown:
■ Avoid guesswork
● Measure -> Understand -> Fix
■ App/service request latency measurement is just the 1st step
● But then what? Why the high latency in a database, webserver, app?
● How to drill down into app thread, DB connection-level CPU/wait time, with OS kernel visibility too?
■ You cannot extract a single request's, thread's, or connection's metrics from systemwide averages
● From OS systemwide utilization averages (sar, vmstat) to a specific thread?!
● From DB-wide utilization & wait metrics (total CPU usage of a shared SQL statement) to a single execution?
Today I'll focus entirely on this
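To make the gap between systemwide averages and a single thread concrete, here is a minimal sketch that reads per-thread state and CPU time on Linux from /proc/<pid>/task/<tid>/stat. It is an illustration, not a tool from this talk: the helper names parse_task_stat and sample_threads are hypothetical, and the field positions follow the layout documented in proc(5).

```python
import os


def parse_task_stat(line):
    """Parse one /proc/<pid>/task/<tid>/stat line into a small dict.

    Hypothetical helper for illustration. Field positions follow proc(5):
    field 3 is the state, field 14 is utime, field 15 is stime.
    """
    # The command name sits in parentheses and may itself contain spaces
    # or ')', so split on the first '(' and the last ')'.
    lparen = line.index("(")
    rparen = line.rindex(")")
    tid = int(line[:lparen])
    comm = line[lparen + 1:rparen]
    rest = line[rparen + 1:].split()  # rest[0] is field 3 (state)
    return {
        "tid": tid,
        "comm": comm,
        "state": rest[0],              # R, S, D, ...
        "utime_ticks": int(rest[11]),  # field 14: user-mode CPU, clock ticks
        "stime_ticks": int(rest[12]),  # field 15: kernel-mode CPU, clock ticks
    }


def sample_threads(pid):
    """Return one sample per thread of `pid` (Linux only)."""
    task_dir = f"/proc/{pid}/task"
    samples = []
    for tid in os.listdir(task_dir):
        try:
            with open(f"{task_dir}/{tid}/stat") as f:
                samples.append(parse_task_stat(f.read()))
        except (FileNotFoundError, ProcessLookupError):
            continue  # the thread exited between listdir() and open()
    return samples


if __name__ == "__main__":
    # Only attempt live sampling where a Linux-style /proc is mounted.
    if os.path.isdir("/proc/self/task"):
        for thread in sample_threads(os.getpid()):
            print(thread)
```

Sampling this repeatedly and diffing the tick counters per thread gives a thread-level view that sar or vmstat averages cannot provide.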