little and apps are going down there We need to deal with unknown unknowns We can’t know everything Things can be perceived differently by observers Everything is broken for the users but seems ok to you Why do we need Observability?
Java Agent - Standalone Use of the JMX Exporter Java agent is strongly encouraged due to the complex application RMI configuration required when running the Standalone JMX Exporter.
“How bad is it?” - Aggregated, Cheap, Compact, Efficient - But! High Cardinality! 🧐🤔🤨😬😱🤐🧨💀 - Time Series: How does data change over time? - Counter, Gauge, Timer, etc.
infrastructure events • Purpose: troubleshooting of the system, health monitoring • Content: audit messages, metrics, low-level events Business logs • Focus: business logics and flows • Purpose: Track high level user actions and flow, troubleshoot high-level flows • Content: user activities, transaction, etc Most probably you need both!
idea? - Why watching only TP95 (or TP99) is not a good idea? - Why should you measure max? - Why avg(TP95) does not make sense? - What data can be high cardinality? - What problems can it cause in metrics?