Slide 1

Slide 1 text

Observability for Modern JVM Applications
Jonatan Ivanov, 2024-10-01

Slide 2

Slide 2 text

About Me
● Spring Team
● Micrometer
● Spring Cloud, Spring Boot
● Spring Observability Team
● Seattle Java User Group
● develotters.com
● @jonatan_ivanov

Slide 3

Slide 3 text

Gauge the audience
How many people are using:
● Observability in production?
● Spring Boot 3?
● Micrometer?
● Prometheus?
● OpenTelemetry?

Slide 4

Slide 4 text

What is Observability?

Slide 5

Slide 5 text

Various Opinions
● 3 pillars: Logging, Metrics, Distributed Tracing
● 4 pillars: + Events/Lineage(?)/Context/Metadata
● 6 pillars: + Profiles + Exceptions
● Arbitrary Wide Events, Signals
But what about:
● /health, /info, etc.
● Service Registry/Discoverability, API Discoverability

Slide 6

Slide 6 text

What is Observability?
How well we can understand the internals of a system based on its outputs
(Providing meaningful information about what happens inside)
(Data about your app)

Slide 7

Slide 7 text

Why do we need Observability?

Slide 8

Slide 8 text

Why do we need Observability?
Today's systems are increasingly complex (cloud)
(Death Star Architecture, Big Ball of Mud)

Slide 9

Slide 9 text

Why do we need Observability?
● Environments can be chaotic: you turn a knob here a little and apps are going down there
● We need to deal with unknown unknowns: we can’t know everything
● Things can be perceived differently by observers: everything is broken for the users but seems ok to you

Slide 10

Slide 10 text

Why do we need Observability? (business perspective)
● Reduce lost revenue from production incidents
● Lower mean time to recovery (MTTR)
● Require less specialized knowledge
● Shared method of investigating across system
● Quantify user experience
Don't guess, measure!

Slide 11

Slide 11 text

Why do we need Observability? (Continuous Improvement)
Want to improve something? Measure it first!
● Resource utilization (number of instances, cpu, ram, io, etc.)?
● Throughput/latency (max.) patterns?
● Deployment frequency?
● Time to go live?
● Time to troubleshoot/recover?
● How often are you paged?

Slide 12

Slide 12 text

Why do we need Observability? (Advanced Capabilities)
● Chaos Engineering
● Anomaly Detection
● Feature flags
● A/B Testing
● Auto-tuning
● Adaptive Apps

Slide 13

Slide 13 text

Logging Metrics Distributed Tracing

Slide 14

Slide 14 text

Logging - Metrics - Distributed Tracing
● Logging: What happened (why)? Emitting events
● Metrics: What is the context? Aggregating data
● Distributed Tracing: Why did it happen? Recording causal ordering of events

Slide 15

Slide 15 text

Examples
Latency
● Logging (What?): Processing took 140ms
● Metrics (Context?): P99.999: 140ms, Max: 150ms
● Distributed Tracing (Why?): DB was slow (a lot of data was requested)
Error
● Logging (What?): Processing failed (stacktrace?)
● Metrics (Context?): The error rate is 0.001/sec, 2 errors in the last 30 minutes
● Distributed Tracing (Why?): DB call failed (invalid input)
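
The metrics figures in these examples (a percentile, a max, an error rate) are aggregates over many individual measurements. As a rough illustration, here is a minimal sketch in plain Java with no metrics library. The class `LatencySummary` and the sample values are made up for this note, and the nearest-rank percentile used here is only one of several definitions; a real metrics backend may compute percentiles differently (for example from histogram buckets).

```java
import java.util.Arrays;

// Hypothetical helper for illustration; not part of Micrometer or any library.
class LatencySummary {

    // Nearest-rank percentile: the smallest recorded value such that at least
    // p percent of all recorded values are less than or equal to it.
    static long percentile(long[] latenciesMs, double p) {
        long[] sorted = latenciesMs.clone();
        Arrays.sort(sorted);
        int rank = (int) Math.ceil(p / 100.0 * sorted.length);
        return sorted[Math.max(rank, 1) - 1];
    }

    // Largest recorded value.
    static long max(long[] latenciesMs) {
        return Arrays.stream(latenciesMs).max().getAsLong();
    }

    // Average number of errors per second over the observed window.
    static double errorRatePerSecond(long errors, long windowSeconds) {
        return (double) errors / windowSeconds;
    }

    public static void main(String[] args) {
        long[] latenciesMs = {20, 25, 30, 35, 40, 45, 50, 60, 140, 150}; // made-up samples
        System.out.println("P50: " + percentile(latenciesMs, 50) + "ms");
        System.out.println("Max: " + max(latenciesMs) + "ms");
        // 2 errors in 30 minutes is roughly 0.001 errors per second
        System.out.println("Errors/sec: " + errorRatePerSecond(2, 30 * 60));
    }
}
```

The last line shows how the two error figures on the slide relate: 2 errors over 1800 seconds is about 0.0011 errors per second.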

Slide 16

Slide 16 text

DEMO 🍵 github.com/jonatan-ivanov/teahouse

Slide 17

Slide 17 text

Architecture
[Diagram: 💻, Tea Service, Tealeaf Service, Water Service, Tealeaf DB, Water DB]

Slide 18

Slide 18 text

● spring-boot-starter-web
● spring-boot-starter-data-jpa
● spring-cloud-starter-openfeign
● spring-boot-starter-actuator (micrometer-observation)
● micrometer-registry-prometheus
● micrometer-tracing-bridge-brave + zipkin-reporter-brave
● net.ttddyy.observation:datasource-micrometer-spring-boot

Slide 19

Slide 19 text

Let’s make some tea! 🍵

Slide 20

Slide 20 text

by Kenneth Kousen

Slide 21

Slide 21 text

No content

Slide 22

Slide 22 text

No content

Slide 23

Slide 23 text

[Diagram: correlating logs, metrics, and traces through TraceID, Exemplars, and Tags]

Slide 24

Slide 24 text

Logging With JVM/Spring

Slide 25

Slide 25 text

Logging with JVM/Spring: SLF4J + Logback
SLF4J with Logback comes pre-configured
● SLF4J (Simple Logging Façade for Java): simple API for logging libraries
● Logback: natively implements the SLF4J API
If you want Log4j2 instead of Logback:
● Exclude spring-boot-starter-logging
● Add spring-boot-starter-log4j2

Slide 26

Slide 26 text

Payload, Access, GC logs
Payload logs:
● Logbook + logbook-spring-boot-starter (auto-configured)
Access logs:
● server.tomcat.accesslog.enabled=true
● server.jetty.accesslog.enabled=true
● server.undertow.accesslog.enabled=true
GC logs:
● JVM args

Slide 27

Slide 27 text

Metrics With JVM/Spring

Slide 28

Slide 28 text

Metrics with JVM/Spring: Micrometer
● Dimensional Metrics library on the JVM
● Like SLF4J, but for metrics
● API is independent of the configured metrics backend
● Supports many backends
● Comes with spring-boot-actuator
● Spring projects are instrumented using Micrometer
● Many third-party libraries use Micrometer
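
To make "dimensional" concrete: a dimensional metric is identified by its name plus a set of key-value tags, so one metric name can fan out into many separately queryable series. The following is a conceptual sketch in plain Java only. `DimensionalCounters` and the metric and tag names are hypothetical, and this is not how Micrometer is implemented or what its API looks like.

```java
import java.util.Map;
import java.util.TreeMap;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.LongAdder;

// Conceptual sketch only: a hypothetical class, not the Micrometer API.
class DimensionalCounters {

    // One counter per unique combination of metric name and tags.
    private final Map<String, LongAdder> series = new ConcurrentHashMap<>();

    // Builds a series key; the TreeMap sorts tags so their order does not matter.
    private static String id(String name, Map<String, String> tags) {
        return name + new TreeMap<String, String>(tags);
    }

    void increment(String name, Map<String, String> tags) {
        series.computeIfAbsent(id(name, tags), key -> new LongAdder()).increment();
    }

    long count(String name, Map<String, String> tags) {
        LongAdder adder = series.get(id(name, tags));
        return adder == null ? 0 : adder.sum();
    }

    public static void main(String[] args) {
        DimensionalCounters counters = new DimensionalCounters();
        counters.increment("http.requests", Map.of("uri", "/tea", "status", "200"));
        counters.increment("http.requests", Map.of("uri", "/tea", "status", "200"));
        counters.increment("http.requests", Map.of("uri", "/tea", "status", "500"));
        // Same metric name, different tag values: two independent series.
        System.out.println(counters.count("http.requests", Map.of("uri", "/tea", "status", "200")));
        System.out.println(counters.count("http.requests", Map.of("uri", "/tea", "status", "500")));
    }
}
```

Because every distinct tag combination creates its own series, tag values with unbounded variety (such as user IDs) are usually kept out of metrics.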

Slide 29

Slide 29 text

Supported metrics backends/formats/protocols
AppOptics, Atlas, Azure Monitor, CloudWatch (AWS), Datadog, Dynatrace, Elastic, Ganglia, Graphite, Humio, InfluxDB, JMX, KairosDB, New Relic, (/actuator/metrics), OpenTSDB, OTLP, Prometheus, SignalFx, Stackdriver (GCP), StatsD, Wavefront (VMware)

Slide 30

Slide 30 text

Tracing With JVM/Spring

Slide 31

Slide 31 text

Distributed Tracing with JVM/Spring
● Boot 2.x: Spring Cloud Sleuth
● Boot 3.x: Micrometer Tracing (Sleuth w/o Spring dependencies)
Provide an abstraction layer on top of tracing libraries
● Brave (OpenZipkin), “default”
● OpenTelemetry (CNCF), “experimental”
Instrumentation for Spring Projects, 3rd party libraries, your app
Support for various backends
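
What tracing libraries record, at its core, is a tree of spans: every span in one request flow shares a trace ID, and each child span points at its parent, which is how the causal ordering of events is preserved across services. Below is a conceptual sketch in plain Java. `SketchSpan` is a hypothetical type invented for this note; it is not Brave's, OpenTelemetry's, or Micrometer Tracing's span, and real IDs are not UUID strings.

```java
import java.util.UUID;

// Conceptual sketch only: a hypothetical type, not a real tracing library's span.
class SketchSpan {
    final String traceId;  // shared by every span that belongs to one request flow
    final String spanId;   // unique per unit of work
    final String parentId; // null for the root span
    final String name;

    private SketchSpan(String traceId, String spanId, String parentId, String name) {
        this.traceId = traceId;
        this.spanId = spanId;
        this.parentId = parentId;
        this.name = name;
    }

    // Simplification: real tracers use compact hex IDs, not UUID strings.
    private static String newId() {
        return UUID.randomUUID().toString();
    }

    // Starts a new trace.
    static SketchSpan root(String name) {
        return new SketchSpan(newId(), newId(), null, name);
    }

    // Continues the same trace; the parent link records what caused this span.
    SketchSpan child(String name) {
        return new SketchSpan(traceId, newId(), spanId, name);
    }

    public static void main(String[] args) {
        SketchSpan tea = SketchSpan.root("tea-service: make tea");
        SketchSpan water = tea.child("water-service: fetch water");
        System.out.println("same trace: " + water.traceId.equals(tea.traceId));
        System.out.println("caused by tea span: " + water.parentId.equals(tea.spanId));
    }
}
```

In a real system the trace ID and parent span ID travel between services in request headers, which the instrumentation handles for you.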

Slide 32

Slide 32 text

Observation API

Slide 33

Slide 33 text

You want to instrument your application…
● Add Logs (application logs)
● Add Metrics
● Add Distributed Tracing

Slide 34

Slide 34 text

Observation API basic usage example
    Observation observation = Observation.start("talk", registry);
    try { // TODO: scope
        doSomething(); // ← This is what we’re observing
    }
    catch (Exception exception) {
        observation.error(exception);
        throw exception;
    }
    finally { // TODO: attach tags (key-value)
        observation.stop();
    }

Slide 35

Slide 35 text

Configuring an ObservationHandler (without Boot)
    ObservationRegistry registry = ObservationRegistry.create();
    registry.observationConfig()
        .observationHandler(new MetricsHandler(...))
        .observationHandler(new TracingHandler(...))
        .observationHandler(new LoggingHandler(...))
        .observationHandler(new AuditEventHandler(...));
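
The idea behind registering several handlers is that the code is instrumented once and each handler turns the same start, error, and stop events into its own output (metrics, spans, logs, and so on). The following is a conceptual sketch of that pattern in plain Java. All type names here (`SketchHandler`, `SketchObservation`, `RecordingHandler`, `ObservationSketchDemo`) are hypothetical and far simpler than Micrometer's actual `Observation` and `ObservationHandler` types.

```java
import java.util.ArrayList;
import java.util.List;

// Conceptual sketch only: hypothetical types, much simpler than Micrometer's.
interface SketchHandler {
    void onStart(String name);
    void onError(String name, Exception error);
    void onStop(String name);
}

// Notifies every registered handler about each lifecycle event.
class SketchObservation {
    private final String name;
    private final List<SketchHandler> handlers;

    private SketchObservation(String name, List<SketchHandler> handlers) {
        this.name = name;
        this.handlers = handlers;
    }

    static SketchObservation start(String name, List<SketchHandler> handlers) {
        SketchObservation observation = new SketchObservation(name, handlers);
        handlers.forEach(handler -> handler.onStart(name));
        return observation;
    }

    void error(Exception error) {
        handlers.forEach(handler -> handler.onError(name, error));
    }

    void stop() {
        handlers.forEach(handler -> handler.onStop(name));
    }
}

// Stand-in for a metrics or tracing handler: it only records what it was told.
class RecordingHandler implements SketchHandler {
    private final String kind;
    private final List<String> events;

    RecordingHandler(String kind, List<String> events) {
        this.kind = kind;
        this.events = events;
    }

    public void onStart(String name) { events.add(kind + ":start:" + name); }
    public void onError(String name, Exception error) { events.add(kind + ":error:" + name); }
    public void onStop(String name) { events.add(kind + ":stop:" + name); }
}

class ObservationSketchDemo {
    // Runs one failing unit of work and returns everything the handlers recorded.
    static List<String> run() {
        List<String> events = new ArrayList<>();
        List<SketchHandler> handlers = List.of(
                new RecordingHandler("metrics", events),
                new RecordingHandler("tracing", events));
        SketchObservation observation = SketchObservation.start("talk", handlers);
        try {
            throw new IllegalStateException("boom"); // stands in for the observed work
        } catch (IllegalStateException exception) {
            observation.error(exception);
        } finally {
            observation.stop();
        }
        return events;
    }

    public static void main(String[] args) {
        run().forEach(System.out::println);
    }
}
```

Adding another kind of output means registering one more handler; the instrumented code itself does not change.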

Slide 36

Slide 36 text

Observation API shortcuts
    Observation.createNotStarted("talk", registry)
        .lowCardinalityKeyValue("conf", "dev2next")
        .highCardinalityKeyValue("uid", userId)
        .observe(this::talk);
    @Observed

Slide 37

Slide 37 text

Spring Boot Actuator
Health Endpoint
● Is my app healthy (k8s probes)?
● Dependencies?
Info Endpoint
● Build Info (name, version, git commit, build time): Boot 2.x
● Java Info (JRE/JVM name, version, vendor): Boot 2.6
● OS Info (name, arch, version): Boot 2.7
● Process Info (pid, owner, cpus, memory): Boot 3.3, 3.4
● Dependencies (SBOM): Boot 3.3
● TLS Info (subject, issuer, validity): Boot 3.4
● Cloud Info (instanceId, region, account)
● GC Info, Timezone, Current Time, Language, Start Time, Uptime

Slide 38

Slide 38 text

Service Discoverability, API Discoverability
● How many service instances do we have?
● Where? (host/ip, port, instanceId, region, account)
● What versions are deployed? (by environment)
Eureka, Spring Boot Admin
How to call/use them?
● Spring REST Docs
● Spring Cloud Contract + Pact Broker
● Swagger / OpenAPI + ReDoc
● Spring HATEOAS + HAL Explorer

Slide 39

Slide 39 text

Thank you!
● @jonatan_ivanov
● develotters.com
● github.com/jonatan-ivanov/teahouse (branch: 2024-dev2next)
● slack.micrometer.io