Implementing Distributed Tracing 'Like a Boss' in your Apache Kafka Deployments

by Ricardo Ferreira

Slide 1

Slide 1 text

Implementing Distributed Tracing 'Like a Boss' in your Apache Kafka Ricardo Ferreira, Developer Advocate @riferrei

Slide 2

Slide 2 text

About Us: ● Ricardo Ferreira ○ Developer Advocate @ Confluent ○ Ex-Oracle, Red Hat, IONA Tech ○ Currently ~10% Smart, ~90 Dumb ○ https://riferrei.net ● Echo Dot (Alexa) ○ The voice behind Amazon ○ Ex-Raspberry Pi, Arduino, T3? ○ Currently ~90% Smart, ~10 Dumb @riferrei @alexa99

Slide 3

Slide 3 text

A long time ago in a galaxy far, far away…

Slide 4

Slide 4 text

How We Did IT Monitoring ● Each system would have their very own monitoring system. ● Developers should not worry about monitoring. This was supposed to be "IT stuff". ● Architecture was deemed OK if all systems were showing green. ● Monitoring systems were not standard. Each system could potentially use a different vendor. @riferrei @jeeconf

Slide 5

Slide 5 text

OK… What was the Problem?

Slide 6

Slide 6 text

System IS NOT Working! System IS working! Time spent with troubleshooting. Days, weeks, months! @riferrei @jeeconf

Slide 7

Slide 7 text

System IS NOT Working! System IS working! Time spent with troubleshooting. Days, weeks, months! Lesson about Patience: That was only One Incident! @riferrei @jeeconf

Slide 8

Slide 8 text

Solution: Observability!

Slide 9

Slide 9 text

No content

Slide 10

Slide 10 text

Three Pillars of Observability ● Logs: stream of events related to transactions of one application. ● Metrics: Statistics and numeric measures aggregated by time. ● Distributed Tracing: complete view of what happened within a single transaction. It answers: ○ Which services were involved? ○ If it was slow, who caused that? ○ If failed, who actually failed? @riferrei @jeeconf

Slide 11

Slide 11 text

Why Distributed Tracing Became So Relevant?

Slide 12

Slide 12 text

Evolution of Concurrency ● No Concurrency at all ○ Ex: Apache HTTP Server ● Basic Concurrency ○ Ex: Multi-threaded Applications ● Async Concurrency ○ Ex: Actor-based Programming ● Distributed Concurrency ○ Ex: μServices Architecture Style @riferrei @jeeconf

Slide 13

Slide 13 text

Distributed Tracing Today ● There are many distributed tracing technologies available. ● Standards are getting created to ensure a single programming model for each μService. ● Deployment mechanisms such as Kubernetes are also taking care of that automatically. ● Network proxies such as Service Meshes are also handling this. @riferrei @jeeconf

Slide 14

Slide 14 text

Why Apache Kafka? ● Apache Kafka is becoming the de- facto standard to handle data. ● Prediction? It will be the central nervous system of any company. ● μServices in general already use Kafka to exchange messages and keep their data stores in-sync. ● With event streaming becoming even more popular, the Kafka adoption tend to grow even more. @riferrei @jeeconf

Slide 15

Slide 15 text

Implementing Distributed Tracing in Apache Kafka

Slide 16

Slide 16 text

OpenTracing for the Rescue ● Monitor distributed transactions ● Glue your Java code with Kafka ● Trace producers and consumers ● Decorators and Interceptors API ● Pluggable tracers via classpath via the tracer resolver library. ● Dense documentation available. https://github.com/opentracing-contrib/java-kafka- client @riferrei @jeeconf

Slide 17

Slide 17 text

Extension for Bundled JVMs ● Custom interceptors to handle distributed tracing in JVM-based applications that you don't have the source-code. ● Examples: REST Proxy, Kafka Connect and KSQL Servers. ● Allows you to group μServices based on the topics they use. https://github.com/riferrei/jaeger-tracing-support @riferrei @jeeconf

Slide 18

Slide 18 text

DEMO

Slide 19

Slide 19 text

19 Free Books by Confluent: https://www.confluent.io/apache-kafka-stream-processing-book-bundle

Slide 20

Slide 20 text

20 Let’s Keep in Touch: @riferrei riferrei riferrei [email protected] https://riferrei.net https://cnfl.io/slack Confluent is Hiring: https://www.confluent.io/careers

Slide 21

Slide 21 text

No content