The Importance of Observability for Kafka-based applications with Zipkin

The Importance of Observability for Kafka-based applications with Zipkin [email protected]

Jorge Quilcate-Otoya @jeqo89 github.com/jeqo github.com/sysco-middleware Middleware team at SYSCO AS
focused on Data-Integration and Distributed Tracing

SYSCO AS Middleware department: Integration and Data Engineering We are
hiring! Partners: github.com/sysco-middleware sysco.no/

Agenda Event-Driven Applications and Kafka Observability and Distributed Tracing Simulating
Observability tools

Apache Kafka “Apache Kafka® is a distributed Streaming platform.”

Event-Driven Applications and Kafka Amazonas river

Event-Driven Architectural Style https://docs.microsoft.com/en-us/azure/architecture/guide/architecture-styles/event-driven

Service Collaboration and Dataflow Svc Svc Svc Svc Orchestration Event
Bus Svc Svc Svc Svc Choreography

https://www.slideshare.net/ConfluentInc/etl-is-dead-long-live-streams Kafka Ecosystem

Observability and Distributed Tracing Titicaca Lake

What is Observability? “In control theory, observability is a measure
of how well internal states of a system can be inferred from knowledge of its external outputs.” - Wikipedia

Observability is for *Unknown Unknowns* https://twitter.com/mipsytipsy/status/963956028940234752

Observability methods

Span = execution of a task Trace = tree of
spans Context Propagation = pass trace context between distributed components (e.g. HTTP Headers, Kafka-record Headers) Distributed Tracing Concepts

Demo Lab 01: Hello world to Distributed Tracing • Tracing
concepts • Brave instrumentation https://github.com/jeqo/talk-kafka-zipkin#lab-1-hello-world-distributed-tracing

Adoption approaches Annotation-based - Part of your code - Instrument
libraries first - Add custom spans on-demand - Check benchmarks Black-box

How does it work? Svc 0 Svc 1 tracer tracer
Collector Tracing System Tracing DB

Zipkin Architecture

Demo Lab 02: Tracing Kafka-based applications • Kafka-clients and Kafka-streams
instrumentation • Kafka Interceptors for Kafka Connectors https://github.com/jeqo/talk-kafka-zipkin#lab-02-twitter-kafka-based-application

Adoption approaches Annotation-based - Part of your code - Instrument
libraries first - Add custom spans on-demand - Check benchmarks Black-box - Agent-based model - Framework/Protocol support - Machine impact - Promising approach: Service Mesh/Sidecar Proxy

Service Meshes and Zipkin

#QOTD https://twitter.com/rakyll/status/971231712049971200

Simulating Observability tools Lima - Chorrillos

➔ Model your architecture ➔ Simulate interaction ➔ Generate Traces
➔ Visualize your system’s traffic with Vizceral “SimianViz/ Spigo” - Simulation Protocol Interaction in GO github.com/adrianco/spigo

"Monitoring Microservices: A Challenge" - Adrian Cockcroft

Models from Traces, e.g. Vizceral https://www.youtube.com/watch?v=jWpI8qzqNHk

Demo Lab 03: Spigo and Vizceral • Spigo for Simulation
of Architecture behavior • Zipkin for Tracing and Vizceral for Traffic Monitoring https://github.com/jeqo/talk-kafka-zipkin#lab-3-spigo-simulation

Takeaways ➔ If are doing Distributed Systems — using Kafka
or not — consider Distributed Tracing. ➔ Instrument libraries first, not your code. ➔ Experiment by simulating your deployment. ➔ How many models can you build from tracing data?!

References Papers - Dapper: https://static.googleusercontent.com/media/research.google.com /en//pubs/archive/36356.pdf - Canopy: http://cs.brown.edu/~jcmace/papers/kaldor2017canopy.pdf -
Automating Failure Testing Research at Internet Scale: https://people.ucsc.edu/~palvaro/socc16.pdf Posts: - Logging v. Instrumentation https://peter.bourgon.org/blog/2016/02/07/logging-v-instrument ation.html - Monitoring and Observability https://medium.com/@copyconstruct/monitoring-and-observability -8417d1952e1c - Monitoring in the Time of Cloud Native https://medium.com/@copyconstruct/monitoring-in-the-time-of-cl oud-native-c87c7a5bfa3e Tools: - Zipkin: https://zipkin.io/ - Brave: https://github.com/openzipkin/brave - Kafka Interceptors: https://github.com/sysco-middleware/kafka-interceptors - Spigo: https://github.com/adrianco/spigo - Vizceral: https://github.com/Netflix/vizceral

Thanks! Q&A github.com/jeqo/talk-kafka-zipkin github.com/sysco-middleware Machu Picchu

The Importance of Observability for Kafka-based...

The Importance of Observability for Kafka-based applications with Zipkin

Jorge Quilcate

More Decks by Jorge Quilcate

Other Decks in Programming

Featured

Transcript