
OBSERVABILITY AND CONTROL IN THE AGE OF THE SERVICE MESH: PRESENT AND FUTURE

Matt Klein
September 07, 2017


Transcript

  1. OBSERVABILITY AND CONTROL IN THE AGE OF THE SERVICE MESH: PRESENT AND FUTURE

    Matt Klein / @mattklein123, Software Engineer @Lyft, September 7, 2017
  2. PAGERDUTY SUMMIT 17 | PIER 27 State of Incident Handling

    @Lyft: Something Breaks…
  3. State of Incident Handling:

    The Page is Triggered...
  4. State of Incident Handling:

    Per Service Auto-Generated Panel
  5. State of Incident Handling:

    Logging
  6. State of Incident Handling:

    Distributed Tracing
  7. State of Incident Handling:

    Service to Service Template Dashboard
  8. State of Incident Handling:

    Edge Proxy
  9. State of Incident Handling:

    Global Health Dashboard
  10. State of Incident Handling

    Lyft has advanced observability, yet DevOps cognitive overload is still extreme!
  11. What is Envoy and the Service Mesh?

  12. Envoy Refresher

  13. Envoy Refresher

  14. Observability is Key!

    Having all SoA traffic transit through Envoy gives us a single place where we can:
    • Produce consistent statistics for every hop
    • Create and propagate a stable request ID / tracing context
    • Consistent logging
    • Distributed tracing
  15. Lyft Today Obs, obs, obs, obs, obs, obs...

  16. Future of Microservice Observability

    How can we reimagine observability and operations in the age of the service mesh?
  17. Hystrix by Netflix

  18. Service Portal Sketch: Landing

  19. Service Portal Sketch: Service Detail

  20. Service Portal Sketch: Service Detail Alternate

  21. Service Portal Sketch: Service Detail

    • Optimal visualization of high level state
    • Actions relevant to mitigation
    • Machine learning to identify problems
    • RBAC and versioning
  22. How Do We Get There?

    • A universal data plane like Envoy provides unified APIs for control as well as consistent observability output.
    • Allows us to build more complex full service mesh solutions such as Istio.
    • When we assume the existence of the service mesh, we can focus on an incredible UI/UX instead of constantly trying to keep every application up to date.
    • Assume that service mesh is the future… All data is available.
    • We need to start building the UI/UX/ML of the future for distributed system command control. Need to start now!
  23. Q&A

    • Thanks for coming! Questions welcome on Twitter: @mattklein123
    • We are super excited about building a community around Envoy. Moving to a service mesh architecture yields many benefits.
    • Would love to make progress on the UI/UX described in this talk. Reach out if you are interested!
  24. Thank you for attending the session.

    Participating in the PagerDuty Summit Passport Game? Enter the code below to earn points. SUMMIT
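
Slide 14 says Envoy can create and propagate a stable request ID / tracing context for every hop. For that context to survive a hop through application code, the service generally has to copy the relevant headers from the inbound request onto its outbound calls. The following is a minimal sketch of that step, not code from the talk: the header names are those listed in Envoy's tracing documentation, while the helper `propagate_trace_headers` and its signature are hypothetical.

```python
# Header names as listed in Envoy's tracing documentation.
TRACE_HEADERS = (
    "x-request-id",
    "x-b3-traceid",
    "x-b3-spanid",
    "x-b3-parentspanid",
    "x-b3-sampled",
    "x-b3-flags",
    "x-ot-span-context",
)


def propagate_trace_headers(inbound, outbound=None):
    """Return outbound headers with the tracing headers copied from inbound.

    Hypothetical helper for illustration: `inbound` and `outbound` are plain
    dicts mapping header names to values. Inbound names are matched
    case-insensitively; copied headers are written in lowercase.
    """
    result = dict(outbound or {})
    lowered = {name.lower(): value for name, value in inbound.items()}
    for name in TRACE_HEADERS:
        if name in lowered:
            result[name] = lowered[name]
    return result
```

A service would call this once per outbound request, passing the headers of the request it is currently handling, so that the sidecar on the next hop sees the same request ID and trace context. Headers that are not in the list, such as cookies, are deliberately not forwarded.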