UX Design and Education for Effective Monitoring Tools (TechSummit Berlin 2017)

UX Design and Education for Effective Monitoring Tools (TechSummit Berlin 2017)

I, like many of us, chose to work in infrastructure because I thought it meant I could avoid talking to other people as much as possible. I was wrong. It turns out that a huge amount of work in monitoring is about empowering the rest of your engineering organization to use the tools you develop correctly, quickly, and effectively. We spend so much time explaining how to interpret timeseries data and why averaging percentiles is a bad idea. After feedback from our engineers, we embarked on a journey to redesign our internal monitoring tools and understand where people were struggling with the existing system.

In this talk, I will explain how we approached the problem of making concepts like interpolation, aggregation, and alerting more intuitive and how we identified pain points for new users. I will outline common misconceptions our users have about monitoring and how we cleared up this confusion in our UI without forcing everyone to spend hours on documentation. Rather than copying and pasting existing UX design principles onto our monitoring problems, we will see how we can reinterpret these ideas and apply them to our unique situation to create a better experience for everyone.

780e86312035da00762813aa2e443ae8?s=128

Amy Nguyen

April 12, 2017
Tweet

Transcript

  1. 1.

    UX Design and Education for Effective Monitoring Tools Amy Nguyen

    @amyngyn April 12, 2017 or: one way to become a 10x engineer
  2. 2.

    Hi!

  3. 4.

    Hi! • Joined Pinterest in 2015, now on Visibility team

    • Recent projects: network tracing, D3 data visualizations, caching OpenTSDB timeseries data
  4. 5.

    Hi! • Joined Pinterest in 2015, now on Visibility team

    • Recent projects: network tracing, D3 data visualizations, caching OpenTSDB timeseries data • Most importantly: overhauling the user experience of our monitoring stack
  5. 6.

    Hi! • Joined Pinterest in 2015, now on Visibility team

    • Recent projects: network tracing, D3 data visualizations, caching OpenTSDB timeseries data • Most importantly: overhauling the user experience of our monitoring stack • amynguyen.net / @amyngyn
  6. 8.

    8 About Pinterest • 150 million monthly active users the

    world's first visual discovery engine
  7. 9.

    9 About Pinterest • 150 million monthly active users •

    100 billion pins the world's first visual discovery engine
  8. 10.

    10 About Pinterest • 150 million monthly active users •

    100 billion pins • 2 billion boards the world's first visual discovery engine
  9. 12.

    12 About Pinterest Monitoring • Over 150,000 requests per second

    the world's greatest my favorite monitoring team
  10. 13.

    13 About Pinterest Monitoring • Over 150,000 requests per second

    • 20 terabytes logged per day the world's greatest my favorite monitoring team
  11. 14.

    14 About Pinterest Monitoring • Over 150,000 requests per second

    • 20 terabytes logged per day • 2.5 million metrics per second the world's greatest my favorite monitoring team
  12. 15.

    15 About Pinterest Monitoring • Over 150,000 requests per second

    • 20 terabytes logged per day • 2.5 million metrics per second • Over 400 engineers the world's greatest my favorite monitoring team
  13. 19.
  14. 21.

    How to become a 10x engineer 1. Help 10 other

    engineers do their jobs better.
  15. 22.

    How to become a 10x engineer 1. Help 10 other

    engineers do their jobs better. 2. Profit!
  16. 24.

    How to help others do their jobs better 1. Improve

    the experience of working with your documentation. 2. Improve the experience of working with your team. 3. Improve the experience of working with your tools.
  17. 25.

    How to help others do their jobs better 1. Improve

    the experience of working with your documentation. 2. Improve the experience of working with your team. 3. Improve the experience of working with your tools.
  18. 26.
  19. 27.

    • Is your documentation split across Google Drive, internal wikis,

    and old emails that get forwarded around? • Does your documentation assume prior knowledge that a new engineer wouldn't have?
  20. 28.

    • Is your documentation split across Google Drive, internal wikis,

    and old emails that get forwarded around? • Does your documentation assume prior knowledge that a new engineer wouldn't have? • When was the last time anyone on your team followed the instructions in your documentation and checked that they still worked?
  21. 31.

    How can we maintain good documentation? • Add common questions

    to the docs as they come up. • Always link to the docs even if you can answer it yourself.
  22. 32.

    How can we maintain good documentation? • Add common questions

    to the docs as they come up. • Always link to the docs even if you can answer it yourself. • Encourage everyone to contribute to the docs.
  23. 33.

    How to help others do their jobs better 1. Improve

    the experience of working with your documentation. 2. Improve the experience of working with your team. 3. Improve the experience of working with your tools.
  24. 35.

    • How do people contact your team if they have

    questions? • How long does it usually take for someone to have their question answered?
  25. 36.

    • How do people contact your team if they have

    questions? • How long does it usually take for someone to have their question answered? • What happens when someone sends you a complaint or request?
  26. 37.

    • How do people contact your team if they have

    questions? • How long does it usually take for someone to have their question answered? • What happens when someone sends you a complaint or request? • How do people feel after they interact with your team?
  27. 38.

    • How do people contact your team if they have

    questions? • How long does it usually take for someone to have their question answered? • What happens when someone sends you a complaint or request? • How do people feel after they interact with your team? ◦ Stupid? Confused? Unsatisfied? Unwilling to return in the future?
  28. 39.

    • How do people contact your team if they have

    questions? • How long does it usually take for someone to have their question answered? • What happens when someone sends you a complaint or request? • How do people feel after they interact with your team? ◦ Stupid? Confused? Unsatisfied? Unwilling to return in the future? ◦ Heard! Supported! Understood! Learned something new!
  29. 40.

    How to help others do their jobs better 1. Improve

    the experience of working with your documentation. 2. Improve the experience of working with your team. 3. Improve the experience of working with your tools.
  30. 42.
  31. 43.
  32. 44.
  33. 46.
  34. 47.
  35. 48.
  36. 49.
  37. 51.
  38. 52.
  39. 54.

    How to help others do their jobs better 1. Improve

    the experience of working with your documentation. 2. Improve the experience of working with your team. 3. Improve the experience of working with your tools.