Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Failures In Implementing Reliability

Failures In Implementing Reliability

Stories on trying to do something reliably but failing. Presented at SRE Finland meetup.

Ab4a11cf19e2341bfb0837b2ed2b2dd0?s=128

Jaakko Pallari

January 16, 2020
Tweet

Transcript

  1. Welcome to SRE Finland!

  2. Slack channel Sign up @ devopsfinland.org Join #sre-finland

  3. None
  4. The best DevOps company * * according to us

  5. The best DevOps company * * according to us we

    also do SRE, Dataops and DevSecOps
  6. Jaakko Pallari SRE Finland co-organiser Lead SRE Consultant @ Polar

    Squad Background in SW dev and DevOps
  7. FAILURES IN IMPLEMENTING RELIABILITY

  8. Disclaimer No SRE involved

  9. None
  10. None
  11. SCALE!

  12. None
  13. None
  14. None
  15. SCALE?

  16. SCALE? SCALE!

  17. SCALE!

  18. SCAL SCALE!

  19. None
  20. “use the right tool for the right job lol”

  21. None
  22. Mission: Zero downtime upgrades

  23. Azure Ansible Kubernetes Go PostgreSQL Monorepo

  24. None
  25. Azure Kubernetes Go PostgreSQL Monorepo

  26. Azure ARM Ansible Kubernetes Go PostgreSQL Monorepo

  27. None
  28. None
  29. What are we even installing here?

  30. What is even installed in production?

  31. None
  32. 1.0.4, 1.0.5 ... 1.6.1, 1.6.2 in prod latest

  33. PRODUCTION PRODUCTION v2

  34. None
  35. THE INFRA

  36. THE INFRA

  37. EMPOWER AND TRUST

  38. THE INFRA

  39. THE INFRA

  40. What was learned

  41. It’s OK to trust the tools you know

  42. Focus on the right issue

  43. EMPOWER AND TRUST the teams you work with

  44. thank you