Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Site Reliability Engineering Webinar

Site Reliability Engineering Webinar

Yury Nino

May 22, 2020
Tweet

More Decks by Yury Nino

Other Decks in Technology

Transcript

  1. • What is DevOps & SRE? • How did I

    become in SRE? • How is my daily life? • DevOps Practices • Resilience & Chaos Engineering • Human Factors & Postmortems
  2. Reliability is an Important Feature If a system isn't reliable,

    users won't trust it. 500 Something was wrong!
  3. Introducción El framework provee referencias, lineamientos, políticas, mejores prácticas y

    protocolos que se administran de manera centralizada. DevOps Culture
  4. DevOps is a mindset inspired in the collaboration between operations

    and development teams. SRE is talent and technical experience that engineers develop to solve problems related to Infrastructure, Security and Operations using Software Engineering. Introducción SRE vs DevOps
  5. SRE is an organizational model for running online services more

    reliably by teams that are chartered to do reliability focused engineering work. What is SRE book SRE is what happens when you ask a software engineer to design an operations team! SRE Book SRE is doing work that has historically been done by an operations team, but using engineers with software expertise and with the ability to design and implement automation to replace human labor. What is SRE book
  6. Site Reliability Engineering is what you get when you treat

    operations as if it’s a software problem. Their mission is to protect and progress software with an ever-watchful eye on their availability, latency and performance. They are not Heroes!!!
  7. Key skill areas for an SRE? • Software engineering •

    Distributed systems design • Operating systems • Networking • Databases • Security • Reliability best practices • Troubleshooting • Customer support
  8. DevOps Security Dev[Sec]Ops is empowered engineering teams taking ownership of

    how their product performs in production [including security]
  9. • Automating everything. • Choosing the right tool. • Writing

    all your infrastructure in code. • Building a good CI/CD pipeline. • Integrating tests. • Applying security with DevSecOps. • Monitoring your system. • Evolving project management. DevOps Recommendations
  10. 2008-2013 DevOps is born Ben Treynor coined SRE 2014 First

    Conference about SRE: SRECon 2016-2018 SRE Books are released Why SRE in Google? 2019 SRE massification
  11. Chaos Engineering It is the discipline of experimenting in production

    on a distributed system in order to reveal their weakness and to build confidence in their resilience capability. https://principlesofchaos.org/
  12. What my mom thinks I do What my friends thinks

    I do What software engineers think I do What I really do Who is a Chaos Engineer? Help service owners to increase their resilience through education, tools and encouragement.
  13. The best way to promote a postmortem culture is adopting

    a new view, a view focused in the syntoms, no in the causes ...