Slide 1

Slide 1 text

You need And you need it now! link to these slides - dvps.me/need-devops-now-160330

Slide 2

Slide 2 text

Evgeny Zislis Co-Founder / CTO www.devops.co.il

Slide 3

Slide 3 text

How fast do customers receive finished new application changes? ○ months? ○ weeks? ○ days? ○ hours? ○ minutes?

Slide 4

Slide 4 text

○ days? ○ hours? ○ minutes? ○ zero-downtime? How long are users denied service during scheduled and unscheduled maintenance?

Slide 5

Slide 5 text

○ $10,000 ○ $100,000 ○ $500,000 ○ $1 million ○ much more … How much profit is trapped in your bug trackers, not yet released?

Slide 6

Slide 6 text

“Between 12:00 AM and 11:59 PM on April 25, 2013, Quora released new versions of the site 46 times. This was a normal day for us.” - Quora engineering.quora.com/Continuous-Deployment-at-Quora “Deployment every 11.6s, 1,079 max in one hour. 10,000 mean number of hosts per deployment, with 30,000 maximum” - Amazon. com youtube.com/watch?v=PW1lhU8n5So “On the Google Consumer Surveys team, 8 minutes after you commit code it's live in production.” - Google developers.google.com/live/shows/772717729 “10+ deploys per day.” - John Allspaw, 2009 youtube.com/watch?v=LdOe18KhtT4

Slide 7

Slide 7 text

$$$ single most important key performance indicator (KPI) for any commerial business

Slide 8

Slide 8 text

manual process for release lacks repeatability/speed “Earlier today members were disconnected from Xbox LIVE and found themselves unable to log back in. … The root cause of this outage was human error.” - Microsoft blogs.msdn.com/b/xblops/archive/2011/10/03/issues-with-xbox-live-earlier-today.aspx

Slide 9

Slide 9 text

21 / 3 / 2016 "We discovered an issue in one of our backup systems last Thursday night (03/17). Maintenance was scheduled to resolve the issue over the weekend. On working to resolve the issue, an administrator accidentally deleted the production database." support.gliffy.com/entries/ 98911057--Gliffy-Online-System-Outage

Slide 10

Slide 10 text

No content

Slide 11

Slide 11 text

No content

Slide 12

Slide 12 text

Continuous Delivery

Slide 13

Slide 13 text

○ Builds in Quality ○ Creates real flexibility ○ Creates higher productivity ○ Frees up floor space ○ Improves safety ○ Improves morale ○ Reduces cost of inventory Benefits of Toyota One Piece Flow __as conceived by Taiichi Ohno

Slide 14

Slide 14 text

end of 2012 end of 2009 codeascraft.com/2011/02/15/measure-anything-measure-everything Etsy.com : Deploys per day

Slide 15

Slide 15 text

middle of 2015 codeascraft.com/2011/02/15/measure-anything-measure-everything Etsy.com : Deploys per day

Slide 16

Slide 16 text

“... we also track really important stuff, like how much coffee is left in the kitchen” - Etsy codeascraft.com/2011/02/15/measure-anything- measure-everything

Slide 17

Slide 17 text

It is not enough to do your best; you must know what to do, and then do your best. W. Edwards Deming 1900 - 1993 doing his best

Slide 18

Slide 18 text

monolithic vs. micro yobriefca.se/blog/2013/04/29/micro-service-architecture/ Circa 2000 Circa 2014

Slide 19

Slide 19 text

No content

Slide 20

Slide 20 text

“If you can’t feed a team with two pizzas, it’s too large. That limits a task force to five to seven people, depending on their appetites” - Jeff Bezos medium.com/@benorama/ the-evolution-of-software-architecture-bd6ea674c477

Slide 21

Slide 21 text

No content

Slide 22

Slide 22 text

No content

Slide 23

Slide 23 text

No content

Slide 24

Slide 24 text

No content

Slide 25

Slide 25 text

… software releases in the enterprise are still too slow, too buggy, and too costly … Hewlett Package Predictions for DevOps in 2016 http://techbeacon.com/resources/top-5-devops-predictions-trends-2016

Slide 26

Slide 26 text

The Enterprise IT Adoption Cycle enterpriseitadoption.com

Slide 27

Slide 27 text

The community of developers whose work you see on the Web, who probably don't know what ADO or UML or JPA even stand for, Deploy better systems at less cost in less time at lower risk than we see in the Enterprise. - Tim Bray

Slide 28

Slide 28 text

No content

Slide 29

Slide 29 text

lead time for changes release frequency time to restore service change fail rate How long is the delay between a request for a change, and a production system operating with that change implemented? How long does it take for an abnormal behavior in the system to be restored to the normal standard agreed way of operation? How many changes and features are being released to production in a fixed period of time? How often the system fails or service is disrupted?

Slide 30

Slide 30 text

High-performing IT organizations deploy 30x more frequently with 200x shorter lead times; they have 60x fewer failures and recover 168x faster. - PuppetLabs 2015 State of DevOps report puppetlabs.com/2015-devops-report

Slide 31

Slide 31 text

Build Quality IN Cease dependence on mass inspection to achieve quality. Improve the process and build quality into the product in the first place. -- W. Edwards Deming

Slide 32

Slide 32 text

No content

Slide 33

Slide 33 text

changes for the better ○ alerts with great signal to noise ratio. e.g. Google SRE notes & StackExchange Alerts ○ standard procedures and checklists. e.g. AWS Operational Checklist ○ practice recovery from system failures. e.g. Netflix downtime & Xbox downtime ○ practice backup with restore to dev/test. e.g. Netflix Priam ○ infrastructure as code & auto-healing. e.g. Antifragile Systems ○ simplicity is prerequisite for reliability. e.g. Forrester Devops & Simple Made Easy

Slide 34

Slide 34 text

No content

Slide 35

Slide 35 text

Thank you! www.devops.co.il We invite you to join Operations Israel Facebook group on on.fb.me/Ops-IL we are hiring at jobs@devops.co.il link to these slides - dvps.me/need-devops-now-160330