Continuous Delivery: a journey not a destination

CONTINUOUS DELIVERY: A JOURNEY NOT A DESTINATION
Akshay Karle & Fernando Junior
@snap_ci | https://snap-ci.com

The essence of my philosophy to software delivery is to build software so that it is always in a state where it could be put into production. We call this Continuous Delivery because we are continuously running a deployment pipeline that tests if this software is in a state to be delivered.
- Jez Humble, http://martinfowler.com/delivery.html

High-performing IT organizations deploy 30x more frequently with 200x shorter lead times; they have 60x fewer failures and recover 168x faster.
https://puppetlabs.com/sites/default/files/2015-state-of-devops-report.pdf

DO WANT!!!

frequent, reliable releases of high quality valuable software

SNAP, BACK IN 2012…
Built on 9 components (microservices, anyone?)
A message-queue for fast background job processing
Chef-server for infrastructure provisioning
Relational & NoSQL databases
Extensive Nagios monitoring
Completely automated infrastructure creation
Build and deployment pipelines for each component
So clearly - we had it nailed. Right?

Well, actually…

CONSEQUENCES
Likelihood of build setup passing: 50%
Likelihood of changes actually triggering a build: 50%
Likelihood that someone knew why: 0%
Likelihood of firing up 4 SSH sessions to debug: 100%
Likelihood that the infrastructure was busted: 50%
Frequency of production upgrades: Twice a month
Consequence of upgrades: Site unavailable!
Feedback to users about what was going on: Minimal

frequent, reliable releases of high quality valuable software

FAIL!

Our journey so far..

Queues per build
Messages lost, redelivery problems
Single, centralized queue
No need to manage queues
Delayed::Job

one Nagios to rule them all
too many components
too many metrics and alerts
ignored alerts when things actually did go wrong
separated application monitoring
basic server monitoring
aggregated logs
we don’t need to manage anything

Server
became cognizant of what state would go into chef
reliable and faster chef runs
nothing to manage
user build specific configuration data
dependency on the chef server for deployments
had to manage the server
Solo

Single mutable environments
Too many components
READONLY MODE
Long running deployments
Losing build requests
Losing customers
Fewer components
Blue-green deployments

COUPLED MODULES TO MONOLITH TO REAL MODULES
Highly coupled initial state
Collapsed moving parts to a monolith
Deploying frequently highlights rates of change
This was used to make new modules
Much more decoupled state today

BLUE-GREEN DEPLOYMENTS IN SNAP
[Diagram. Labels: LB; two VZHOSTs, each with a build server and a web server; DATABASE]
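
A minimal sketch of the blue-green idea in Ruby, assuming two interchangeable environments behind a load balancer. `Environment`, `BlueGreenRouter` and `deploy` are hypothetical names chosen for illustration, not Snap's actual tooling, and the smoke test is supplied by the caller as a block.

```ruby
# Hypothetical names throughout; this is not Snap's deployment code.
Environment = Struct.new(:name, :version)

class BlueGreenRouter
  attr_reader :live, :idle

  def initialize(blue, green)
    @live = blue   # currently receives traffic from the load balancer
    @idle = green  # safe to upgrade, since no traffic reaches it
  end

  # Upgrade the idle environment, verify it, and only then swap roles.
  # The block acts as the smoke test and receives the upgraded environment;
  # a falsy result leaves the live environment untouched.
  def deploy(version)
    previous = @idle.version
    @idle.version = version
    unless yield(@idle)
      @idle.version = previous  # roll the idle side back
      return false
    end
    @live, @idle = @idle, @live
    true
  end
end

router = BlueGreenRouter.new(Environment.new("blue", 1), Environment.new("green", 1))
router.deploy(2) { |env| env.version == 2 }  # smoke test passes, green goes live
router.deploy(3) { |_env| false }            # smoke test fails, green stays live
```

Because traffic only moves after the idle side has been verified, a bad build never reaches users, which is one way to get close to the "near zero downtime" the deck describes.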

SO, WHAT’S THE CURRENT STATE?
Build setups almost always pass…
…but if there are failures, the team gets alerted
Deployments happen every other day
New releases happen once every 2~3 weeks
Near zero downtime

YAGNI

How do we work now?

SEQUENCING OF STORIES
[Figure, two panels, each with a time axis: (1) short (idealized) cycle time, more technical risk; (2) longer (idealized) cycle time, less technical risk]

FEATURE TOGGLES
Hide unfinished UI elements
Control backend behaviour
Test with feature toggles
Avoid multi-component feature toggles
http://martinfowler.com/bliki/FeatureToggle.html

FRONTEND
<% if feature_enabled?(:parallel_stage) %>
<% else %>
<% end %>

BACKEND
if feature_enabled?(:parallel_stage)
  # new logic
else
  # old logic
end
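
The slides do not show how `feature_enabled?` is implemented. The following is one possible sketch, assuming a simple in-memory registry: the `FeatureToggles` module, its methods, and the `job_view` example are hypothetical, and the `with_feature_enabled` / `with_feature_disabled` helpers are written as plain setters of the kind a spec could call from a `before` hook.

```ruby
# Hypothetical toggle registry; the helper names mirror the slides, but the
# implementation is an assumption.
module FeatureToggles
  @enabled = {}

  class << self
    def enable(name)
      @enabled[name] = true
    end

    def disable(name)
      @enabled[name] = false
    end

    # Unknown toggles default to off, so unfinished work stays hidden.
    def enabled?(name)
      @enabled.fetch(name, false)
    end

    # Clear all toggles, e.g. between specs, so state does not leak.
    def reset!
      @enabled.clear
    end
  end
end

# Helper used by application code (views and backend alike).
def feature_enabled?(name)
  FeatureToggles.enabled?(name)
end

# Helpers for flipping a toggle in specs.
def with_feature_enabled(name)
  FeatureToggles.enable(name)
end

def with_feature_disabled(name)
  FeatureToggles.disable(name)
end

# Illustrative caller: picks a view depending on the toggle.
def job_view
  feature_enabled?(:parallel_stage) ? :job_tabs : :single_log
end
```

Keeping the check behind a single helper means that removing a toggle later is a search for one method name, and defaulting unknown toggles to off keeps half-built features invisible in production.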

TESTING WITH FEATURE TOGGLES
describe "multiple jobs" do
  describe "feature enabled" do
    before(:each) do
      with_feature_enabled(:parallel_stage)
    end

    it 'should not show the job tabs when there is only one job' do
    end
  end

  describe "feature disabled" do
    before(:each) do
      with_feature_disabled(:parallel_stage)
    end

    it 'should not show the job tabs but should show the logs' do
    end
  end
end

MIGRATION OF DATA
Consider existing schema as well as data
Incremental changes
Rollbacks
Zero downtime releases

INTRODUCE JOBS
jobs: id

POPULATE ALL EXISTING STAGES
stages: id, started_at, completed_at, result, …
jobs: id, stage_id, …

COPY ATTRIBUTES TO JOB
stages: …, started_at, completed_at, result, …
jobs: …, started_at, completed_at, result, …
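
The populate and copy steps can be sketched as a backfill. This is an illustration only: it uses in-memory `Struct`s instead of database tables, folds the two steps into one pass, and `backfill_jobs` is a hypothetical name. The column names come from the slides.

```ruby
# In-memory stand-ins for the stages and jobs tables (hypothetical; a real
# migration would run against the database).
Stage = Struct.new(:id, :started_at, :completed_at, :result)
Job   = Struct.new(:id, :stage_id, :started_at, :completed_at, :result)

# Give every stage that has no job yet exactly one job carrying the stage's
# attributes. Stages that already have a job are skipped, so the backfill
# can be re-run safely after an interrupted run.
def backfill_jobs(stages, jobs)
  covered = jobs.map(&:stage_id)
  next_id = (jobs.map(&:id).max || 0) + 1
  stages.each do |stage|
    next if covered.include?(stage.id)
    jobs << Job.new(next_id, stage.id, stage.started_at, stage.completed_at, stage.result)
    next_id += 1
  end
  jobs
end

stages = [
  Stage.new(1, Time.at(0), Time.at(60), :passed),
  Stage.new(2, Time.at(100), Time.at(130), :failed)
]
jobs = []
backfill_jobs(stages, jobs)
backfill_jobs(stages, jobs)  # second run finds every stage covered and adds nothing
```

The stage columns are left in place during this step, so the old code path keeps working and a rollback needs no data repair.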

SWITCH THE APPLICATION TO START USING THE JOB MODEL

# Before switch
class Stage
  attr_reader :result
end

# In transition
class Stage
  def result
    if feature_enabled?(:parallel_stage)
      results = jobs.collect { |job| job.result }
      return :failed if results.include?(:failed)
      :passed
    else
      @result  # the stage's own stored value; calling `result` here would recurse
    end
  end
end

# After switch
class Stage
  def result
    results = jobs.collect { |job| job.result }
    return :failed if results.include?(:failed)
    :passed
  end
end

REMOVE UNUSED ATTRIBUTES
stages: id, started_at, completed_at, result, …
jobs: …, started_at, completed_at, result, …

IDENTICAL ENVIRONMENTS/AUTOMATION
[Diagram, frame 1. Labels: staging; build n+1, upgrade, smoke tests; ver. n; To production]

[Diagram, frame 2. Labels: staging; build n+1, upgrade, smoke tests; ver. n+1; To production]

[Diagram, frame 3. Same labels as frame 2]

[Diagram, frame 4. Labels: staging; ver. n+1; build n+1, upgrade, smoke tests; build n+2, upgrade, smoke tests; To production]
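
One way to read these frames is as a promotion gate: a build is applied to staging with the same upgrade routine that production uses, and it moves on only if the smoke tests pass. The sketch below assumes that reading; `Pipeline`, `release` and `upgrade` are hypothetical names, and versions are plain integers.

```ruby
# Hypothetical pipeline step; names and structure are illustrative only.
class Pipeline
  attr_reader :staging_version, :production_version

  def initialize(version)
    @staging_version = version
    @production_version = version
  end

  # Upgrade staging first; promote the build to production only when the
  # smoke tests (given as a block) pass against staging.
  def release(build)
    @staging_version = upgrade(build)
    return false unless yield(@staging_version)
    @production_version = upgrade(build)
    true
  end

  private

  # One upgrade routine shared by both environments, which is the point of
  # keeping them identical and automated.
  def upgrade(build)
    build
  end
end

pipeline = Pipeline.new(1)
pipeline.release(2) { |version| version == 2 }  # smoke tests pass, production moves to 2
pipeline.release(3) { |_version| false }        # smoke tests fail, production stays on 2
```

After the second call, staging holds the failing build while production is unchanged, so the failure is visible without affecting users.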

SUMMARY
Start with getting working software into hands of users
Layer on qualities of reliability, automation, frequent releases etc.
Kick the can down the road
Don’t be afraid of “re-design”/“re-work”
Let CD guide your architecture and design - not the other way around

THANK YOU
Akshay Karle, @akshay_karle
Fernando Junior, @nandopaf
@snap_ci | https://snap-ci.com
