Test Driven Infrastructure

Test Driven Infrastructure Bring Tranquility To Your Infrastructure Arthur Maltson
@amaltson

Speaker Note: I’ll be talking about a problem that plagues….

Speaker Note: Both large companies like big ﬁnancials.

Speaker Note: And small, like the hip startups.

Speaker Note: That problem is untested infrastructure. If you’re experiencing
this, what might be the symptoms?

Speaker Note: Do you cross your ﬁngers before running a
script in production?

Speaker Note: Or cradle a bottle of wine under your
desk after another failed deploy?

Speaker Note: Do you ﬁnd your infrastructure has stability issues?

Speaker Note: Or that automation you’ve written behaves in surprising
ways? If so, you’re probably experiencing untested infrastructure.

Speaker Note: If you happen to talk to your friendly
neighbourhood DevOps Unicorn….

Speaker Note: They might tell you about Test Driven Infrastructure.
But what is Test Driven Infrastructure (TDI)?

Refactor Green Red Speaker Note: TDI comes from a process
in Software Development known as Test Driven Development (TDD). This is a popular technique that has been shown to lead to higher quality code, that’s more stable and easier to maintain.

Speaker Note: If you listen to those.. interesting.. DevOps unicorns,
you might expect to experience….

Speaker Note: Extreme conﬁdence in your automation.

Speaker Note: Better infrastructure stability.

Speaker Note: And general operations and developer happiness. Great, but
what does it take to get here?

Speaker Note: It doesn’t come free. Getting to TDI takes
a number of tools.

Speaker Note: To start, we need a Conﬁguration Management system.
Doesn’t have to be Chef, that’s just the example here. At the end of the day you could use Bash scripts, but CM will probably works better.

Speaker Note: We’ll use a default cookbook generated by Chef
DK using the chef generate cookbook command.

Speaker Note: In our example we’ll set up Redis. In
Chef, using the redisio cookbook from the Supermarket, it might look like this.

Speaker Note: The other tool we’ll need is Test Kitchen
(TK). TK is going to be our primary testing work horse.

Speaker Note: TK is the test runner, using it’s massively
pluggable architecture to run tests on any platform, framework, etc.

Drivers: docker, Vagrant, AWS … Speaker Note: TK is the
test runner, using it’s massively pluggable architecture to run tests on any platform, framework, etc.

Drivers: docker, Vagrant, AWS … Communication: ssh, winrm, … Speaker
Note: TK is the test runner, using it’s massively pluggable architecture to run tests on any platform, framework, etc.

Drivers: docker, Vagrant, AWS … Communication: ssh, winrm, … Provisioners:
Chef, Ansible, Puppet … Speaker Note: TK is the test runner, using it’s massively pluggable architecture to run tests on any platform, framework, etc.

Chef, Ansible, Puppet … Testing: InSpec, Pester, BATS… Speaker Note: TK is the test runner, using it’s massively pluggable architecture to run tests on any platform, framework, etc.

Chef, Ansible, Puppet … Testing: InSpec, Pester, BATS… Platform: CentOS, Ubuntu, Windows … Speaker Note: TK is the test runner, using it’s massively pluggable architecture to run tests on any platform, framework, etc.

Speaker Note: The four commands we’ll be looking at are
kitchen create/login/converge/ verify. Create will create the VM/container. Login lets you poke around the server TK starts up. Converge will execute the provisioner against the server to put it into the desired state. Finally, verify will run all the tests inside the server.

kitchen create/login/converge/ verify. Create will create the VM/container. Login lets you poke around the server TK starts up. Converge will execute the provisioner against the server to put it into the desired state. Finally, verify will run all the tests inside the server. kitchen create

kitchen create/login/converge/ verify. Create will create the VM/container. Login lets you poke around the server TK starts up. Converge will execute the provisioner against the server to put it into the desired state. Finally, verify will run all the tests inside the server. kitchen login kitchen create

kitchen create/login/converge/ verify. Create will create the VM/container. Login lets you poke around the server TK starts up. Converge will execute the provisioner against the server to put it into the desired state. Finally, verify will run all the tests inside the server. kitchen converge kitchen login kitchen create

kitchen create/login/converge/ verify. Create will create the VM/container. Login lets you poke around the server TK starts up. Converge will execute the provisioner against the server to put it into the desired state. Finally, verify will run all the tests inside the server. kitchen converge kitchen verify kitchen login kitchen create

Speaker Note: Speaking of tests, this is where InSpec comes
in. InSpec is an extension of RSpec, a Ruby BDD testing library. It speciﬁcally focuses on server testing. InSpec uses the underlying OS commands to verify the state of the system.

Speaker Note: This is an example of how to test
whether a system is listening on a speciﬁc port. InSpec will then use the underlying netstat command to check if the port is being listened to. You can also make sure it’s NOT listening on speciﬁc ranges ports.

Speaker Note: This is how to check if a service
exists or is enabled. InSpec will use the proper OS level check, like chkconfig on CentOS.

Speaker Note: You can use it to check if a
user exists. In this case it’ll use id on Linux OSes.

Speaker Note: There are many more resources, but InSpec offers
the command resource which provides the ultimate ﬂexibility. You can execute any command and then inspect its standard out, standard error and exit status.

Speaker Note: I’d be remiss if I didn’t mention Docker
in a DevOps themed talk. But Docker is perfect for testing. You want to spin up a server, very quickly, provision it and then tear it down.

Speaker Note: Using Test Kitchen’s pluggable architecture, we can customize
the kitchen.yml ﬁle to test against Docker. The easiest path in a Chef world is to use kitchen-dokken, which ships custom Docker images set up to build in Chef and conﬁgures SystemD so it looks like a full OS making the test more realistic. @amaltson

0 35s 1m 10s 1m 45s 2m 20s Vagrant Docker
@amaltson Speaker Note: Get a huge performance gain. With that small change, we get over 30% performance boost. If we cache the resources ofﬂine, we can tighten our feedback cycle from initial boot to converge to verify in under one minute.

Speaker Note: With Chef, Test Kitchen, InSpec and Docker in
our tool belt, we put on our safety goggles and ask “what does the process look like?”

Speaker Note: To talk about the TDI process, we ﬁrst
need to discuss the TDD process. In TDD you ﬁrst write the failing test, then you write the code to make it pass, and especially in software development, you refactor. You can safely refactor your code because you have the tests to back you up. I’m not religious about the order, as long as you write the tests close to the code under test.

Red Speaker Note: To talk about the TDI process, we
ﬁrst need to discuss the TDD process. In TDD you ﬁrst write the failing test, then you write the code to make it pass, and especially in software development, you refactor. You can safely refactor your code because you have the tests to back you up. I’m not religious about the order, as long as you write the tests close to the code under test.

Green Red Speaker Note: To talk about the TDI process,
we ﬁrst need to discuss the TDD process. In TDD you ﬁrst write the failing test, then you write the code to make it pass, and especially in software development, you refactor. You can safely refactor your code because you have the tests to back you up. I’m not religious about the order, as long as you write the tests close to the code under test.

Refactor Green Red Speaker Note: To talk about the TDI
process, we ﬁrst need to discuss the TDD process. In TDD you ﬁrst write the failing test, then you write the code to make it pass, and especially in software development, you refactor. You can safely refactor your code because you have the tests to back you up. I’m not religious about the order, as long as you write the tests close to the code under test.

Speaker Note: The approach for TDI is very similar. You
write a failing InSpec test, you make it pass with a Chef recipe/Ansible playbook/Puppet manifest, and then you refactor if necessary. You won’t refactor as often because the code is generally simpler. However, if you’re depending on an open source cookbook, like redisio, and sometime down the road you decide to write your own Redis cookbook, you have the tests to back you up.

InSpec Speaker Note: The approach for TDI is very similar.
You write a failing InSpec test, you make it pass with a Chef recipe/Ansible playbook/Puppet manifest, and then you refactor if necessary. You won’t refactor as often because the code is generally simpler. However, if you’re depending on an open source cookbook, like redisio, and sometime down the road you decide to write your own Redis cookbook, you have the tests to back you up.

InSpec Recipe Speaker Note: The approach for TDI is very
similar. You write a failing InSpec test, you make it pass with a Chef recipe/Ansible playbook/Puppet manifest, and then you refactor if necessary. You won’t refactor as often because the code is generally simpler. However, if you’re depending on an open source cookbook, like redisio, and sometime down the road you decide to write your own Redis cookbook, you have the tests to back you up.

Refactor InSpec Recipe Speaker Note: The approach for TDI is
very similar. You write a failing InSpec test, you make it pass with a Chef recipe/Ansible playbook/Puppet manifest, and then you refactor if necessary. You won’t refactor as often because the code is generally simpler. However, if you’re depending on an open source cookbook, like redisio, and sometime down the road you decide to write your own Redis cookbook, you have the tests to back you up.

Speaker Note: Enough slides, let’s see Test Driven Infrastructure in
action. We’ll get Redis installed practicing TDI.

Speaker Note: We start with an empty recipe, and an
empty Dokken Docker container.

Speaker Note: And also the default generated InSpec test.

Speaker Note: When installing Redis, we want to make sure
Redis runs under it’s own user as is standard practice in Linux. We also want make sure Redis stores it’s database in the default /var/lib/redis directory. We write out these tests in InSpec. Make it fail.

Speaker Note: We need to add a dependency on the
Supermarket redisio cookbook in Chef’s Policyﬁle.rb and execute the ‘chef update’ command.

Speaker Note: We then add a dependency on the redisio
cookbook in metadata.rb. This will have our cookbook pull in that dependency.

Speaker Note: And ﬁnally, Chef speciﬁc, we include the default
recipe to get Redis installed. Now our test passes.

Speaker Note:The tests passed, but we don’t know if Redis
is actually started and running. Let’s write some failing tests that make sure Redis is listening on the default 6379 port and has a service to ensure Redis starts back up on reboots. Redisio names the service with the port number.

Speaker Note: To make the test pass, we just need
to include the ‘enable’ recipe.

Speaker Note: Redis being a database, you want to be
able to store and retrieve data from it. Using the command resource in InSpec, we can call any command on the OS. We can use the redis-cli command to put data into Redis and get data out. Let’s write those tests. They pass.

Speaker Note: Finally, we discussed how Test Kitchen has support
for multiple platforms. Let’s say tomorrow your CIO comes down and says “there’s this great new Linux server out there called Ubuntu Server, we should use it.” Fortunately, since we’ve followed a Test Drive Infrastructure approach, we just need to add the Ubuntu platform (in this case an ofﬂine cache version), run kitchen converge and then kitchen verify to see a passing build of Redis installed and working on Ubuntu. This is kind of a “refactor” to add Ubuntu support.

Refactor InSpec Recipe Speaker Note: We now saw the full
virtuous cycle of writing a failing ServerSpec test, writing the recipe/playbook/etc to make that test pass, and then even “refactor” by changing the platform we support.

Speaker Note: Of course nothing has all upsides, there are
some downsides with testing too. If you’re cookbook does too much, your feedback cycle can get really long. If the tests you execute take a long time to return, and you do a lot of them, that also increases the feedback cycle. Testing also adds more process, so shipping takes longer. Just like with software development 10 years ago, there was questions on “why would I test, I never did that before”. 10 years later, we’re terriﬁed to touch code that doesn’t have tests. My recommendation would be, keep your cookbooks/playbooks/etc small and focused. You should really only need 30-100 tests.

Speaker Note: At the end of the day, the tradeoffs
and gotchas are well worth it. It’s all about safety and conﬁdence in making changes to your infrastructure. We’ve found tests catch a range of issues, like wrong conﬁgurations for websites, before it ever hits the development or production environment. The end goal is to move fast and continuous deliver.

Speaker Note: continuously deliver value, not downtime. This whole time
we’ve been talking about development on our local workstation…

Speaker Note: This is where Continuous Integration (CI) comes into
play. With something like CircleCI or Jenkins, you get a central place that veriﬁes the tests continue passing. With Docker, running these TK tests in CI is really easy.

Speaker Note: What does the full workﬂow look like? You
follow the TDI cycle locally, commit and push to your central repo, that triggers a build, which ﬁres up a Docker image and runs Test Kitchen and InSpec tests. If build fails, you deal with it on your workstation and continue the cycle again.

Refactor Recipe ServerSpec Speaker Note: What does the full workﬂow
look like? You follow the TDI cycle locally, commit and push to your central repo, that triggers a build, which ﬁres up a Docker image and runs Test Kitchen and InSpec tests. If build fails, you deal with it on your workstation and continue the cycle again.

git push Refactor Recipe ServerSpec Speaker Note: What does the
full workﬂow look like? You follow the TDI cycle locally, commit and push to your central repo, that triggers a build, which ﬁres up a Docker image and runs Test Kitchen and InSpec tests. If build fails, you deal with it on your workstation and continue the cycle again.

git push Refactor Recipe ServerSpec triggers Speaker Note: What does
the full workﬂow look like? You follow the TDI cycle locally, commit and push to your central repo, that triggers a build, which ﬁres up a Docker image and runs Test Kitchen and InSpec tests. If build fails, you deal with it on your workstation and continue the cycle again.

git push Refactor Recipe ServerSpec triggers build Speaker Note: What
does the full workﬂow look like? You follow the TDI cycle locally, commit and push to your central repo, that triggers a build, which ﬁres up a Docker image and runs Test Kitchen and InSpec tests. If build fails, you deal with it on your workstation and continue the cycle again.

git push Refactor Recipe ServerSpec triggers build ❌ Speaker Note:
What does the full workﬂow look like? You follow the TDI cycle locally, commit and push to your central repo, that triggers a build, which ﬁres up a Docker image and runs Test Kitchen and InSpec tests. If build fails, you deal with it on your workstation and continue the cycle again.

triggers build ✅ Speaker Note: But what happens when the
build passes? This is where you can use the Continuous Delivery features available in your CI system, most of them have something built in (eg Bamboo). A successful CI build can automatically trigger a deployment to development. With enough comfort level, you could even trigger an automatic deployment to production.

triggers build ✅ deploy Speaker Note: But what happens when
the build passes? This is where you can use the Continuous Delivery features available in your CI system, most of them have something built in (eg Bamboo). A successful CI build can automatically trigger a deployment to development. With enough comfort level, you could even trigger an automatic deployment to production.

triggers build ✅ deploy ✅ Speaker Note: But what happens
when the build passes? This is where you can use the Continuous Delivery features available in your CI system, most of them have something built in (eg Bamboo). A successful CI build can automatically trigger a deployment to development. With enough comfort level, you could even trigger an automatic deployment to production.

triggers build ✅ deploy deploy ✅ Speaker Note: But what
happens when the build passes? This is where you can use the Continuous Delivery features available in your CI system, most of them have something built in (eg Bamboo). A successful CI build can automatically trigger a deployment to development. With enough comfort level, you could even trigger an automatic deployment to production.

triggers build ✅ deploy deploy ✅ ✅ Speaker Note: But
what happens when the build passes? This is where you can use the Continuous Delivery features available in your CI system, most of them have something built in (eg Bamboo). A successful CI build can automatically trigger a deployment to development. With enough comfort level, you could even trigger an automatic deployment to production.

Speaker Note: What’s the path ahead?

Speaker Note: Another interesting use of TK is multi-server testing.
You can have TK spin up several nodes, and have them all talk to each other on a private local network. We’ve had success testing Redis primary, replica and sentinel conﬁgurations as well as testing the entire ELK stack.

Speaker Note: But ultimately you’ll have to run this Redis
server somewhere, and most likely you’re going to do it in the Cloud.

Speaker Note: But is it possible to do TDI against
Cloud resources???

Speaker Note: It is, because InSpec supports AWS (and Azure)
resources out of the box. You can check on AWS EC2 instances.

Speaker Note: And ELBs that point at those EC2 instances.

Speaker Note: And even S3 buckets. You can make sure
your buckets are never publicly exposed!

Speaker Note: If you use the popular Terraform tool to
create that infrastructure, you can use the awesome kitchen-terraform plugin to tie this all together!

Speaker Note: Mind… blown!

@amaltson Speaker Note: And if you think that’s mind blowing,
you can take it to the next level by building Test Kitchen plugins.

@amaltson Speaker Note: Remember the various concepts in Test Kitchen?
They’re all extensible by inheriting from Test Kitchen’s base classes and packaging as a gem.

@amaltson Drivers: custom lifecycle Speaker Note: Remember the various concepts
in Test Kitchen? They’re all extensible by inheriting from Test Kitchen’s base classes and packaging as a gem.

@amaltson Drivers: custom lifecycle Communication: custom remote login Speaker Note:
Remember the various concepts in Test Kitchen? They’re all extensible by inheriting from Test Kitchen’s base classes and packaging as a gem.

@amaltson Drivers: custom lifecycle Communication: custom remote login Provisioners: custom
provisioning Speaker Note: Remember the various concepts in Test Kitchen? They’re all extensible by inheriting from Test Kitchen’s base classes and packaging as a gem.

provisioning Testing: custom veriﬁcation Speaker Note: Remember the various concepts in Test Kitchen? They’re all extensible by inheriting from Test Kitchen’s base classes and packaging as a gem.

provisioning Testing: custom veriﬁcation Platform: custom deployment platforms Speaker Note: Remember the various concepts in Test Kitchen? They’re all extensible by inheriting from Test Kitchen’s base classes and packaging as a gem.

Speaker Note: If you take one thing away, please test
your infrastructure and be a super hero.

Arthur Maltson @amaltson maltson.com Capital One Distinguished Engineer 70% Dev,
30% Ops Full Time DadOps @amaltson

• Slides: https://speakerdeck.com/amaltson/test-driven-infrastructure • test-driven-redis: https://github.com/amaltson/test-driven-redis • Test Kitchen: https://kitchen.ci
• InSpec: https://www.inspec.io • Kitchen Dokken: https://github.com/test-kitchen/kitchen-dokken • Kitchen Terraform: https://github.com/newcontext-oss/kitchen-terraform @amaltson Arthur Maltson

Credits • Riccardo Cuppini, Zen [Explored], https://flic.kr/p/5ehoTC • CollegeDegrees360, Computer
Problems, https://flic.kr/p/cEJpCY • Greg Heo, Big banks, https://flic.kr/p/dfb13h • Heisenberg Media, Berlin Startup Tour, https://flic.kr/p/dP6W49 • Will Humes Follow, crossed fingers, https://flic.kr/p/4s5kZ5 • Crying Under the Table With a Bottle of Wine GIF, https://mashable.com/2013/08/20/gif-origins/#3PUHZ0bAVPqj • Matthew Frederickson, Unicorns, https://flic.kr/p/5jrvmr • yosuke muroya, Unicorn, https://flic.kr/p/bpQFTw • Chris & Karen Highland, consumer confidence!, https://flic.kr/p/qKcmR2 • Quentin Meulepas, Whistler: Inukshuk, https://flic.kr/p/6izmiv • Moyan Brenn, Happiness, https://flic.kr/p/nMmBGs • Bre Pettis, Dave’s Bike Tools, https://flic.kr/p/QMVMw • F Delventhal, Safety First, https://flic.kr/p/EmGgn • MsSaraKelly, Take one: Sarah's hen do, https://flic.kr/p/fsKWAi • Simon Harrod, Strawberry Snail, https://flic.kr/p/9XkFkY • GotCredit, Safety, https://flic.kr/p/qHCmfo • Lawrence Whittemore, basement.jpg, https://flic.kr/p/c84PL • Joseph Thornton, 2013 Retina Macbook Pro, https://flic.kr/p/eu3G38 • DeclanTM, Home Server, https://flic.kr/p/4PGBb5 • Quinn Dombrowski, Servers, https://flic.kr/p/cqqwcb • Matthew Faltz, The Path, https://flic.kr/p/pA7dZQ • Anita Sollars, Niche Chat, https://www.pinterest.ca/pin/138063544803937259 • tribp, Grapes, https://flic.kr/p/dcZUgY • Nate Grigg, Thank You, https://flic.kr/p/6K41qv

Test Driven Infrastructure

Test Driven Infrastructure

More Decks by Arthur Maltson

Other Decks in Programming

Featured

Transcript