Docker in Production: A Survival Guide

Docker in Production: A Survival Guide Greg Poirier - CTO
- @GetOpsee

Who am I? • CTO at Opsee. • Lots of
years in operations, development. • Super jazzed about containers.

Docker at Opsee • I love containers. • Lightweight deployable
objects. • “Build once. Deploy anywhere.”

A Bumpy Ride

A Bumpy Ride • Adopting any new technology requires a
significant investment in the form of time and energy. • You will make mistakes. • You will learn from them.

Docker in Production • Building software for containers • Deploying
containers • Operational considerations • Logging, resource allocation.

Runtime Containers • You want thin containers. • Faster deploys.
• Faster builds. • Fewer disk problems.

Thin Containers • Avoid "OS" containers. • Avoid startup scripts.

Thin Containers • Runtime dependencies go in volumes. • Use
multiple containers and link them. • Containers cost very little. • Inodes and disk space.

Build vs. Runtime Containers • Build containers are for building.
• Compilers, deployment stuff, etc. • Runtime containers are for running. • Just the build artifacts.

Don’t Fear Multiple Containers config:  image: yourOrg/getConfig  command: /getConfig serviceName
-o /etc/config.yaml  volumes:  - /etc    service:  image: yourOrg/serviceName  command: /serviceName /etc/config.yaml  volumes_from:  - config:ro

Export Things for Humans • Put stuff in host-mounted volumes
for people • If you must • Ship stuff to S3 • Log • Emit metrics

Deploying Containers • Registries • Tags • Schedulers

Registries • Depending on registries sucks. • Downtime is extremely
frustrating. • I think they mostly understand this.

MFW Registry Downtime

Registry Downtime • Downtime can and will happen. • Restart
on the same host if you crash. • Docker or Systemd restart policy. • Don’t fail to start if you can’t pull. • ExecStartPre=-/usr/bin/docker pull…

Deploying Containers • Avoid symbolic container tags. • Tags identify
code running in a container. • You can use labels for this as well, but don’t.

Tag Your Images • Simple Example: • You run yourOrg/yourService:production
• You update the tag to point to a new image version • One of the instances in your ELB restarts. • Two versions without a deploy.

I Promise This is Bad • Deploys should be deliberate.
• Control what code is running very carefully. • Make it obvious to the casual observer what version is running.

Schedulers • Most of them are good. • Some of
them are easier. • Some of them are harder.

Choosing a Scheduler • Operational complexity. • Features. • Most
importantly: your needs.

Docker-Compose

The Power of Docker-Compose Compels You • Containers work well
together. • E.g. NSQ + Service + Configuration • Choose a scheduler that supports docker- compose. • It’s got what devs need.

Operations

Operations • Docker does not solve operational problems. • Docker’s
default configuration is not suitable for production. • Docker’s default configuration will lead to downtime.

Logging • Default logging driver: json-file • gliderlabs/logspout • So
many problems…

Don’t Use json-file in Production • Long-running containers in production
will eventually consume all of the disk space available to /var/lib/docker because of json- file’s default configuration. • Use syslog, or awslog

No Sensible Defaults Anywhere

No Sensible Defaults Anywhere • CoreOS uses json-file by default.
• Debian(s) use json-file by default. • RHEL(s) use json-file by default. • Everyone defaults to something inappropriate for production.

Breathe. </high horse>

Logspout • We tried to make logspout happen. • Problems
with connection handling, etc. • Don’t use json-log or logspout.

Disk • You really need to manage disk space carefully.
• Remove stale images. • Remove stopped containers. • Don’t store tons of state locally.

No Really... • rm -rf /var/lib/docker • docker ps -aq
| xargs docker rm • docker images -q | xargs docker rmi -f

Memory Allocation • Declare the resources you intend to use.
• This is important to do. • Pick a scheduler that supports this.

You Still Have Work to Do • V8 and JVM
will allocate heap until they are OOM killed. • Go does not adhere to resource limits. • Nothing adheres to resource limits but the kernel.

Memory Management Settings • V8 and JVM allow you to
control memory allocation. • Max heap isn’t everything. • If you don’t set max heap, they will allocate heap until the kernel kills them. • Plan for this or don’t.

Thanks! • Thanks for coming! • Thanks for listening! •
Question, comments? • @grepory on Twitter

Operators are Standing By

Docker in Production: A Survival Guide

Docker in Production: A Survival Guide

More Decks by Greg Poirier

Other Decks in Technology

Featured

Transcript