“…their environment and coordinate their actions through a globally shared state.”
Lukasz Guminski, Container Solutions
http://container-solutions.com/containerpilot-on-mantl/
[Diagram: two hosts (VMs or physical hardware), each running Nginx, Consul, MySQL Primary, ES Master, Prometheus, and Logstash containers serving Customers, with a cluster management & provisioning layer alongside.]
# ~/workshop/nginx/reload-nginx.sh

# Render Nginx configuration template using values from Consul,
# but do not reload because Nginx hasn't started yet
preStart() {
    consul-template \
        -once \
        -consul consul:8500 \
        -template "/etc/containerpilot/nginx.conf.ctmpl:/etc/nginx/nginx.conf"
}

# Render Nginx configuration template using values from Consul,
# then gracefully reload Nginx
onChange() {
    consul-template \
        -once \
        -consul consul:8500 \
        -template "/etc/containerpilot/nginx.conf.ctmpl:/etc/nginx/nginx.conf:nginx -s reload"
}

until
    cmd=$1
    if [ -z "$cmd" ]; then
        onChange
    fi
    shift 1
    $cmd "$@"
    [ "$?" -ne 127 ]
do
    onChange
    exit
done

The -template argument in onChange is colon-delimited:
    /etc/containerpilot/nginx.conf.ctmpl : /etc/nginx/nginx.conf : nginx -s reload
    render this file                     : to this file          : and then do this
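ContainerPilot fires the onChange handler whenever the set of healthy instances it watches in Consul changes. Purely as an illustration of that watch-and-fire loop, here is a Python sketch against Consul's HTTP health API; this is not ContainerPilot's actual implementation, and the polling interval, script path, and helper names are assumptions.

import subprocess
import time
import requests

CONSUL_URL = 'http://consul:8500'  # same Consul address as above

def healthy_backends(service='sales'):
    # Return the set of (address, port) pairs for passing instances.
    resp = requests.get('{}/v1/health/service/{}'.format(CONSUL_URL, service),
                        params={'passing': 1})
    resp.raise_for_status()
    return {(e['Service']['Address'] or e['Node']['Address'],
             e['Service']['Port']) for e in resp.json()}

def watch_and_reload(interval=5):
    # Poll Consul and run the onChange handler when membership changes.
    last_seen = None
    while True:
        backends = healthy_backends()
        if last_seen is not None and backends != last_seen:
            # same effect as the onChange() shell function above;
            # assumes reload-nginx.sh is in the working directory
            subprocess.check_call(['bash', 'reload-nginx.sh', 'onChange'])
        last_seen = backends
        time.sleep(interval)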
    # write the address:port pairs for each healthy Sales node
    {{range service "sales"}}
    server {{.Address}}:{{.Port}};
    {{end}}
    least_conn;
}{{ end }}

server {
    listen 80;
    server_name _;
    root /usr/share/nginx/html;

    {{ if service "sales" }}
    location ^~ /sales {
        # strip '/sales' from the request before passing
        # it along to the Sales upstream
        rewrite ^/sales(/.*)$ $1 break;
        proxy_pass http://sales;
        proxy_redirect off;
    }{{end}}
}
}
    # write the address:port pairs for each healthy Sales node
    server 192.168.1.101:3000;
    server 192.168.1.102:3000;
    server 192.168.1.103:3000;
    least_conn;
}

server {
    listen 80;
    server_name _;
    root /usr/share/nginx/html;

    location ^~ /sales {
        # strip '/sales' from the request before passing
        # it along to the Sales upstream
        rewrite ^/sales(/.*)$ $1 break;
        proxy_pass http://sales;
        proxy_redirect off;
    }
}
}
// ~/workshop/sales/sales.js

// getUpstreams body:
{
    // get data from Consul
    // fill upstreamHosts
    // fire callback
}

process.on('SIGHUP', function () {
    console.log('Received SIGHUP');
    getUpstreams(true, function (hosts) {
        console.log('Updated upstreamHosts');
    });
});
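The elided "get data from Consul" step is a lookup of the healthy "sales" instances via Consul's HTTP health API. A rough sketch of that lookup, written in Python for consistency with the MySQL examples later (the Consul address and the use of the requests library are assumptions):

import requests

CONSUL_URL = 'http://consul:8500'

def get_healthy_upstreams(service_name='sales'):
    # Return (address, port) pairs for all passing instances of a service.
    resp = requests.get(
        '{}/v1/health/service/{}'.format(CONSUL_URL, service_name),
        params={'passing': 1})
    resp.raise_for_status()
    upstreams = []
    for entry in resp.json():
        svc = entry['Service']
        # fall back to the node address if the service didn't register one
        address = svc.get('Address') or entry['Node']['Address']
        upstreams.append((address, svc['Port']))
    return upstreams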
    # expose for linking, but each container gets a private IP for
    # internal use as well
    expose:
      - 3306
    # infrastructure-backed service discovery requirement (Triton CNS)
    labels:
      - triton.cns.services=mysql
    # credentials come from the environment
    env_file: _env
    environment:
      - CONTAINERPILOT=file:///etc/containerpilot.json
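The "credentials from the environment" note refers to the values docker-compose injects from the _env file. A minimal sketch of how the application side could pick those up (the variable names here are illustrative, not necessarily the exact ones the workshop's manage.py uses):

import os

class Config(object):
    # Illustrative only: pull credentials and settings from the environment
    # that docker-compose populated from the _env file.
    def __init__(self, env=os.environ):
        self.mysql_db = env.get('MYSQL_DATABASE')
        self.mysql_user = env.get('MYSQL_USER')
        self.mysql_password = env.get('MYSQL_PASSWORD')
        self.repl_user = env.get('MYSQL_REPL_USER')
        self.repl_password = env.get('MYSQL_REPL_PASSWORD')
        self.datadir = env.get('MYSQL_DATADIR', '/var/lib/mysql')

config = Config()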
~/mysql $ docker-compose -p my ps
Name          Command                         State   Ports
-------------------------------------------------------------------------------
my_consul_1   /bin/start -server -bootst...   Up      53/tcp, 53/udp, 8300/tcp...
my_mysql_1    containerpilot mysqld...        Up      0.0.0.0:3306
my_mysql_2    containerpilot mysqld...        Up      0.0.0.0:3306
# preStart handler (fragment)
"""
MySQL must be running in order to execute most of our setup behavior
so we're just going to make sure the directory structures are in
place and then let the first health check handler take it from there
"""
if not os.path.isdir(os.path.join(config.datadir, 'mysql')):
    # has_snapshot() checks w/ Consul for an existing snapshot
    last_backup = has_snapshot()
    if last_backup:
        get_snapshot(last_backup)
        restore_from_snapshot(last_backup)
    else:
        # initialize_db() calls /usr/bin/mysql_install_db
        if not initialize_db():
            log.info('Skipping database setup.')
            sys.exit(0)
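The has_snapshot() call above asks Consul whether a snapshot already exists. A rough sketch of that check against Consul's KV API; the key name 'mysql-last-backup' and the requests usage are assumptions, not the workshop's exact code:

import base64
import requests

CONSUL_URL = 'http://consul:8500'
LAST_BACKUP_KEY = 'mysql-last-backup'  # hypothetical key name

def has_snapshot():
    # Ask Consul's KV store for the name of the most recent snapshot;
    # return None if no snapshot has been recorded yet.
    resp = requests.get('{}/v1/kv/{}'.format(CONSUL_URL, LAST_BACKUP_KEY))
    if resp.status_code == 404:
        return None
    resp.raise_for_status()
    # Consul returns KV values base64-encoded
    return base64.b64decode(resp.json()[0]['Value']).decode('utf-8')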
# health check handler (fragment)
"""
... Also acts as a check for whether the ContainerPilot configuration
needs to be reloaded (if it's been changed externally), or if we need
to make a backup because the backup TTL has expired.
"""
node = MySQLNode()
cp = ContainerPilot(node)
if cp.update():
    cp.reload()
    return

# Because we need MySQL up to finish initialization, we need to check
# for each pass through the health check that we've done so. The happy
# path is to check a lock file against the node state (which has been
# set above) and immediately return when we discover the lock exists.
# Otherwise, we bootstrap the instance.
was_ready = assert_initialized_for_state(node)

ctx = dict(user=config.repl_user,
           password=config.repl_password,
           timeout=cp.config['services'][0]['ttl'])
node.conn = wait_for_connection(**ctx)

# Update our lock on being the primary/standby.
if node.is_primary() or node.is_standby():
    update_session_ttl()

# Create a snapshot and send it to the object store
if all((node.is_snapshot_node(),
        (not is_backup_running()),
        (is_binlog_stale(node.conn) or is_time_for_snapshot()))):
    write_snapshot(node.conn)

mysql_query(node.conn, 'SELECT 1', ())
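update_session_ttl() is what keeps the primary/standby lock alive between health checks. In the handler above it takes no arguments and tracks its own session; purely as a sketch of the underlying Consul call (session renewal over the HTTP API), assuming the session ID is passed in explicitly:

import requests

CONSUL_URL = 'http://consul:8500'

def update_session_ttl(session_id):
    # Renew the TTL on the Consul session that backs our lock; if we stop
    # renewing (e.g. the container dies), the lock expires and a replica
    # can self-elect in its onChange handler.
    resp = requests.put('{}/v1/session/renew/{}'.format(CONSUL_URL, session_id))
    resp.raise_for_status()
    return resp.json()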
# bootstrapping the primary (fragment)
"""
... ported and reworked from the Oracle-provided Docker image:
https://github.com/mysql/mysql-docker/blob/mysql-server/5.7/docker-entrypoint.sh
"""
node.state = PRIMARY
mark_as_primary(node)
node.conn = wait_for_connection()
if node.conn:
    # if we can make a connection w/o a password then this is the
    # first pass: set up the DB, default user, replication user,
    # and expire the root password
    set_timezone_info()
    setup_root_user(node.conn)
    create_db(node.conn)
    create_default_user(node.conn)
    create_repl_user(node.conn)
    run_external_scripts('/etc/initdb.d')
    expire_root_password(node.conn)
else:
    ctx = dict(user=config.repl_user,
               password=config.repl_password,
               database=config.mysql_db)
    node.conn = wait_for_connection(**ctx)
    stop_replication(node.conn)  # in case this is a newly-promoted primary
    if USE_STANDBY:
        # if we're using a standby instance then we need to first
        # snapshot the primary so that we can bootstrap the standby.
        write_snapshot(node.conn)
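create_repl_user() is the piece replication depends on later. A sketch of what it could issue, following the same mysql_exec(conn, sql, params) pattern used in the replica setup below; the exact statements and grants in the workshop code may differ:

def create_repl_user(conn):
    # Create the replication account and grant it replication privileges.
    # User/password come from the environment-driven config shown earlier.
    sql = ("CREATE USER IF NOT EXISTS %s@'%%' IDENTIFIED BY %s; "
           "GRANT REPLICATION SLAVE, REPLICATION CLIENT ON *.* TO %s@'%%'; "
           "FLUSH PRIVILEGES;")
    mysql_exec(conn, sql, (config.repl_user, config.repl_password,
                           config.repl_user,))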
# bootstrapping a replica (fragment); the data directory was already
# restored from the snapshot our preStart downloaded
    node.conn = wait_for_connection(**ctx)
    set_primary_for_replica(node.conn)
except Exception as ex:
    log.exception(ex)


def set_primary_for_replica(conn):
    """
    Set up GTID-based replication to the primary; once this is set the
    replica will automatically try to catch up with the primary's last
    transactions.
    """
    primary = get_primary_host()  # gets the primary's address from Consul
    sql = ('CHANGE MASTER TO '
           'MASTER_HOST = %s, '
           'MASTER_USER = %s, '
           'MASTER_PASSWORD = %s, '
           'MASTER_PORT = 3306, '
           'MASTER_CONNECT_RETRY = 60, '
           'MASTER_AUTO_POSITION = 1, '
           'MASTER_SSL = 0; '
           'START SLAVE;')
    mysql_exec(conn, sql, (primary, config.repl_user, config.repl_password,))
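get_primary_host() resolves the primary's address from Consul. A simplified sketch of that lookup over the HTTP health API (the 'mysql-primary' service name and the requests usage are assumptions for illustration; the workshop's version also accepts a primary argument, as seen in the onChange handler later):

import requests

CONSUL_URL = 'http://consul:8500'

def get_primary_host():
    # Ask Consul for the passing instances of the primary service and
    # return the first one's address.
    resp = requests.get(CONSUL_URL + '/v1/health/service/mysql-primary',
                        params={'passing': 1})
    resp.raise_for_status()
    instances = resp.json()
    if not instances:
        raise Exception('no healthy primary registered in Consul')
    entry = instances[0]
    return entry['Service'].get('Address') or entry['Node']['Address']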
[Flowchart: ask Consul for the primary. If there is no primary, set a lock in Consul w/ a TTL; if we get it, "I'm the primary!" and we update the lock TTL w/ each health check. If someone else is the primary, "I'm a replica!": rewrite the ContainerPilot config and SIGHUP. Otherwise, go back to start.]
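The "set lock in Consul w/ TTL" step maps onto Consul sessions: create a session with a TTL, then try to acquire the primary key with it. A sketch of those two HTTP API calls; the mark_with_session name and PRIMARY_KEY come from the onChange handler shown later, while create_session and the TTL value are assumptions:

import json
import requests

CONSUL_URL = 'http://consul:8500'
PRIMARY_KEY = 'mysql-primary'  # assumption: KV key used for the lock

def create_session(ttl='60s'):
    # Create a Consul session with a TTL; the lock dies if we stop renewing it.
    resp = requests.put(CONSUL_URL + '/v1/session/create',
                        data=json.dumps({'TTL': ttl, 'Behavior': 'release'}))
    resp.raise_for_status()
    return resp.json()['ID']

def mark_with_session(key, value, session_id):
    # Try to acquire the lock on `key`; Consul returns true only if we won it.
    resp = requests.put('{}/v1/kv/{}'.format(CONSUL_URL, key),
                        params={'acquire': session_id},
                        data=value)
    resp.raise_for_status()
    return resp.json()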
# health check handler (fragment), revisited with the failover locking detail
"""
... Also acts as a check for whether the ContainerPilot configuration
needs to be reloaded (if it's been changed externally), or if we need
to make a backup because the backup TTL has expired.
"""
node = MySQLNode()
cp = ContainerPilot(node)
if cp.update():
    cp.reload()
    return

was_ready = assert_initialized_for_state(node)

# cp.reload() will exit early so no need to set up a
# connection until this point
ctx = dict(user=config.repl_user,
           password=config.repl_password,
           timeout=cp.config['services'][0]['ttl'])
node.conn = wait_for_connection(**ctx)

# Update our lock on being the primary/standby.
# If this lock is allowed to expire and the health check for the primary
# fails, the `onChange` handlers for the replicas will try to self-elect
# as primary by obtaining the lock.
# If this node can update the lock but the DB fails its health check,
# then the operator will need to manually intervene if they want to
# force a failover. This architecture is a result of Consul not
# permitting us to acquire a new lock on a health-checked session if the
# health check is *currently* failing, but has the happy side-effect of
# reducing the risk of flapping on a transient health check failure.
if node.is_primary() or node.is_standby():
    update_session_ttl()

# Create a snapshot and send it to the object store.
if all((node.is_snapshot_node(),
        (not is_backup_running()),
        (is_binlog_stale(node.conn) or is_time_for_snapshot()))):
    write_snapshot(node.conn)

mysql_query(node.conn, 'SELECT 1', ())
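The docstring's "backup TTL has expired" check corresponds to is_time_for_snapshot(). As a rough sketch of one way to implement it, comparing a timestamp stored in Consul's KV store against a backup interval (the key name and interval are hypothetical, not the workshop's exact code):

import base64
import time
import requests

CONSUL_URL = 'http://consul:8500'
BACKUP_TTL = 86400  # assumption: one day between scheduled snapshots
LAST_BACKUP_TIME_KEY = 'mysql-last-backup-time'  # hypothetical key name

def is_time_for_snapshot():
    # True if the last recorded snapshot is older than the backup TTL.
    resp = requests.get('{}/v1/kv/{}'.format(CONSUL_URL, LAST_BACKUP_TIME_KEY))
    if resp.status_code == 404:
        return True  # never taken a snapshot
    resp.raise_for_status()
    last = float(base64.b64decode(resp.json()[0]['Value']).decode('utf-8'))
    return (time.time() - last) > BACKUP_TTL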
# onChange handler (fragment)
ctx = dict(user=config.repl_user,
           password=config.repl_password,
           timeout=cp.config['services'][0]['ttl'])
node.conn = wait_for_connection(**ctx)

# need to stop replication whether we're the new primary or not
stop_replication(node.conn)

while True:
    try:
        # if there is no primary node, we'll try to obtain the lock.
        # if we get the lock we'll reload as the new primary, otherwise
        # someone else got the lock but we don't know who yet so loop
        primary = get_primary_node()
        if not primary:
            session_id = get_session(no_cache=True)
            if mark_with_session(PRIMARY_KEY, node.hostname, session_id):
                node.state = PRIMARY
                if cp.update():
                    cp.reload()
                return
            else:
                # we lost the race to lock the session for ourselves
                time.sleep(1)
                continue

        # we know who the primary is but not whether they're healthy.
        # if it's not healthy, we'll throw an exception and start over.
        ip = get_primary_host(primary=primary)
        if ip == node.ip:
            if cp.update():
                cp.reload()
            return

        set_primary_for_replica(node.conn)
        return
    except Exception as ex:
        # This exception gets thrown if the session lock for the
        # `mysql-primary` key has not expired yet (but there's no healthy
        # primary either), or if the replica's target primary isn't ready yet.
        log.debug(ex)
        time.sleep(1)  # avoid hammering Consul
        continue
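get_primary_node() and get_session() aren't shown on this slide. A sketch of what get_primary_node() could look like: read the lock key and trust it only while a Consul session still holds it (PRIMARY_KEY as above; the return value in the workshop code may differ):

import base64
import requests

CONSUL_URL = 'http://consul:8500'
PRIMARY_KEY = 'mysql-primary'  # key name from the handler above

def get_primary_node():
    # Return the hostname written to the lock key, but only if a live
    # session still holds the lock; otherwise return None so the caller
    # can try to self-elect.
    resp = requests.get('{}/v1/kv/{}'.format(CONSUL_URL, PRIMARY_KEY))
    if resp.status_code == 404:
        return None
    resp.raise_for_status()
    entry = resp.json()[0]
    if not entry.get('Session'):
        return None  # lock has expired or been released
    return base64.b64decode(entry['Value']).decode('utf-8')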
[Sequence diagram: each replica's onChange handler repeatedly asks Consul for the primary; one node sets the lock in Consul and succeeds ("Ok, I'm primary"), registers as primary, passes its health check ("Healthy!"), and Consul fires the onChange handler on the remaining replicas.]