Hey there
I’m Mike Lehan
Software engineer, CTO of StuRents.com, skydiver, northerner
Follow me on Twitter @M1ke
Love/hate to https://joind.in/talk/f087d
2
Slide 3
Slide 3 text
Cloud changes
the way our
products work
But how can we move applications to the cloud
without changing the way we develop, deploy
and manage those products?
3
Slide 4
Slide 4 text
Why move to the
cloud?
Scalable
No need to migrate to
new servers when our
business grows or
requirements change
Available
The services we use must
be there when we need
them, resistant to faults in
underlying utilities
Durable
If something goes wrong
with our system, being
able to recover data or
revert changes
4
Slide 5
Slide 5 text
Where are we
coming from?
● Single server
● LAMP stack
● Separate database
● Hypervisors
5
Still love LAMP? Never fear!
Slide 6
Slide 6 text
Acronyms ahead!
Don’t worry we’ve got this
These slides will explain all
And I put an icon of a plant up there if you
need some fresh air
6
Slide 7
Slide 7 text
“
Is this really the best way
to do all this?
7
Slide 8
Slide 8 text
Your server being on
You’ll miss:
Slide 9
Slide 9 text
EC2
Elastic Compute Cloud - servers, called “instances”
AZ
Availability Zone - one or more data centres
9
RDS
Relational Database Service - managed SQL
Slide 10
Slide 10 text
AWS Regions
10
Slide 11
Slide 11 text
Avoid the loss of one
Availability Zone
EC2
Launch instances in
multiple availability zones,
there are ways we can
manage this
RDS
Tick the “Multi-AZ” option
and AWS manages
replication across zones
(adds a charge)
Other services
Some services run across
AZs, others are tied to
one - always check when
estimating costs
11
Slide 12
Slide 12 text
Introducing
Auto-scaling
12
first
Image
Configuration plus
disk snapshot
Launch Configuration
Image plus network,
server size settings
second
Availability Zones
Launch in one or more,
balance between
third
Profit
AWS starts
instances for you
?
Slide 13
Slide 13 text
“
With multiple servers, how
does web traffic reach our
application?
13
Slide 14
Slide 14 text
Mapping an IP to a domain
You’ll miss:
Slide 15
Slide 15 text
15
domain.com A 123.45.67.89
Slide 16
Slide 16 text
Introducing Elastic
Load Balancing
16
first
Choose a target group
Auto-scaling group
or set of instances
Add rules
Simple redirects to
complex forwarding
second
Health checks
Custom requests to
instances, can check
returned content
third
Availability Zones
Across all AZs by
default
btw
Slide 17
Slide 17 text
Where did my
session go?
● Sessions stored on disk
● Server changes, lose your session
● Sticky sessions to the rescue!
● Saves a cookie
17
Slide 18
Slide 18 text
“
Now I have to deploy code
to how many servers?
18
Slide 19
Slide 19 text
Using the file system
You’ll miss:
Slide 20
Slide 20 text
Each server has its own disk
This will cause problems for two areas of your
application:
● Deploying application code
● Storing content generated by or uploaded
through the application
20
Slide 21
Slide 21 text
EBS
Elastic Block Store - hard disks used by instances
EFS
Elastic File System - networked hard disk
21
S3
Simple Storage Service - cheap API driven file store
0.10
$/GB
0.30
0.023
Slide 22
Slide 22 text
Multiple servers
can access a single
EFS volume
● Stored across all AZs
● No need to set a volume size
● Write consistency
● Scales with number of stored files
22
The "E" stands for exabyte:
1,000,000,000,000,000,000 bytes
Slide 23
Slide 23 text
“
23
Files upload
slowly!
PHP files execute
even slower
Slide 24
Slide 24 text
Speed
You’ll miss:
Slide 25
Slide 25 text
Write consistency
Every file write has to copy across
multiple locations.
For single files this is fine.
For lots of files this causes big
slowdown
Some of the EFS characteristics
cause other problems
Network read
Delay isn’t noticeable for 1 file
PHP applications tend to use lots
File read delay slows down
application
File handling in app also affected
25
Slide 26
Slide 26 text
How we tried to solve
these problems
Atomic deployments
Key problem with write
time is for deployments
and app code consistency
Custom rsync + bash
solution
Opcache
Reduce dependency on
PHP file reads by caching
generated opcodes
Invalidate this cache on
deployment
S3 deployments
Remove EFS effect on
application code by
deploying code direct to
servers
Must ensure consistency
26
Slide 27
Slide 27 text
Atomic
deployments
27
first
Deploy
Copy files to EBS
on 1 instance for a
quick copy
Sync
On instance sync to
EFS in time-stamped
directory
second
Switch
Move a symlink to
ensure code
consistency
third
Problem
Deployment is still
super slow, need to
scp for quick changes
-
Slide 28
Slide 28 text
PHP Opcache for speed
Once app endpoint has
been run once
subsequent executions
should be faster
Must set This means we need a
custom app running to
invalidate cache when
code changes
28
validate_timestamps=0
first
Run as cron
Every minute, on
every instance
Check timestamp
From a file in the latest
app deployment
second
Reset opcache
Curl local PHP route
and run
third
Problems
Slow “first” loads of
any page. Opcache
now critical to normal
app performance
-
opcache_reset()
Slide 29
Slide 29 text
Well what then?
● Blue-green deployment
● CodeDeploy or other tools
29
But...
● Problems of consistency, especially front end cache
● Not designed for different types of servers
● Harder to run post-deployment code
Slide 30
Slide 30 text
Deploy via S3
30
Simple agent can monitor for deployments and synchronise across servers
Sorry EFS, you won’t cut it for code but you’re still good for shared content!
first
Upload
Developer uploads
code & timestamp
file to S3
Check
Instance checks S3
timestamp on a
schedule
second
Download
Create a lock and sync
files to current instance
third
Switch
Once all locks across all
servers released,
update a symlink
fourth
Slide 31
Slide 31 text
Modifying servers
“on-the-fly”
You’ll miss:
Slide 32
Slide 32 text
We all do it
● Sign in to a server by SSH
● Make some config changes
32
Not on AWS
● Servers can vanish and be recreated by auto scaling
Slide 33
Slide 33 text
ASG
Auto Scaling Group - we’ve already mentioned these
LC
Launch Configuration - types of instances for ASG
33
AMI
Amazon Machine Image - config + disk snapshot
Slide 34
Slide 34 text
Introducing the
master instance
34
One instance to edit
AWS doesn’t charge
for inactive
instances (except
storage)
Turn it on & change
You can edit configs
and even test them
- web traffic isn’t
routed here unless
you map a temp
domain to the IP
Turn it off & image
Create a new AMI
from the instance
Build a LC with this
AMI
Re-point your ASG
Slide 35
Slide 35 text
“
Surely we should
automate this?
35
Slide 36
Slide 36 text
Introducing Lambda
36
Generate an AMI when
an instance stops
Triggers on event
Targets 1 instance
AMI generates in
<5 minutes
Put new AMI into service
in an Auto Scaling group
Triggers manually
Find latest AMI
Copy latest LC, insert
new AMI/snapshot
first
Increase size to create
new instances
Old instances won’t
automatically update
Need new instances 1st
Increase size of ASG
second
Schedule removal of old
instances
Add ASG schedule
to reduce size
Termination policy:
delete old LC first
third
Lets you run single scripts on a schedule or triggers.
Costs basically nothing. Runs Python but not PHP
Slide 37
Slide 37 text
Cron
You’ll miss:
Slide 38
Slide 38 text
Cron is part of many apps
● Simple tasks like once a day backup
● “Semi-live” tasks like webhook/stats processing
38
ASG instances make cron awkward
● Cron tasks running twice is a bad thing
● Instances could stop or start any time
● Multiple instances make jobs hard to track/debug
Slide 39
Slide 39 text
Use a separate
control instance
Cron
Run your cron jobs here.
Has the added advantage
that cron jobs are less
likely to impact web
serving
Inspect
This server can be used
for developers to inspect
content in EFS, access
SQL via CLI (avoid public
open SQL access)
Availability
Still use an ASG! This
means if we lose an AZ
the control server restarts
elsewhere. Fix ASG to
exactly 1 instance
39
Slide 40
Slide 40 text
“
How can I SSH into
instances if they keep
moving?
40
Slide 41
Slide 41 text
EIP
Elastic IP - fixed IP that can move between instances
Route53 (not an acronym)
Domain Name Service provided by AWS
41
Slide 42
Slide 42 text
Introducing awscli
42
Sorry, they
don’t have an
icon for this
Python package
which takes API
credentials
Everything in AWS
can be done via CLI
Learning CLI also
teaches you SDK
and IAM
Returns JSON data
which can be
filtered by AWS or
parsed by “jq”
Put together we can run:
$ aws ec2 describe-instances --instance-id X \
| jq --raw-output
.Reservations[].Instances[].PublicIpAddress
Slide 43
Slide 43 text
Oh no terminal
43
Slide 44
Slide 44 text
Viewing your logs
You’ll miss:
Slide 45
Slide 45 text
With one server
45
● System level logs are generally in /var/log
● Web application might log to content files or to server
level logs
● Some app runtime errors may appear in apache/php
system logs
● Cron can have logs for app output, stderr, cron
problems (sometimes in /var/mail ??? )
Slide 46
Slide 46 text
With multiple servers
46
Slide 47
Slide 47 text
Introducing the
CloudWatch agent
47
Configurable
service by AWS
Can install on EC2
or on your own
current servers
Streams a named
log or all logs in a
directory
Streams all
directory logs to
one “log stream”
AWS Log Checker
https://github.com/M1ke/aws-log-checker
One log stream per file
Slide 48
Slide 48 text
The best loggers use CLI
48
https://github.com/jorgebastida/awslogs
Stream logs from multiple sources to terminal
Slide 49
Slide 49 text
The good parts We’ve looked at the
challenges. What good
stuff have we learned?
49
Slide 50
Slide 50 text
“
AWS is not a cloud server -
it is a cloud data centre
50
Slide 51
Slide 51 text
“
Terraform lets you model
your infrastructure as code
51
Slide 52
Slide 52 text
“
CloudFront is easy to set up to serve
your static resources right now
52
Slide 53
Slide 53 text
53
Cheers for listening
Any questions? Ask away...
Or find me on: Twitter @M1ke | Slack #phpnw & #og-aws
Liked this? Tell me on https://joind.in/talk/f087d (please be nice)
Want more? Join me for an impromptu tutorial in the delegates room, next session
Presentation template by SlidesCarnival