Slide 1

Slide 1 text

What you’ll miss in AWS & how to find it

Slide 2

Slide 2 text

Hey there I’m Mike Lehan Software engineer, CTO of StuRents.com, skydiver, northerner Follow me on Twitter @M1ke Love/hate to https://joind.in/talk/f087d 2

Slide 3

Slide 3 text

Cloud changes the way our products work But how can we move applications to the cloud without changing the way we develop, deploy and manage those products? 3

Slide 4

Slide 4 text

Why move to the cloud? Scalable No need to migrate to new servers when our business grows or requirements change Available The services we use must be there when we need them, resistant to faults in underlying utilities Durable If something goes wrong with our system, being able to recover data or revert changes 4

Slide 5

Slide 5 text

Where are we coming from? ● Single server ● LAMP stack ● Separate database ● Hypervisors 5 Still love LAMP? Never fear!

Slide 6

Slide 6 text

Acronyms ahead! Don’t worry we’ve got this These slides will explain all And I put an icon of a plant up there if you need some fresh air 6

Slide 7

Slide 7 text

“ Is this really the best way to do all this? 7

Slide 8

Slide 8 text

Your server being on You’ll miss:

Slide 9

Slide 9 text

EC2 Elastic Compute Cloud - servers, called “instances” AZ Availability Zone - one or more data centres 9 RDS Relational Database Service - managed SQL

Slide 10

Slide 10 text

AWS Regions 10

Slide 11

Slide 11 text

Avoid the loss of one Availability Zone EC2 Launch instances in multiple availability zones, there are ways we can manage this RDS Tick the “Multi-AZ” option and AWS manages replication across zones (adds a charge) Other services Some services run across AZs, others are tied to one - always check when estimating costs 11

Slide 12

Slide 12 text

Introducing Auto-scaling 12 first Image Configuration plus disk snapshot Launch Configuration Image plus network, server size settings second Availability Zones Launch in one or more, balance between third Profit AWS starts instances for you ?

Slide 13

Slide 13 text

“ With multiple servers, how does web traffic reach our application? 13

Slide 14

Slide 14 text

Mapping an IP to a domain You’ll miss:

Slide 15

Slide 15 text

15 domain.com A 123.45.67.89

Slide 16

Slide 16 text

Introducing Elastic Load Balancing 16 first Choose a target group Auto-scaling group or set of instances Add rules Simple redirects to complex forwarding second Health checks Custom requests to instances, can check returned content third Availability Zones Across all AZs by default btw

Slide 17

Slide 17 text

Where did my session go? ● Sessions stored on disk ● Server changes, lose your session ● Sticky sessions to the rescue! ● Saves a cookie 17

Slide 18

Slide 18 text

“ Now I have to deploy code to how many servers? 18

Slide 19

Slide 19 text

Using the file system You’ll miss:

Slide 20

Slide 20 text

Each server has its own disk This will cause problems for two areas of your application: ● Deploying application code ● Storing content generated by or uploaded through the application 20

Slide 21

Slide 21 text

EBS Elastic Block Store - hard disks used by instances EFS Elastic File System - networked hard disk 21 S3 Simple Storage Service - cheap API driven file store 0.10 $/GB 0.30 0.023

Slide 22

Slide 22 text

Multiple servers can access a single EFS volume ● Stored across all AZs ● No need to set a volume size ● Write consistency ● Scales with number of stored files 22 The "E" stands for exabyte: 1,000,000,000,000,000,000 bytes

Slide 23

Slide 23 text

“ 23 Files upload slowly! PHP files execute even slower

Slide 24

Slide 24 text

Speed You’ll miss:

Slide 25

Slide 25 text

Write consistency Every file write has to copy across multiple locations. For single files this is fine. For lots of files this causes big slowdown Some of the EFS characteristics cause other problems Network read Delay isn’t noticeable for 1 file PHP applications tend to use lots File read delay slows down application File handling in app also affected 25

Slide 26

Slide 26 text

How we tried to solve these problems Atomic deployments Key problem with write time is for deployments and app code consistency Custom rsync + bash solution Opcache Reduce dependency on PHP file reads by caching generated opcodes Invalidate this cache on deployment S3 deployments Remove EFS effect on application code by deploying code direct to servers Must ensure consistency 26

Slide 27

Slide 27 text

Atomic deployments 27 first Deploy Copy files to EBS on 1 instance for a quick copy Sync On instance sync to EFS in time-stamped directory second Switch Move a symlink to ensure code consistency third Problem Deployment is still super slow, need to scp for quick changes -

Slide 28

Slide 28 text

PHP Opcache for speed Once app endpoint has been run once subsequent executions should be faster Must set This means we need a custom app running to invalidate cache when code changes 28 validate_timestamps=0 first Run as cron Every minute, on every instance Check timestamp From a file in the latest app deployment second Reset opcache Curl local PHP route and run third Problems Slow “first” loads of any page. Opcache now critical to normal app performance - opcache_reset()

Slide 29

Slide 29 text

Well what then? ● Blue-green deployment ● CodeDeploy or other tools 29 But... ● Problems of consistency, especially front end cache ● Not designed for different types of servers ● Harder to run post-deployment code

Slide 30

Slide 30 text

Deploy via S3 30 Simple agent can monitor for deployments and synchronise across servers Sorry EFS, you won’t cut it for code but you’re still good for shared content! first Upload Developer uploads code & timestamp file to S3 Check Instance checks S3 timestamp on a schedule second Download Create a lock and sync files to current instance third Switch Once all locks across all servers released, update a symlink fourth

Slide 31

Slide 31 text

Modifying servers “on-the-fly” You’ll miss:

Slide 32

Slide 32 text

We all do it ● Sign in to a server by SSH ● Make some config changes 32 Not on AWS ● Servers can vanish and be recreated by auto scaling

Slide 33

Slide 33 text

ASG Auto Scaling Group - we’ve already mentioned these LC Launch Configuration - types of instances for ASG 33 AMI Amazon Machine Image - config + disk snapshot

Slide 34

Slide 34 text

Introducing the master instance 34 One instance to edit AWS doesn’t charge for inactive instances (except storage) Turn it on & change You can edit configs and even test them - web traffic isn’t routed here unless you map a temp domain to the IP Turn it off & image Create a new AMI from the instance Build a LC with this AMI Re-point your ASG

Slide 35

Slide 35 text

“ Surely we should automate this? 35

Slide 36

Slide 36 text

Introducing Lambda 36 Generate an AMI when an instance stops Triggers on event Targets 1 instance AMI generates in <5 minutes Put new AMI into service in an Auto Scaling group Triggers manually Find latest AMI Copy latest LC, insert new AMI/snapshot first Increase size to create new instances Old instances won’t automatically update Need new instances 1st Increase size of ASG second Schedule removal of old instances Add ASG schedule to reduce size Termination policy: delete old LC first third Lets you run single scripts on a schedule or triggers. Costs basically nothing. Runs Python but not PHP

Slide 37

Slide 37 text

Cron You’ll miss:

Slide 38

Slide 38 text

Cron is part of many apps ● Simple tasks like once a day backup ● “Semi-live” tasks like webhook/stats processing 38 ASG instances make cron awkward ● Cron tasks running twice is a bad thing ● Instances could stop or start any time ● Multiple instances make jobs hard to track/debug

Slide 39

Slide 39 text

Use a separate control instance Cron Run your cron jobs here. Has the added advantage that cron jobs are less likely to impact web serving Inspect This server can be used for developers to inspect content in EFS, access SQL via CLI (avoid public open SQL access) Availability Still use an ASG! This means if we lose an AZ the control server restarts elsewhere. Fix ASG to exactly 1 instance 39

Slide 40

Slide 40 text

“ How can I SSH into instances if they keep moving? 40

Slide 41

Slide 41 text

EIP Elastic IP - fixed IP that can move between instances Route53 (not an acronym) Domain Name Service provided by AWS 41

Slide 42

Slide 42 text

Introducing awscli 42 Sorry, they don’t have an icon for this Python package which takes API credentials Everything in AWS can be done via CLI Learning CLI also teaches you SDK and IAM Returns JSON data which can be filtered by AWS or parsed by “jq” Put together we can run: $ aws ec2 describe-instances --instance-id X \ | jq --raw-output .Reservations[].Instances[].PublicIpAddress

Slide 43

Slide 43 text

Oh no terminal 43

Slide 44

Slide 44 text

Viewing your logs You’ll miss:

Slide 45

Slide 45 text

With one server 45 ● System level logs are generally in /var/log ● Web application might log to content files or to server level logs ● Some app runtime errors may appear in apache/php system logs ● Cron can have logs for app output, stderr, cron problems (sometimes in /var/mail ??? )

Slide 46

Slide 46 text

With multiple servers 46

Slide 47

Slide 47 text

Introducing the CloudWatch agent 47 Configurable service by AWS Can install on EC2 or on your own current servers Streams a named log or all logs in a directory Streams all directory logs to one “log stream” AWS Log Checker https://github.com/M1ke/aws-log-checker One log stream per file

Slide 48

Slide 48 text

The best loggers use CLI 48 https://github.com/jorgebastida/awslogs Stream logs from multiple sources to terminal

Slide 49

Slide 49 text

The good parts We’ve looked at the challenges. What good stuff have we learned? 49

Slide 50

Slide 50 text

“ AWS is not a cloud server - it is a cloud data centre 50

Slide 51

Slide 51 text

“ Terraform lets you model your infrastructure as code 51

Slide 52

Slide 52 text

“ CloudFront is easy to set up to serve your static resources right now 52

Slide 53

Slide 53 text

53 Cheers for listening Any questions? Ask away... Or find me on: Twitter @M1ke | Slack #phpnw & #og-aws Liked this? Tell me on https://joind.in/talk/f087d (please be nice) Want more? Join me for an impromptu tutorial in the delegates room, next session Presentation template by SlidesCarnival