Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Let's take the monolith to the cloud ๐Ÿš€

Let's take the monolith to the cloudย ๐Ÿš€

How can you take an existing monolith to the cloud with very minimal effort? In this talk we will explore an architecture that can help you to achieve that while focusing on scalability and resilience.

Luciano Mammino

March 01, 2022
Tweet

More Decks by Luciano Mammino

Other Decks in Technology

Transcript

  1. Let's take the monolith to the cloud ๐Ÿš€ Luciano Mammino

    ( ) @loige loige.link/mono-lit 2022-03-01 1
  2. I'm Luciano ๐Ÿ‘‹ Senior Architect @ fourTheorem (Dublin ) nodejsdp.link

    ๐Ÿ“” Co-Author of Node.js Design Patterns ๐Ÿ‘‰ Let's connect! (blog) (twitter) (twitch) (github) loige.co @loige loige lmammino 3
  3. We are business focused technologists that deliver. | | Accelerated

    Serverless AI as a Service Platform Modernisation We are hiring do you want to ? work with us loige 4
  4. I co-host a podcast about AWS with ! @eoins @loige

    awsbites.com Pleaze, subscribe ๐Ÿ˜‡ 5
  5. Business Summary @loige SaaS CMS for legal practices 1 founder

    + 1 developer ๐Ÿ’ธ Bootstrapped business ๐Ÿ™Œ Good MVP, getting attention in the market ๐Ÿ’ช Started a TRIAL with a big customer 10
  6. Current problems @loige ๐Ÿ“ˆ The company is growing โš– but

    the technology does not scale! ๐Ÿ“ฆ 1 monolithic server ๐Ÿ”ฅ Frequent failures = ๐Ÿคฌ unhappy customers ๐Ÿ˜ฅ The business is at risk! 11
  7. Desired State @loige โš– More reliable & scalable infrastructure ๐Ÿ‘Œ

    Minimal amount of change required* * the team is not skilled with the cloud & containers, we need to keep cognitive load low 12
  8. @loige ๐Ÿค” "I heard that the cloud is great but

    we don't have the time and the skills to re-architect everything as micro-services!" 13
  9. Example use cases @loige A user logs in the application

    and they should be able to see all their previously uploaded legal documents A user can upload new documents and organize them by providing speci๏ฌc tags (client id, case number, etc.) A user might search for documents containing speci๏ฌc keywords or tags 16
  10. Can we take the monolith to the cloud and make

    it resilient & scalable? @loige 18
  11. Region @loige A physical location around the world (e.g. North

    Virginia, Ireland or Sydney) where AWS hosts a group of data centers. Regions help to provision infrastructure that is closer to the customers, so that our applications can have low latency and feel responsive. 28
  12. Availability Zone (AZ) @loige Discrete data center with redundant power,

    networking, and connectivity in an AWS Region. Data centers in di๏ฌ€erent availability zones are disjointed from one another, so if thereโ€™s a serious outage, thatโ€™s rarely a๏ฌ€ecting more than one availability zone at the same time. 29
  13. Availability Zone (AZ) @loige Itโ€™s good practice to spread redundant

    infrastructure across di๏ฌ€erent availability zones in a given region to guarantee high availability. 30
  14. VPC @loige A virtual (private) network provisioned in a given

    region for a given AWS account. It is logically isolated from other virtual networks in AWS. Every VPC has a range of private IP addresses organised in one or more subnets. 31
  15. Subnet @loige A range of IPs in a given VPC

    and in a given availability zone that can be used to spin up and connect resources within the network. Subnets can be public or private. A public subnet can be used to run instances that can have a public IP assigned to them and can be reachable from outside the VPC itself. 32
  16. Subnet @loige Itโ€™s good practice to keep front-facing servers (or

    load balancers) in public subnets and keep everything else (backend services, databases, etc.) in private subnets. Tra๏ฌƒc between subnets can be enabled through routing tables to allow for instance a load balancer in a public subnet to forward tra๏ฌƒc to backend instances in a private subnet. 33
  17. Quick Recap @loige Region: physical location with data centers Availability

    Zone: data center in a region VPC: a virtual private network in a region Subnet: range of IPs in a VPC in a given AZ 34
  18. โœ TODO List @loige โ˜ Create an AWS account โ˜

    Select a region โ˜ Create and con๏ฌgure a VPC 36
  19. โ˜‘ Create an AWS account โ˜‘ Select a region โ˜‘

    Create and con๏ฌgure a VPC โœ TODO List @loige 38
  20. โœ TODO List @loige โ˜ Load Balancer โ˜ EC2 โ˜

    S3 โ˜ RDS โ˜ ElastiCache โ˜ Route 53 39
  21. Application Load Balancer (ALB) @loige The entry point to all

    the application tra๏ฌƒc. Layer 7 Load Balancer (HTTP, HTTPS, WebSocket, gRPC). Highly available: replicated in all our public subnets. 40
  22. Application Load Balancer (ALB) @loige Scalable: can handle millions of

    request per second. Managed service: we don't need to con๏ฌgure the OS or install software patches. Can be integrated with ACM (AWS Certi๏ฌcate Manager) to support HTTPS. 41
  23. Application Load Balancer (ALB) @loige Target group ๐Ÿ”ฅ /health โœ…

    โŒ /health /health โœ… Unhealty targets won't get any tra๏ฌƒc 43
  24. Application Load Balancer (ALB) @loige Targets can be added dynamically.

    We can scale targets automatically using autoscaling groups. E.g. Add or remove instances based on num requests in-๏ฌ‚ight or on avg CPU of the current instances. 44
  25. How does it scale? @loige Being a managed service, scalability

    is mostly handled out of the box by AWS. 45
  26. Resiliency @loige A load balancer can distribute tra๏ฌƒc to multiple

    AZs, so if one of them becomes unavailable it will keep distributing tra๏ฌƒc to the remaining ones. 46
  27. โœ TODO List @loige โ˜‘ Load Balancer โ˜ EC2 โ˜

    S3 โ˜ RDS โ˜ ElastiCache โ˜ Route 53 47
  28. EC2 - Virtual Machine @loige Virtual machine running all the

    necessary software for the service (Nginx, Node.js, app code, etc.) They need to use Security Groups (allow tra๏ฌƒc) and IAM Roles (allow them to access other AWS resources like S3). 48
  29. EC2 - Virtual Machine @loige We will need to provision

    multiple machines dynamically. Challenges: Consistency ๐Ÿฎ Cattle vs ๐Ÿ™€ Pet mindset Stateless applications 49
  30. Consistency @loige All our virtual machines have to be the

    same: we need to build an AMI (Amazon Machine Image). An AMI contains OS, libraries, software and source code. You can use an AMI to start a new instance. 50
  31. Consistency @loige While we can build an AMI manually, it's

    better to use tools to automate the work: Hashicorp Packer EC2 Image Builder 51
  32. ๐Ÿฎ Cattle vs ๐Ÿ™€ Pet mindset @loige Once an instance

    has been launched we shouldn't change it anymore (e.g update the OS, install new softare, update the code, etc.) If we need to change something, we build a new image and deploy new instances. Instances are disposable! 52
  33. Stateless @loige We are load balancing tra๏ฌƒc so a user

    might be served by di๏ฌ€erent instances during their session. A single instance should not store any state (e.g. user sessions, uploaded ๏ฌles, etc.) State should be stored outside instances (ElastiCache, S3, RDS, etc). 53
  34. Stateless @loige Making an application stateless might require a good

    amount of code change. A shortcut to this might be to enable in the ALB, but it's not recommended for scalability and resiliency. sticky sessions 54
  35. How does it scale? @loige Every instance will be able

    to handle a certain number of requests per second. We can scale by adding more instances when the tra๏ฌƒc grows. 55
  36. Resiliency @loige We should have at least 1 instance per

    availability zone. If there is an AWS outage, the instances on the healthy availability zone will keep handling requests. We can use an autoscaling group to make sure that unhealthy instances are replaced. 56
  37. โœ TODO List @loige โ˜‘ Load Balancer โ˜‘ EC2 โ˜

    S3 โ˜ RDS โ˜ ElastiCache โ˜ Route 53 57
  38. Simple Storage Service (S3) @loige One of the very ๏ฌrst

    AWS services and (probably) the most famous one. Object storage service: Allows you to store any amount of data durably. You need to use the SDK to read and write data. 58
  39. Simple Storage Service (S3) @loige Data can be organised in

    logical containers called Buckets. Key/value model: Inside a bucket you can store data by providing a key and the content. 59
  40. Simple Storage Service (S3) @loige const AWS = require('aws-sdk') const

    s3 = new AWS.S3() const params = { Bucket: 'my-bucket', Key: 'my-first-s3-file.txt', Body: Buffer.from('Hello, AWS') } s3.upload(params, (err) => { if (err) { console.error(err) } else { console.log('Upload successful') } }) 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 const AWS = require('aws-sdk') 1 2 const s3 = new AWS.S3() 3 4 const params = { 5 Bucket: 'my-bucket', 6 Key: 'my-first-s3-file.txt', 7 Body: Buffer.from('Hello, AWS') 8 } 9 10 s3.upload(params, (err) => { 11 if (err) { 12 console.error(err) 13 } else { 14 console.log('Upload successful') 15 } 16 }) 17 const s3 = new AWS.S3() const AWS = require('aws-sdk') 1 2 3 4 const params = { 5 Bucket: 'my-bucket', 6 Key: 'my-first-s3-file.txt', 7 Body: Buffer.from('Hello, AWS') 8 } 9 10 s3.upload(params, (err) => { 11 if (err) { 12 console.error(err) 13 } else { 14 console.log('Upload successful') 15 } 16 }) 17 const params = { Bucket: 'my-bucket', Key: 'my-first-s3-file.txt', Body: Buffer.from('Hello, AWS') } const AWS = require('aws-sdk') 1 2 const s3 = new AWS.S3() 3 4 5 6 7 8 9 10 s3.upload(params, (err) => { 11 if (err) { 12 console.error(err) 13 } else { 14 console.log('Upload successful') 15 } 16 }) 17 s3.upload(params, (err) => { if (err) { console.error(err) } else { console.log('Upload successful') } }) const AWS = require('aws-sdk') 1 2 const s3 = new AWS.S3() 3 4 const params = { 5 Bucket: 'my-bucket', 6 Key: 'my-first-s3-file.txt', 7 Body: Buffer.from('Hello, AWS') 8 } 9 10 11 12 13 14 15 16 17 const AWS = require('aws-sdk') const s3 = new AWS.S3() const params = { Bucket: 'my-bucket', Key: 'my-first-s3-file.txt', Body: Buffer.from('Hello, AWS') } s3.upload(params, (err) => { if (err) { console.error(err) } else { console.log('Upload successful') } }) 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 60
  41. Simple Storage Service (S3) @loige Too much code to change?

    A ๏ฌrst migration could be done by using a something like to create a "virtual ๏ฌlesystem" that allows you to read/write to S3 seamlessly. s3fs-fuse 61
  42. How does it scale? @loige S3 is a managed service

    which automatically scales to thousands of read/write operations per second. 62
  43. Resiliency @loige S3 is provisioned in multiple AZs by default

    and it makes multiple copies of your data. All of this happens transparently, no special con๏ฌguration required. 63
  44. โœ TODO List @loige โ˜‘ Load Balancer โ˜‘ EC2 โ˜‘

    S3 โ˜ RDS โ˜ ElastiCache โ˜ Route 53 64
  45. Relational Database Service (RDS) @loige Managed relational database service for

    MySql, PostgreSQL, MariaDB, Oracle & SQL Server. Being a managed service, AWS takes care of most common concerns like backups and updates (con๏ฌgurable). 65
  46. How does it scale? @loige RDS PostgreSQL supports Read Replicas:

    you can provision additional instances to which you can distribute heavy read-only queries. 66
  47. Resiliency @loige RDS PostgreSQL can be con๏ฌgured to work in

    Multi-AZ mode: this means that there will be one or two standby copies of the database in di๏ฌ€erent AZs. If the primary DB instance or the primary AZ have an outage, one of the standby copies are promoted to become "the primary" instance. 67
  48. Resiliency @loige Failover is fast but not instantaneous (60-120 seconds),

    so we need to make sure to plan for possible connectivity failures in your app and show clear error messages to the users. 68
  49. โœ TODO List @loige โ˜‘ Load Balancer โ˜‘ EC2 โ˜‘

    S3 โ˜‘ RDS โ˜ ElastiCache โ˜ Route 53 69
  50. ElastiCache @loige Managed in-memory caching service supporting Redis and Memcached.

    Meant to be used for use cases that don't require durability like data cache, session stores, gaming leaderboards, streaming, and analytics. AWS takes care of maintenance. 70
  51. How does it scale? @loige A single instance of Redis

    (with enough memory) can scale to signi๏ฌcant amounts of tra๏ฌƒc. If you need more, you can run ElastiCache Redis in Cluster Mode and shard your data across multiple Redis instances. 71
  52. Resiliency @loige ElastiCache Redis can operate in Multi-AZ mode. Similarly

    to RDS, in case of failures, there might be some downtime while the new master is promoted. We need to make sure the app accounts for Redis connection failures. 72
  53. โœ TODO List @loige โ˜‘ Load Balancer โ˜‘ EC2 โ˜‘

    S3 โ˜‘ RDS โ˜‘ ElastiCache โ˜ Route 53 73
  54. Route53 @loige Highly available and scalable cloud DNS service. Can

    be used to direct tra๏ฌƒc on a given domain to our Application Load Balancer. 74
  55. โœ TODO List @loige โ˜‘ Load Balancer โ˜‘ EC2 โ˜‘

    S3 โ˜‘ RDS โ˜‘ ElastiCache โ˜‘ Route 53 75
  56. Infrastructure as Code (IaaC) @loige We could provision everything "manually"

    from the web console, but... It will be hard to create consistent environments for development and QA It will be hard to change things incrementally How would we test and review changes before applying them in production? 76
  57. Infrastructure as Code (IaaC) @loige It's better to de๏ฌne all

    the infrastructure using code. There are several tools that can help us with that: CloudFormation Hashicorp Terraform Cloud Development Kit (CDK) Pulumi 77
  58. { "AWSTemplateFormatVersion" : "2010-09-09", "Description" : "AWS CloudFormation Sample Template

    EC2InstanceWithSecurityGroupSample: Create an Amazon EC2 instance running the A "Parameters" : { "KeyName": { "Description" : "Name of an existing EC2 KeyPair to enable SSH access to the instance", "Type": "AWS::EC2::KeyPair::KeyName", "ConstraintDescription" : "must be the name of an existing EC2 KeyPair." }, "InstanceType" : { "Description" : "WebServer EC2 instance type", "Type" : "String", "Default" : "t2.small", "AllowedValues" : [ "t1.micro", "t2.nano", "t2.micro", "t2.small", "t2.medium", "t2.large", "m1.small", "m1.medium", "m1.large" , "ConstraintDescription" : "must be a valid EC2 instance type." }, "SSHLocation" : { "Description" : "The IP address range that can be used to SSH to the EC2 instances", "Type": "String", "MinLength": "9", "MaxLength": "18", "Default": "0.0.0.0/0", "AllowedPattern": "(\\d{1,3})\\.(\\d{1,3})\\.(\\d{1,3})\\.(\\d{1,3})/(\\d{1,2})", "ConstraintDescription": "must be a valid IP CIDR range of the form x.x.x.x/x." } }, "Mappings" : { "AWSInstanceType2Arch" : { "t1.micro" : { "Arch" : "HVM64" }, "t2.nano" : { "Arch" : "HVM64" }, "t2.micro" : { "Arch" : "HVM64" }, @loige Example of CloudFormation template 78
  59. import * as cdk from '@aws-cdk/core' import * as ec2

    from '@aws-cdk/aws-ec2' export class CdkUbuntuEc2Stack extends cdk.Stack { constructor (scope: cdk.Construct, id: string, props?: cdk.StackProps) { super(scope, id, { env: { account: process.env.CDK_DEPLOY_ACCOUNT || process.env.CDK_DEFAULT_ACCOUNT, region: process.env.CDK_DEPLOY_REGION || process.env.CDK_DEFAULT_REGION }, ...props }) const defaultVpc = ec2.Vpc.fromLookup(this, 'VPC', { isDefault: true }) const userData = ec2.UserData.forLinux() userData.addCommands( 'apt-get update -y', 'apt-get install -y git awscli ec2-instance-connect', 'until git clone https://github.com/aws-quickstart/quickstart-linux-utilities.git; do echo "Retrying"; done', 'cd /quickstart-linux-utilities', 'source quickstart-cfn-tools.source', 'qs_update-os || qs_err', 'qs_bootstrap_pip || qs_err', 'qs_aws-cfn-bootstrap || qs_err', 'mkdir -p /opt/aws/bin', 'ln -s /usr/local/bin/cfn-* /opt/aws/bin/' ) const machineImage = ec2.MachineImage.fromSSMParameter( '/aws/service/canonical/ubuntu/server/focal/stable/current/amd64/hvm/ebs-gp2/ami-id', ec2.OperatingSystemType.LINUX, userData ) const myVmSecurityGroup = new ec2.SecurityGroup(this, 'myVmSecurityGroup', { @loige Example of a CDK stack (TypeScript) 79
  60. Switch over @loige ๐Ÿ™€ How do we migrate the data?

    ๐Ÿ˜ฅ How do we switch the tra๏ฌƒc to the new infrastructure? 82
  61. Streamlined data migration @loige Update the "old" code-base to save

    every new ๏ฌle also to S3. Copy all the existing ๏ฌle to the S3 bucket (S3 sync). 83
  62. Streamlined data migration @loige AWS Database Migration service allows you

    to replicate all the data from the old database to the new one. It will also keep the 2 Databases in sync during the switch over! 84
  63. Switching traffic @loige Request a new certi๏ฌcate using AWS Certi๏ฌcate

    Manager (ACM). Can be validated by email or DNS. Point your DNS to the new Load Balancer in AWS! 85
  64. New opportunities ๐Ÿ˜Š @loige We can scale dynamically! As the

    team grows and the system gets more complicated we can start to think about micro- services. We can start to play with other AWS services (E.g. SQS + Lambda for background task processing). 88
  65. ๐Ÿ’ธ Cost @loige calculator.aws/#/estimate?id=1e0779adb67305166c01a583f5d7f61f7c92b029 Load Balancer $ 24.24 EC2 (6

    instances) $ 85.87 ElastiCache Redis (3 instances) $ 78.84 RDS PostgreSQL (3 instances) $ 155.48 S3 (1TB) $ 32.55 TOT $ 376.98 90
  66. ๐Ÿ’ธ Cost @loige Cost estimates are always a bit of

    a "gamble"... I selected some arbitrary instance sizes (EC2, RDS, ElastiCache). I am not accounting for auto-scaling. I am not accounting for network tra๏ฌƒc. Better to look at cost in production and try to optimise when needed. Rule of thumb: try to balance cost with your revenue. Rule of thumb (2): consider the ! total cost of ownership 91
  67. โ˜ Create an AWS Account โ˜ Select a tool for

    IaaC โ˜ Create and con๏ฌgure a VPC in a region (3 AZs, Public / Private subnets) โ˜ Create an S3 bucket โ˜ Update the old codebase to save every new ๏ฌle to S3 โ˜ Copy all the existing ๏ฌles to S3 โ˜ Spin up the database in RDS (Multi-AZ) โ˜ Migrate the data using Database Migration Service โ˜ Provision the ElastiCache Redis Cluster (Multi-AZ) โœ Bonus: a TODO list for the migration @loige โ˜ Create an AMI for the application โ˜ Create a security groups and an IAM policy for EC2 โ˜ Make the application stateless โ˜ Create an health check endpoint โ˜ Create an autoscaling group to spin up the instances โ˜ Create a certi๏ฌcate in ACM โ˜ Provision an Application Load Balancer (public subnets) โ˜ Con๏ฌgure Https, Targets and Health Checks โ˜ Con๏ฌgure Route53 โ˜ Tra๏ฌƒc switch-over through DNS ๐Ÿคž ๐Ÿ“ Great guide to cloud migrations: 6 strategies for migrating applications to the cloud 92
  68. The cloud is a journey not a destination The cloud

    is a journey not a destination @loige 93
  69. โ˜ nodejsdp.link Thank you! Special thanks to: , โค Photos

    by @micktwomey @eoins Unsplash @loige loige.link/mono-lit 94