Slide 1

Slide 1 text

Real-Time Web Analytics On Amazon Web Services @TheDeveloper Google Campus February 2013

Slide 2

Slide 2 text

Real-time web analytics

Slide 3

Slide 3 text

James Gill, CEO + Co-founder
Geoff Wagstaff, CTO + Co-founder
James Taylor, Developer + Co-founder

Slide 4

Slide 4 text

No content

Slide 5

Slide 5 text

LiveStats

Slide 6

Slide 6 text

No content

Slide 7

Slide 7 text

In 2009, our server melted.

Slide 8

Slide 8 text

Conventional Hosting
• Contractual lock-in. Fixed term subscription
• Laborious and slow to provision resources
• Restricted
• Dedicated, but not bespoke
• Boring

Slide 9

Slide 9 text

Business constraints
• Experimental technology
• Agility
• Low budget
• Inexperience
• High barrier to entry in market

Slide 10

Slide 10 text

GET TO THE CLOUD!!!11!!one!

Slide 11

Slide 11 text

Why AWS? Diverse + growing range of services
EC2, S3, CloudFront, CloudFormation, CloudWatch, RDS, Route53, IAM, CloudSearch, ElasticBeanstalk, ElastiCache, DynamoDB, SES, SNS, SQS, Glacier, RedShift, ElasticTranscoder, Data Pipeline, StorageGateway, VPC, SWF

Slide 12

Slide 12 text

Why AWS? Flexible & Scalable
• 5 x c1.xlarge
• 2.5m ELB requests
• 1.4TB S3 storage
• 4 Route53 record sets
• 32 EBS volumes
• 500 DynamoDB Units
• 3 SQS queues

Slide 13

Slide 13 text

Why AWS? Pay-as-you-use Pricing On-demand Spot (market) pricing Reserved Per unit Per 1,000 Requests Free Tier Per resource Heavy Medium Light

Slide 14

Slide 14 text

Why AWS? Platform agnostic
APIs, SQL Server, Windows Server, Linux (Ubuntu, Fedora, CentOS), Ruby, PHP, Python, Node.JS, Perl, CLI tools

Slide 15

Slide 15 text

Why AWS? Reputable

Slide 16

Slide 16 text

Challenges
• More moving parts
• Daunting. Requires new skills and role changes
• Architecting for cloud
• Deployment - resources, software, data
• Automation - scaling, failover, backup

Slide 17

Slide 17 text

Advantages
• Full control - freedom, bespoke
• Fosters good system design paradigms & avoids complacency
• Abundant tools, services, community + documentation
• Lowered barriers to entry: infrastructure for anyone
• Reduced time to market

Slide 18

Slide 18 text

Gap Year project: Trends

Slide 19

Slide 19 text

Angels (2010) Series A (2011)

Slide 20

Slide 20 text

6 Team Members Average age: 21

Slide 21

Slide 21 text

25,000 sites and counting

Slide 22

Slide 22 text

Traffic 10s Thousands 100s Thousands 100s Millions 99.9% concurrent visitors pageviews/month Requests/second Uptime
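
To put the 99.9% uptime figure in concrete terms, here is a minimal arithmetic sketch. The function name and the 30-day month are assumptions made for illustration; they are not from the talk.

```python
def allowed_downtime_minutes(uptime_percent, days=30):
    """Minutes of downtime permitted over `days` at a given uptime percentage."""
    total_minutes = days * 24 * 60  # minutes in the period
    return total_minutes * (100 - uptime_percent) / 100


if __name__ == "__main__":
    print(round(allowed_downtime_minutes(99.9), 1))
```

Under these assumptions, 99.9% uptime leaves roughly 43 minutes of downtime in a 30-day month.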

Slide 23

Slide 23 text

Our Stack Load Balancer Reverse Proxy Services Data Stores ELB Redis, MongoDB, MySQL (RDS), EBS, S3 Service-oriented architecture: Tiered, decoupled, load-balanced clusters Fault tolerance, failover, backups Auto-scaling. Spot instances Collectors, processing, API, Traffic AWS Services: EC2, S3, RDS, Route53, CloudWatch, SNS, IAM Applications, Website Route53

Slide 24

Slide 24 text

#ProTips Monitoring
• Ganglia - http://ganglia.sourceforge.net/
• Graphite - http://graphite.wikidot.com/
• ServerDensity
• CloudWatch
• PagerDuty
• StatsD - https://github.com/etsy/statsd/
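
As a rough illustration of how an application might feed the StatsD daemon linked on this slide, the sketch below sends metrics in StatsD's plain-text UDP format (`name:value|type`). The function names and metric names are hypothetical, and the host and port assume a daemon listening on its default port 8125 on the local machine; the slide does not say how metrics were actually emitted.

```python
import socket


def format_metric(name, value, metric_type="c"):
    """Build a StatsD line: 'c' = counter, 'ms' = timer, 'g' = gauge."""
    return f"{name}:{value}|{metric_type}"


def send_metric(name, value, metric_type="c", host="127.0.0.1", port=8125):
    """Fire-and-forget a single metric over UDP (assumed local daemon)."""
    payload = format_metric(name, value, metric_type).encode("ascii")
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        sock.sendto(payload, (host, port))
    return payload
```

For example, `send_metric("pageviews", 1)` would increment a counter and `send_metric("response_time", 320, "ms")` would record a timing. UDP keeps the call cheap: the application does not wait for the monitoring system to respond.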

Slide 25

Slide 25 text

#ProTips Deployment
• SCM + hosting - https://github.com/gosquared
• Deliver - https://github.com/gosquared/deliver
• Continuous Integration - http://jenkins-ci.org/
• Installation + configuration - http://www.opscode.com/chef/

Slide 26

Slide 26 text

#ProTips Scaling
• EC2 Clusters. Auto-scaling. Spot instances
• Messaging - RabbitMQ
• S3, CloudFront, Route53
• Redis - http://redis.io
• https://github.com/gosquared/convoy
• https://github.com/maritz/nohm
[Diagram: Producer, C, C, C]
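
The "Producer" and "C C C" labels on this slide suggest one producer fanning work out to several consumers through a message queue. The sketch below shows that pattern using Python's in-process `queue.Queue` as a stand-in for a broker such as RabbitMQ. It is not RabbitMQ client code, and the function name, event values, and counting logic are assumptions made for illustration.

```python
import queue
import threading


def run_pipeline(events, num_consumers=3):
    """Fan events from one producer out to several consumers; return per-event counts."""
    work = queue.Queue()      # stand-in for a broker queue (assumption)
    counts = {}
    lock = threading.Lock()   # protects the shared counts dict

    def consumer():
        while True:
            event = work.get()
            if event is None:  # sentinel: no more work for this consumer
                break
            with lock:
                counts[event] = counts.get(event, 0) + 1

    workers = [threading.Thread(target=consumer) for _ in range(num_consumers)]
    for worker in workers:
        worker.start()

    for event in events:       # the producer side
        work.put(event)
    for _ in workers:          # one sentinel per consumer
        work.put(None)
    for worker in workers:
        worker.join()
    return counts
```

Because consumers only pull from the queue, adding capacity means starting more consumers. With a real broker, those consumers could run on separate instances rather than as threads in one process.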

Slide 27

Slide 27 text

Destroy Everything

Slide 28

Slide 28 text

Thanks! Geoff Wagstaff @TheDeveloper https://engineering.gosquared.com