Upgrade to Pro — share decks privately, control downloads, hide ads and more …

All About Clouds

All About Clouds

In spring of 2013, VM Farms Inc. Founder and CEO Hany Fahim was approached to give a lecture to University of Toronto students about what the cloud is and how it works. Check out the presentation to learn a little about the VM Farms outlook on cloud-building.

VM Farms Inc.

March 13, 2013
Tweet

More Decks by VM Farms Inc.

Other Decks in Technology

Transcript

  1. Background • Worked in the hosting industry for 10 years.

    • Founded VM Farms in 2009. • Based in Toronto. • VM Farms is a Fully Managed Cloud Hosting provider. • All of our customer data is stored in Canada**.
  2. What is a cloud? • Collection of computational resources (hardware

    and software) delivered over a network, usually the Internet. • 3 distinct aspects makes something a “cloud”: • Resources are usually metered (pay-per-use). • Resources can be provisioned on-demand. • Resources can be manipulated via API.
  3. Shared Hosting • Share same hardware and OS. • Low

    cost. • HostGator starts at $3.96/mo • Fewer options and less flexibility. • Less performant. Frequent outages. • Security implications.
  4. Dedicated Hosting • Leased dedicated hardware (servers, network switches, etc...).

    • High Cost. • Typical monthly fee starts at $200/mo. • Greater flexibility and options. • Higher performance. • Long contracts (typically 1 year, but can be upwards of 3 years).
  5. Co-location • Purchase and host your own hardware. • High

    Initial Cost (CAPEX). • A good server can cost upwards of $10k. • Monthly fees for power, network, cooling. • Long contracts (typically 1 year, but can be upwards of 3 years).
  6. VPS • Best compromise between price and performance. • Utilized

    primitive incarnations of virtualization. • Pay monthly fee for VPS ($40-$100/mo). • Security implications.
  7. Something’s missing... Datacentres Power Cooling Racks Physical Security Servers RAM

    CPUs Disk Network Adapters Fans Networks Switches Routers IPs VLANS Firewalls Bandwidth Operating Systems Distributions Packages Configuration Files Services Patching Your Application ?
  8. Hire a Sys Admin • Due to complexity, the only

    option available was to hire a sys admin that knew how to navigate that world. • Sys Admin would prepare the environment, and deploy your code on your behalf since they know the system best. • This leads to Sys Admin vs. Developer wars.
  9. The Cloud... • Abstracts away the complication of running a

    proper server environment. • Controlled through an API so you can “deploy” infrastructure. • Entire environments can be deployed in minutes.
  10. SAN Backend Network Fabric SAN Compute Node Compute Node Compute

    Node Compute Node Frontend Network Fabric
  11. SAN Backend Network Fabric SAN Compute Node Compute Node Compute

    Node Compute Node Frontend Network Fabric
  12. Compute Nodes • Physical servers running virtualization** software. • Provided

    CPU and Memory resources. • Usually don’t store any permanent data (ephemeral). • Connected to multiple networks.
  13. Virtualization • Virtualization allows you to run multiple operating systems

    on a single server. • Virtualization usually refers to the Hypervisor. • Motivation for virtualization - most hardware is under-utilized. • Virtualization increases utilization to get the most out of expensive hardware.
  14. Types of Virtualization • Fully Virtualized • Complete simulation of

    the hardware. • Unmodified Guests. • Partial Virtualization • Partial simulation of hardware components. • Modified Guests. • Para-virtualized • Hardware is not simulated - direct isolated access. • Modified Guests.
  15. Hardware Hypervisor VM Full Virtualized Device Drviers Device Drviers VM

    Device Drviers VM Device Drviers Device Drviers Device Drviers
  16. Examples • Xen (OSS, full and para-virtualized guests) • KVM

    (OSS, fully virtualized guests) • VMware (fully and partially virtualized guests) • Microsoft Hyper-V
 (fully virtualized Windows guests) • Solaris Containers (para-virtualized guests)
  17. Xen • Most popular and prevalent hypervisor amongst hosting providers.

    • Amazon, Linode, Rackspace, VM Farms, IBM, HP all use Xen. • Started as a research project at the University of Cambridge. • First version released in 2003. • Citrix acquired XenSource in 2007.
  18. SAN Backend Network Fabric SAN Compute Node Compute Node Compute

    Node Compute Node Frontend Network Fabric
  19. Network Layers • One network sits in front of Compute

    Nodes to provide Internet access. • One network sits in the back of Compute Nodes to provide access to Storage layer. • Usually Ethernet and Fibre Optic based. • Not uncommon to see 10GigE switches.
  20. SAN Backend Network Fabric SAN Compute Node Compute Node Compute

    Node Compute Node Frontend Network Fabric
  21. Storage Layer • Use of Storage Area Networks (SANs), Network

    Attached Storage (NAS), Direct Attached Storage (DAS). • Usually in the form of specialized commercial hardware with lots and lots of drives. • Typically communicate over the iSCSI protocol. • Very *very* Expensive. • You usually have to buy 2 for redundancy. • Failures can be catastrophic.
  22. SAN SAN Compute Node Compute Node Compute Node Compute Node

    Frontend Network Fabric Backend Network Fabric
  23. SAN SAN Compute Node Compute Node Compute Node Compute Node

    Frontend Network Fabric Backend Network Fabric
  24. SAN SAN Compute Node Compute Node Compute Node Compute Node

    Frontend Network Fabric Backend Network Fabric
  25. SAN SAN Compute Node Compute Node Compute Node Compute Node

    Frontend Network Fabric Backend Network Fabric
  26. SAN Backend Network Fabric SAN Compute Node Compute Node Compute

    Node Compute Node Frontend Network Fabric
  27. SAN Backend Network Fabric SAN Compute Node Compute Node Compute

    Node Compute Node Frontend Network Fabric
  28. SAN Backend Network Fabric SAN Compute Node Compute Node Compute

    Node Compute Node Frontend Network Fabric
  29. True Story. • On April 20, 2011, some parts of

    Amazon Web Services suffered a major outage. • A portion of volumes utilizing the Elastic Block Store (EBS) service became "stuck" and were unable to fulfill read/write requests. • It took at least five days for service to be fully restored.
  30. Compute Node Compute Node Compute Node Compute Node Frontend Network

    Fabric Compute Node Compute Node Compute Node Compute Node
  31. Compute Node Compute Node Compute Node Compute Node Frontend Network

    Fabric Compute Node Compute Node Compute Node Compute Node
  32. Compute Node Compute Node Compute Node Compute Node Frontend Network

    Fabric Compute Node Compute Node Compute Node Compute Node
  33. IaaS • Infrastructure-as-a-Service. • Ideal for specialization. • Examples: •

    Amazon Web Services • Rackspace Cloud • VM Farms • Linode
  34. PaaS • Platform-as-a-Service. • Ideal for rapid web development. •

    Examples: • Heroku • Engine Yard • VM Farms • Google App Engine • Windows Azure
  35. API • Most providers offer an API to allow you

    to use the service programmatically. • Script and automate deployment of resources. • Web Interfaces available.
  36. Which to choose? • If you’re building an application based

    on a common framework (Django, RoR, etc...), pick a PaaS provider. • If you’re building a non-standard application that requires specialized resources (such as GPU rendering), pick an IaaS provider. • Or, just come to VM Farms.
  37. Downsides • Clouds are complex - many moving parts. •

    Outages are common - expect failures. • Expensive. • Data locality.
  38. USA Patriot Act snippet from Wikipedia • One of the

    most controversial aspects of the USA PATRIOT Act is in title V, and relates to National Security Letters (NSLs). An NSL is a form of administrative subpoena used by the FBI, and reportedly by other U.S. government agencies including the CIA and the Department of Defense (DoD). It is a demand letter issued to a particular entity or organization to turn over various records and data pertaining to individuals. They require no probable cause or judicial oversight and also contain a gag order, preventing the recipient of the letter from disclosing that the letter was ever issued. • Signed into law by President George W. Bush on October 26, 2001. 45 days after 911.
  39. BC wants data in BC • On October 21, 2004,

    the BC government amended their Freedom of Information and Protection of Privacy Act (FOIPPA) and put tough restrictions on the storage, access, and disclosing public sector data. • Since then, it is an offense to store such data out of Canada, and even outside of BC under some circumstances.
  40. Build your own • OpenStack is an open source IaaS

    project. • AMD, Intel, Red Hat, Rackspace, Dell, HP, IBM, Cisco, VMware, Yahoo, NASA are on board. • Started by Rackspace and NASA in July 2010.
  41. OpenStack Family • Nova (compute) • Swift (object storage) •

    Glance (image management) • Quantum (network) • Cinder (block storage)
  42. The Future... • A WSJ article cites “cloud computing” market

    to explode to $241B by 2020 (up from $41B in 2011). • Dec 2010 - Heroku was acquired by Salesforce.com for $212B! • https://aws.amazon.com/solutions/case-studies/ lists 422 major websites using AWS, and that’s just Amazon’s list. • https://wiki.openstack.org/wiki/Contributors/Corporate lists 133 *corporate* contributes, with individual contributors in the thousands. • Data locality will continue to be a big concern for all players (providers, end users, government bodies) as more legislation is passed.