Upgrade to Pro — share decks privately, control downloads, hide ads and more …

OpenStack @ IIIT-H

OpenStack @ IIIT-H

OpenStack usage at IIIT-H presented at Hyderabad OpenStack Meetup.
https://www.meetup.com/Indian-OpenStack-User-Group/events/106526922/

dharmeshkakadia

March 31, 2013
Tweet

More Decks by dharmeshkakadia

Other Decks in Technology

Transcript

  1. What we do • Run an Indian Languages Search Engine

    • Research ◦ Information Extraction ◦ Information Retrieval ◦ Information Access ◦ Virtualization and Cloud • Users of ◦ OpenStack ◦ Hadoop ◦ and lot of other FOSS
  2. Problems • Provisioning ◦ Adhoc ◦ Time consuming ◦ Unmanaged

    • User Management ◦ No resource accounting ◦ Access Control ◦ Usage Restriction • Storage ◦ Data reliability ◦ Duplication
  3. More Problems... • Cluster ◦ Terrible Resource Utilization ◦ New

    deployment => Too much time ◦ Data Redundancy ◦ Non-optimal deployments • Academic ◦ No cloud platform for experimentation ◦ Large Scale sandboxed resource provisioning for students.
  4. OpenStack(KVM) • 7 Compute nodes (8GB, quad-core) • 1 nova-volume(2

    TB, Raid-1) Swift • 3 storage nodes (2TB each) OpenStack(LXC) • 16 Compute nodes (6GB, dual core)
  5. Provisioning • Pre-configured images to quickly get started. • VM

    of any capacity available at any time( 2 a.m. Sunday morning) • Snapshots
  6. User Management • Resource restrictions using Quota • Project based

    collaboration and private resources • Usage monitoring
  7. Storage This wasn't easy. We experimented with • nova-volume •

    Swift(diablo) • GlusterFS • Swift(Folsom)(current)
  8. Storage • Hadoop compatible distributed storage • Glance image store

    • Desktop backup utility using CloudFuse • Data reliability • No more Data Fragmentation
  9. OpenStack in Academia • Research ◦ Inter cloud migration ◦

    Inter cloud scheduling ◦ Performance Evaluation • Resource provisioning for course assignments and projects. ◦ 3 courses ◦ 350+ students ◦ 20+ projects
  10. HadoopStack • Big Data processing on Demand • Entire ecosystem

    for Big Data - Hadoop Family, Spark, Mahout, R • Multi-Cloud - OpenStack and AWS.