Upgrade to Pro — share decks privately, control downloads, hide ads and more …

DeathStar: Easy, Dynamic, Multi-tenant HBase via YARN

DeathStar: Easy, Dynamic, Multi-tenant HBase via YARN

Multi-tenancy is a hard problem, made harder in distributed systems like HBase. In this talk we describe DeathStar, a solution built at Rocket Fuel on top of Apache Slider and YARN to make HBase multi-tenancy easy. We also discuss the understanding we have developed about various HBase access patterns and multi-tenancy scenario. We cover how we use a single YARN cluster to host multiple smaller and highly customized HBase clusters and how we leverage dynamic provisioning and elastic scaling made possible in this model. Finally, we describe some of the the improvements we have made to Slider and HBase to improve availability, monitorability, automation and usability.

Ishan Chhabra

May 07, 2015
Tweet

More Decks by Ishan Chhabra

Other Decks in Technology

Transcript

  1. Non uniform network usage Different DFSs’, leading to lot of

    copying of data Low cluster utilization High lead time for new applications
  2. (grid/deathstar): $ git commit Capacity planning and configuration discussion Create

    simple JSON config As applications mature from hangar to their cluster
  3. Dynamic Cluster: Make API call to start, stop and scale

    cluster Static Cluster: Good to go
  4. Hot swap a new cluster (human error / corruption) Easier

    HBase version upgrades and testing Temporary scale up for backfill “Dynamic” enables interesting use cases
  5. Long running apps a secondary use case Logging, an unsolved

    problem Store logs on local disks, considering ELK
  6. YARN/Slider lack certain scheduling constraints At most x instances per

    node for spread and availability Custom patch in-house