Riak

Jeremy Thurgood July 4, 2012

What is Riak? Distributed, highly-available, key/value store. Open source, but
commercial ”Enterprise” version available. Strong community, but commercial support available. basho

Under the hood • Keyspace is partioned (consistent hashing) •
Physical nodes run multiple vnodes • Each vnode claims a partition • Replicas are stored in multiple partitions

CAP, replication, all that good stuff • N replicas are
stored • R replicas are required for a successful read • W replicas are required for a successful write • R, W and DW can specified per-request • Version conflicts are detected with vclocks and may be resolved by the application • But! By default, this is done for you using “latest wins”

Pluggable backends Bitcask • High throughput, low latency, predictable performance
• Keys must ﬁt in memory, no secondary indexes LevelDB • Data compression, secondary indexes • Slower than Bitcask Memory • No persistence, good for testing Multi • Conﬁgure backend per bucket

APIs • HTTP (RESTful), Protocol Buﬀers, Erlang • GET, PUT,
DELETE objects • Do various things with buckets • Map/reduce, secondary indexes • Search

PUT object • Bucket name • Key name (optional) •
Content-Type • Vclock (if the object already exists) • Links, indexes, metadata (if any) • Quorum size: W , DW • Content Content can be arbitrary data, but something like JSON is best for search, etc.

GET object • Bucket name, key name • Quorum size:
R Response • Content-Type • ETag, Last-Modiﬁed • Vclock • Links, indexes, metadata (if any) • Content (of course)

Buckets • Not really real, just namespaces • Properties •
n val, allow mult, postcommit, etc. • Warning! Don’t have too many • Can’t delete (but I’m working on that) • “List keys” supported, but very expensive

Map/reduce • Multiple steps: map, reduce, link • Inputs: bucket/key
pairs, index query, key ﬁlter • Javascript or Erlang functions • See docs for all the details

Secondary indexes • Requires eleveldb backend • Data types: bin,
int • Orthogonal to object value • Query on exact value or range • Query on one index only • Returns keys, not objects • Can feed map/reduce operations

Search • Lucene (with some bits ported to Erlang) •
Index distributed by term (high performance, possible latency hit) • Solr API • Search key/value data • Set precommit hook for indexer • JSON and XML are indexed by ﬁeld name • Can feed map/reduce operations

Client libraries • Lots, for all sorts of exciting languages
• Basho oﬃcially supports C/C++, Erlang, Java, PHP, Python, Ruby • We maintain Riakasaurus (Python/Twisted) • We wrote a hybrid sync/async ORM-sort-of-thing which needs a cool name

Operational stuﬀ • Recommend minimum of 5 nodes, but 3
will do • All nodes are equal (no master) • Nodes can go away with no downtime (as long as there’s still a quorum) • Adding nodes is completely transparent • For best performance, use dedicated boxes with good I/O

Making the cluster dance • riak start • riak stop
• riak-admin join [email protected] • riak-admin status (lots of stuﬀ) • riak-admin diag (needs riaknostic installation)

Web console

So, why are we using it? We need a db
that. . . • has high performance reads and writes • doesn’t require a devops army • can store relations (in a limited way) • works well with non-relational data • allows full-text searching

Gotchas • Bucket properties are global metadata, which impacts performance
• Eventual consistency can be a pain at times

And that’s it Thanks! Also, come to PyCon ZA. http://za.pycon.org

Riak

Riak

jerith

Other Decks in Programming

Featured

Transcript

Jeremy Thurgood July 4, 2012

What is Riak? Distributed, highly-available, key/value store. Open source, but

Under the hood • Keyspace is partioned (consistent hashing) •

CAP, replication, all that good stuﬀ • N replicas are

Pluggable backends Bitcask • High throughput, low latency, predictable performance

APIs • HTTP (RESTful), Protocol Buﬀers, Erlang • GET, PUT,

PUT object • Bucket name • Key name (optional) •

GET object • Bucket name, key name • Quorum size:

Buckets • Not really real, just namespaces • Properties •

Map/reduce • Multiple steps: map, reduce, link • Inputs: bucket/key

Secondary indexes • Requires eleveldb backend • Data types: bin,

Search • Lucene (with some bits ported to Erlang) •

Client libraries • Lots, for all sorts of exciting languages

Operational stuﬀ • Recommend minimum of 5 nodes, but 3

Making the cluster dance • riak start • riak stop

Web console

Web console

So, why are we using it? We need a db

Gotchas • Bucket properties are global metadata, which impacts performance

And that’s it Thanks! Also, come to PyCon ZA. http://za.pycon.org