Slide 1

Slide 1 text

Ruby Driver Explained Overview of the Ruby Driver for Apache Cassandra Bulat Shakirzyanov @avalanche123

Slide 2

Slide 2 text

Introduction Cassandra Overview

Slide 3

Slide 3 text

© 2014 DataStax, All Rights Reserved. Datacenter Datacenter Cassandra Topology 3 Node Node Node Node Client Client Node Node Node Node Client Client

Slide 4

Slide 4 text

© 2014 DataStax, All Rights Reserved. Datacenter Datacenter Request Coordinator 4 Node Node Node Node Client Client Node Node Coordinator Node Client Client Coordinator node: Forwards requests to corresponding replicas

Slide 5

Slide 5 text

© 2014 DataStax, All Rights Reserved. Datacenter Row Replica 5 Replica Node Node Replica Client Client Datacenter Node Node Replica Client Client Coordinator Replica node: Stores a slice of total rows of each keyspace

Slide 6

Slide 6 text

Quick Start Installation and Usage

Slide 7

Slide 7 text

© 2014 DataStax, All Rights Reserved. Installation 7 gem 'cassandra-driver', '~> 1.0.0' gem install cassandra-driver

Slide 8

Slide 8 text

© 2014 DataStax, All Rights Reserved. Usage 8 require 'cassandra' cluster = Cassandra.cluster cluster.each_host do |h| puts "Host #{h.ip}: datacenter=#{h.datacenter} rack=#{h.rack}" end keyspace = 'system' session = cluster.connect(keyspace) future = session.execute_async('SELECT * FROM schema_columnfamilies') future.on_success do |rows| rows.each do |row| puts “Table: #{row[‘keyspace_name']}.#{row['columnfamily_name']}" end end future.join

Slide 9

Slide 9 text

Asynchronous Execution IO Reactor, Request Pipelining and Future Composition

Slide 10

Slide 10 text

© 2014 DataStax, All Rights Reserved. Asynchronous Core 10 Application Thread Business Logic Driver Background Thread IO Reactor

Slide 11

Slide 11 text

© 2014 DataStax, All Rights Reserved. Request Pipelining 11 Client Without Request Pipelining Server Client Server With Request Pipelining 1 2 2 3 1 3 1 2 3 1 2 3

Slide 12

Slide 12 text

© 2014 DataStax, All Rights Reserved. Future Composition 12 select_user = session.prepare("SELECT * FROM users WHERE id = ?") select_page = session.prepare("SELECT * FROM pages WHERE slug = ?") user_ids = [1, 2, 3, 4] futures = user_ids.map do |id| future = session.execute_async(select_user, id) future.then do |users| user = users.first future = session.execute_async(select_page, user[‘username']) future.then do |pages| page = pages.first User.new(user, Page.new(page)) end end end Cassandra::Future.all(futures).get

Slide 13

Slide 13 text

© 2014 DataStax, All Rights Reserved. Future Composition 13 select_user = session.prepare("SELECT * FROM users WHERE id = ?") select_page = session.prepare("SELECT * FROM pages WHERE slug = ?") user_ids = [1, 2, 3, 4] futures = user_ids.map do |id| future = session.execute_async(select_user, id) future.then do |users| user = users.first future = session.execute_async(select_page, user[‘username']) future.then do |pages| page = pages.first User.new(user, Page.new(page)) end end end Cassandra::Future.all(futures).get

Slide 14

Slide 14 text

© 2014 DataStax, All Rights Reserved. Future Composition 14 select_user = session.prepare("SELECT * FROM users WHERE id = ?") select_page = session.prepare("SELECT * FROM pages WHERE slug = ?") user_ids = [1, 2, 3, 4] futures = user_ids.map do |id| future = session.execute_async(select_user, id) future.then do |users| user = users.first future = session.execute_async(select_page, user[‘username']) future.then do |pages| page = pages.first User.new(user, Page.new(page)) end end end Cassandra::Future.all(futures).get

Slide 15

Slide 15 text

© 2014 DataStax, All Rights Reserved. Future Composition 15 select_user = session.prepare("SELECT * FROM users WHERE id = ?") select_page = session.prepare("SELECT * FROM pages WHERE slug = ?") user_ids = [1, 2, 3, 4] futures = user_ids.map do |id| future = session.execute_async(select_user, id) future.then do |users| user = users.first future = session.execute_async(select_page, user[‘username']) future.then do |pages| page = pages.first User.new(user, Page.new(page)) end end end Cassandra::Future.all(futures).get

Slide 16

Slide 16 text

© 2014 DataStax, All Rights Reserved. Future Composition 16 select_user = session.prepare("SELECT * FROM users WHERE id = ?") select_page = session.prepare("SELECT * FROM pages WHERE slug = ?") user_ids = [1, 2, 3, 4] futures = user_ids.map do |id| future = session.execute_async(select_user, id) future.then do |users| user = users.first future = session.execute_async(select_page, user[‘username']) future.then do |pages| page = pages.first User.new(user, Page.new(page)) end end end Cassandra::Future.all(futures).get

Slide 17

Slide 17 text

© 2014 DataStax, All Rights Reserved. Future Composition 17 select_user = session.prepare("SELECT * FROM users WHERE id = ?") select_page = session.prepare("SELECT * FROM pages WHERE slug = ?") user_ids = [1, 2, 3, 4] futures = user_ids.map do |id| future = session.execute_async(select_user, id) future.then do |users| user = users.first future = session.execute_async(select_page, user[‘username']) future.then do |pages| page = pages.first User.new(user, Page.new(page)) end end end Cassandra::Future.all(futures).get

Slide 18

Slide 18 text

© 2014 DataStax, All Rights Reserved. Future Composition 18 [# ... >, ... ]

Slide 19

Slide 19 text

Load Balancing Principles and Implementations

Slide 20

Slide 20 text

© 2014 DataStax, All Rights Reserved. Application Driver Load Balancing 20 Application Thread Node Pool Session Pool Pool Pool Application Thread Application Thread Client Cluster Node Node Node Load Balancing Policy

Slide 21

Slide 21 text

© 2014 DataStax, All Rights Reserved. Application Driver Load Balancing 20 Application Thread Node Pool Session Pool Pool Pool Application Thread Application Thread Client Cluster Node Node Node Load Balancing Policy

Slide 22

Slide 22 text

© 2014 DataStax, All Rights Reserved. Application Driver Load Balancing 20 Application Thread Node Pool Session Pool Pool Pool Application Thread Application Thread Client Cluster Node Node Node Load Balancing Policy

Slide 23

Slide 23 text

© 2014 DataStax, All Rights Reserved. Datacenter Datacenter DataCenter Aware Balancing 21 Node Node Node Client Node Node Node Client Client Client Client Client Local nodes are queried first, if non are available, the request could be sent to a remote node.

Slide 24

Slide 24 text

© 2014 DataStax, All Rights Reserved. Token Aware Balancing 22 Nodes that own a Replica of the PK being read or written by the query will be contacted first. Node Node Replica Node Client Replica Replica Partition Key will be inferred from Prepared Statements metadata

Slide 25

Slide 25 text

Fault Tolerance Sources of Failure and Error Handling

Slide 26

Slide 26 text

© 2014 DataStax, All Rights Reserved. Fault Tolerance 24 Coordinator Node Replica Replica Replica Node Business Logic Driver Application

Slide 27

Slide 27 text

© 2014 DataStax, All Rights Reserved. 25 Coordinator Node Replica Replica Replica Node Business Logic Driver Application Invalid Requests Network Timeouts Server Errors Possible Failures

Slide 28

Slide 28 text

© 2014 DataStax, All Rights Reserved. Application Driver Automatic Retry of Server Errors 26 Application Thread Node Pool Session Pool Pool Pool Application Thread Application Thread Client Cluster Node Node Node Load Balancing Policy

Slide 29

Slide 29 text

© 2014 DataStax, All Rights Reserved. Application Driver Automatic Retry of Server Errors 26 Application Thread Node Pool Session Pool Pool Pool Application Thread Application Thread Client Cluster Node Node Node Load Balancing Policy

Slide 30

Slide 30 text

© 2014 DataStax, All Rights Reserved. Application Driver Automatic Retry of Server Errors 26 Application Thread Node Pool Session Pool Pool Pool Application Thread Application Thread Client Cluster Node Node Node Load Balancing Policy

Slide 31

Slide 31 text

© 2014 DataStax, All Rights Reserved. 27 Coordinator Node Replica Replica Replica Node Business Logic Driver Application Unreachable Consistency

Slide 32

Slide 32 text

© 2014 DataStax, All Rights Reserved. Coordinator Node Replica Replica Node 28 Replica Business Logic Driver Application Read / Write Timeout Error

Slide 33

Slide 33 text

© 2014 DataStax, All Rights Reserved. Coordinator Node Replica Replica Node 28 Replica Business Logic Driver Application Read / Write Timeout Error

Slide 34

Slide 34 text

© 2014 DataStax, All Rights Reserved. Coordinator Node Replica Replica Node 28 Replica Business Logic Driver Application Read / Write Timeout Error read / write timeout

Slide 35

Slide 35 text

© 2014 DataStax, All Rights Reserved. 29 Coordinator Node Replica Replica Replica Node Business Logic Driver Application Unavailable Error

Slide 36

Slide 36 text

© 2014 DataStax, All Rights Reserved. 29 Coordinator Node Replica Replica Replica Node Business Logic Driver Application Unavailable Error unavailable

Slide 37

Slide 37 text

© 2014 DataStax, All Rights Reserved. 30 Error Handling

Slide 38

Slide 38 text

Address Resolution Topology Aware Client

Slide 39

Slide 39 text

© 2014 DataStax, All Rights Reserved. Application Driver Address Resolution 32 Application Thread Driver Application Thread Application Thread Client Cluster Address Resolution Policy

Slide 40

Slide 40 text

© 2014 DataStax, All Rights Reserved. Application Driver Address Resolution 32 Application Thread Node Driver Application Thread Application Thread Client Cluster Address Resolution Policy

Slide 41

Slide 41 text

© 2014 DataStax, All Rights Reserved. Application Driver Address Resolution 32 Application Thread Node Pool Driver Application Thread Application Thread Client Cluster Node Node Node Address Resolution Policy

Slide 42

Slide 42 text

© 2014 DataStax, All Rights Reserved. Application Driver Address Resolution 32 Application Thread Node Pool Driver Pool Pool Pool Application Thread Application Thread Client Cluster Node Node Node Address Resolution Policy

Slide 43

Slide 43 text

© 2014 DataStax, All Rights Reserved. Datacenter Datacenter Address Resolution 33 Node Node Node Node Client Client Node Node Node Node Client Client Within Datacenter:
 Private IPs Across Datacenters:
 Public IPs

Slide 44

Slide 44 text

© 2014 DataStax, All Rights Reserved. EC2 Multi-Region Address Resolution 34

Slide 45

Slide 45 text

Questions and Links • http://datastax.github.io/ruby-driver/ • https://github.com/datastax/ruby-driver • @avalanche123