Upgrade to Pro — share decks privately, control downloads, hide ads and more …

MongoDB Multi-Master Replication

rick446
December 05, 2012

MongoDB Multi-Master Replication

Learn about the MongoDB multimaster replication tool that provides mostly eventually consistent replication

rick446

December 05, 2012
Tweet

More Decks by rick446

Other Decks in Programming

Transcript

  1. The Client Problem MongoDB (factory floor) MongoDB (cloud) App Server

    App Server App Server App Server Camera Camera Camera Camera Frame & Feature Data (mostly) Inspection Metadata (mostly) • Cameras save frame images to GridFS • Inspections, measurements, etc. defined by metadata in MongoDB • Access locally + from the web Tuesday, December 4, 12
  2. The Client Problem MongoDB (factory floor) MongoDB (cloud) App Server

    App Server App Server App Server Camera Camera Camera Camera Frame & Feature Data (mostly) Inspection Metadata (mostly) • Frames must still be saved, inspections performed • Tolerate being inaccessible from the cloud Tuesday, December 4, 12
  3. Put the Primary in the Cloud? MongoDB (factory floor) MongoDB

    (cloud) App Server App Server App Server App Server Camera Camera Camera Camera Frame & Feature Data (mostly) Inspection Metadata (mostly) Primary • Pros: • More conventional • Additional seconaries in the cloud • Cons: • Dependent on internet • Saving frame data up an ADSL connection??!! Tuesday, December 4, 12
  4. Put the Primary in the Factory? MongoDB (factory floor) MongoDB

    (cloud) App Server App Server App Server App Server Camera Camera Camera Camera Frame & Feature Data (mostly) Inspection Metadata (mostly) Primary • Pros: • Frame data saves fast • Resilient to outages • Cons: • Backups / durability? • Seems wrong, somehow... Tuesday, December 4, 12
  5. But what if.... MongoDB (factory floor) MongoDB (cloud) App Server

    App Server App Server App Server Camera Camera Camera Camera Frame & Feature Data (mostly) Inspection Metadata (mostly) Primary Primary • Pros: • Frame data saves fast • Resilient to outages • Backups / durability in cloud • Cons: • But MongoDB doesn’t do multi-master replication Tuesday, December 4, 12
  6. Replication: the Oplog { "ts" : Timestamp(1317653790000, 2), "h" :

    -6022751846629753359, "op" : "i", "ns" : "confoo.People", "o" : { "_id" : ObjectId("4e89cd1e0364241932324269"), "first" : "Rick", "last" : "Copeland” } } Tuesday, December 4, 12
  7. Replication: the Oplog { "ts" : Timestamp(1317653790000, 2), "h" :

    -6022751846629753359, "op" : "i", "ns" : "confoo.People", "o" : { "_id" : ObjectId("4e89cd1e0364241932324269"), "first" : "Rick", "last" : "Copeland” } } Insert Tuesday, December 4, 12
  8. Replication: the Oplog { "ts" : Timestamp(1317653790000, 2), "h" :

    -6022751846629753359, "op" : "i", "ns" : "confoo.People", "o" : { "_id" : ObjectId("4e89cd1e0364241932324269"), "first" : "Rick", "last" : "Copeland” } } Insert Collection name Tuesday, December 4, 12
  9. Replication: the Oplog { "ts" : Timestamp(1317653790000, 2), "h" :

    -6022751846629753359, "op" : "i", "ns" : "confoo.People", "o" : { "_id" : ObjectId("4e89cd1e0364241932324269"), "first" : "Rick", "last" : "Copeland” } } Insert Collection name Object to insert Tuesday, December 4, 12
  10. Step 1: Build Triggers def run(self): while True: yield self.checkpoint

    spec = dict(ts={'$gt': self.checkpoint}) q = self._oplog.find( spec, tailable=True, await_data=True) found=False for op in q.sort('$natural'): found = True self.checkpoint = op['ts'] for callback in self._callbacks.get( (op['ns'], op['op']), []): callback(**op) if found: sleep(0) else: sleep(1) Tuesday, December 4, 12
  11. Step 1: Build Triggers def run(self): while True: yield self.checkpoint

    spec = dict(ts={'$gt': self.checkpoint}) q = self._oplog.find( spec, tailable=True, await_data=True) found=False for op in q.sort('$natural'): found = True self.checkpoint = op['ts'] for callback in self._callbacks.get( (op['ns'], op['op']), []): callback(**op) if found: sleep(0) else: sleep(1) Find ops Tuesday, December 4, 12
  12. Step 1: Build Triggers def run(self): while True: yield self.checkpoint

    spec = dict(ts={'$gt': self.checkpoint}) q = self._oplog.find( spec, tailable=True, await_data=True) found=False for op in q.sort('$natural'): found = True self.checkpoint = op['ts'] for callback in self._callbacks.get( (op['ns'], op['op']), []): callback(**op) if found: sleep(0) else: sleep(1) Find ops Call triggers Tuesday, December 4, 12
  13. Step 2: Build the replication MMM_REPL_FLAG = '__mmm' def trigger(ts,

    h, op, ns, o, o2=None, b=False): if op == 'i': if o.get(MMM_REPL_FLAG) == self.id: return o.setdefault(MMM_REPL_FLAG, src_id) collection.insert(o) elif op == 'u': upsert = b if any(k.startswith('$') for k in o): # With modifiers, check & update setters setters = o.setdefault('$set', {}) else: # Without modifiers, update doc directly setters = o if setters.get(MMM_REPL_FLAG) == self.id: return setters.setdefault(MMM_REPL_FLAG, src_id) collection.update(o2, o, upsert) elif op == 'd': justOne = b collection.remove(o) Tuesday, December 4, 12
  14. Step 2: Build the replication MMM_REPL_FLAG = '__mmm' def trigger(ts,

    h, op, ns, o, o2=None, b=False): if op == 'i': if o.get(MMM_REPL_FLAG) == self.id: return o.setdefault(MMM_REPL_FLAG, src_id) collection.insert(o) elif op == 'u': upsert = b if any(k.startswith('$') for k in o): # With modifiers, check & update setters setters = o.setdefault('$set', {}) else: # Without modifiers, update doc directly setters = o if setters.get(MMM_REPL_FLAG) == self.id: return setters.setdefault(MMM_REPL_FLAG, src_id) collection.update(o2, o, upsert) elif op == 'd': justOne = b collection.remove(o) Inserts Tuesday, December 4, 12
  15. Step 2: Build the replication MMM_REPL_FLAG = '__mmm' def trigger(ts,

    h, op, ns, o, o2=None, b=False): if op == 'i': if o.get(MMM_REPL_FLAG) == self.id: return o.setdefault(MMM_REPL_FLAG, src_id) collection.insert(o) elif op == 'u': upsert = b if any(k.startswith('$') for k in o): # With modifiers, check & update setters setters = o.setdefault('$set', {}) else: # Without modifiers, update doc directly setters = o if setters.get(MMM_REPL_FLAG) == self.id: return setters.setdefault(MMM_REPL_FLAG, src_id) collection.update(o2, o, upsert) elif op == 'd': justOne = b collection.remove(o) Inserts Updates Tuesday, December 4, 12
  16. Step 2: Build the replication MMM_REPL_FLAG = '__mmm' def trigger(ts,

    h, op, ns, o, o2=None, b=False): if op == 'i': if o.get(MMM_REPL_FLAG) == self.id: return o.setdefault(MMM_REPL_FLAG, src_id) collection.insert(o) elif op == 'u': upsert = b if any(k.startswith('$') for k in o): # With modifiers, check & update setters setters = o.setdefault('$set', {}) else: # Without modifiers, update doc directly setters = o if setters.get(MMM_REPL_FLAG) == self.id: return setters.setdefault(MMM_REPL_FLAG, src_id) collection.update(o2, o, upsert) elif op == 'd': justOne = b collection.remove(o) Inserts Updates Deletes Tuesday, December 4, 12