Upgrade to Pro — share decks privately, control downloads, hide ads and more …

big data

big data

big data keynote at the opencode quebec, introducing cassandra, hadoop and pig.

ngarneau

March 09, 2012
Tweet

More Decks by ngarneau

Other Decks in Programming

Transcript

  1. cassandra - Facebook - 2007. - Apache - 2008. -

    Netflix, Digg, Twitter, Rackspace...
  2. cassandra data model - no joins (maybe joints, we don’t

    know as of version 1.0.9..) - denormalization
  3. cassandra data model - keyspace - column family - row

    key - super column - column / value
  4. cassandra data model application = { users = { ‘ngarneau’:

    { ‘first_name’: ‘nicolas’, ‘last_name’: ‘garneau’ } } }
  5. cassandra data model application = { users = { ‘ngarneau’:

    { ‘first_name’: ‘nicolas’, ‘last_name’: ‘garneau’ } } } keyspace
  6. cassandra data model application = { users = { ‘ngarneau’:

    { ‘first_name’: ‘nicolas’, ‘last_name’: ‘garneau’ } } } keyspace column family
  7. cassandra data model application = { users = { ‘ngarneau’:

    { ‘first_name’: ‘nicolas’, ‘last_name’: ‘garneau’ } } } keyspace column family row key
  8. cassandra data model application = { users = { ‘ngarneau’:

    { ‘first_name’: ‘nicolas’, ‘last_name’: ‘garneau’ } } } keyspace column family row key column
  9. cassandra data model application = { users = { ‘ngarneau’:

    { ‘first_name’: ‘nicolas’, ‘last_name’: ‘garneau’ } } } keyspace column family row key column value
  10. cassandra keep in mind memory disk memtable commit log memtable

    memtable memtables SSTables SSTables SSTables
  11. cassandra keep in mind memory disk memtable commit log memtable

    memtable memtables SSTables SSTables SSTables
  12. cassandra keep in mind memory disk memtable commit log memtable

    memtable memtables SSTables SSTables SSTables SSTables
  13. cassandra keep in mind memory disk memtable commit log memtable

    memtable memtables SSTables SSTables SSTables SSTables SSTables
  14. cassandra keep in mind memory disk memtable commit log memtable

    memtable memtables SSTables SSTables SSTables SSTables SSTables SSTables
  15. pig - pigs eat anything. - pigs live anywhere. -

    pigs are domestic. - pigs fly.
  16. let’s play! dataset Salons = { ’1’: { ‘id’: 1,

    ‘attendants’: 47, ‘name’: ‘Salon Laval’, ‘year’: 2010 } } Commandes = { ’1’: { ‘amount’: 799, ‘salon’: 1 } }
  17. let’s play! we want to know what is the correlation

    between the number of attendants and the total revenues by salon.