CRM   bob    35  legal
GOOG  steve  38  ceo
GOOG  ben    43  dev
GOOG  mary   25  dev

SELECT * FROM employees;
SELECT * FROM employees WHERE company = 'CRM';
SELECT * FROM employees WHERE company = 'CRM' AND name = 'alice';
SELECT * FROM employees WHERE name = 'alice';

            writer  CA  alice  30
CRM   new   legal   TX  bob    35
CRM   new   ceo     WA  steve  38
CRM   old   dev     MD  ben    43
GOOG  mid   dev     AZ  mary   25

SELECT * FROM employees;
SELECT * FROM employees WHERE company = 'CRM' AND status = 'new';
SELECT * FROM employees WHERE company = 'CRM' AND status = 'new' AND role = 'ceo';
SELECT * FROM employees WHERE company = 'CRM';
SELECT * FROM employees WHERE name = 'alice';
SELECT * FROM employees WHERE company = 'CRM' AND status = 'new' AND loc = 'WA';

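The filtered queries above can be tried out with a minimal sketch like the following. It loads the four rows that survive intact into an in-memory SQLite table and runs three of the queries. The column names company, status, role, loc and name are taken from the queries; the label "age" for the numeric column is an assumption, and the alice row is left out because its first two values are not recoverable. SQLite is only a convenient stand-in here and does not model whatever key or index restrictions the original system places on these queries.

```python
import sqlite3

# Rows copied from the example table. The alice row is omitted because
# its company and status values are missing from the source.
ROWS = [
    ("CRM", "new", "legal", "TX", "bob", 35),
    ("CRM", "new", "ceo", "WA", "steve", 38),
    ("CRM", "old", "dev", "MD", "ben", 43),
    ("GOOG", "mid", "dev", "AZ", "mary", 25),
]


def run(conn, sql):
    """Execute a query and return all result rows as a list of tuples."""
    return conn.execute(sql).fetchall()


conn = sqlite3.connect(":memory:")
# "age" is an assumed column name; the others appear in the queries.
conn.execute(
    "CREATE TABLE employees ("
    "company TEXT, status TEXT, role TEXT, loc TEXT, name TEXT, age INTEGER)"
)
conn.executemany("INSERT INTO employees VALUES (?, ?, ?, ?, ?, ?)", ROWS)

# Each query narrows the previous one by one more column.
new_crm = run(
    conn,
    "SELECT name FROM employees "
    "WHERE company = 'CRM' AND status = 'new' ORDER BY name",
)
new_crm_ceo = run(
    conn,
    "SELECT name FROM employees "
    "WHERE company = 'CRM' AND status = 'new' AND role = 'ceo'",
)
new_crm_wa = run(
    conn,
    "SELECT name FROM employees "
    "WHERE company = 'CRM' AND status = 'new' AND loc = 'WA'",
)

print(new_crm, new_crm_ceo, new_crm_wa)
```
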
pig m/r, hive
multi-datacenter         full cross-region      multi availability zones only
idempotent write batches yes                    no
largest value supported  2GB                    64KB
conditional updates      no                     yes
backups                  snapshot, incremental  manually with EMR, S3