Big Data • Developed by Google • Based on Google File System ( GFS ) • Provides transactions and locking • Faster than comparable Map Reduce • Developed by Google due to MapReduce limitations www.semtech-solutions.co.nz [email protected]
updates • No need to batch process • Update as data received • Data in multi petabyte range • Strong consistency needed • Improved latency ( 100 x ) • Reduced document age ( 50 % ) • Random access to big data repository www.semtech-solutions.co.nz [email protected]
transactions • Latency A • Run time scales with data • Code in C++ • Open source • Uses HDFS Percolator • Iterative • Transactions • Latency 100 x A • Incremental updates • Code in Java ( mainly ) • Google owned • Uses GFS www.semtech-solutions.co.nz [email protected]
• An observer is called via a notification • A notification is triggered when table data changes • Application calls TabletServer via RPC • TabletServer calls GFS ChunkServer
www.semtech-solutions.co.nz – [email protected] • We offer IT project consultancy • We are happy to hear about your problems • You can just pay for those hours that you need • To solve your problems