Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Separating Hadoop Myths from Reality by ROB AND...

Big Data Spain
December 19, 2013

Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013

According to Gartner, Hadoop is near the top of the Hype Cycle. While some customers have questions about the enterprise capabilities of Hadoop, the answers are clear as production deployments continue to expand. This session will use successful customer experiences to highlight the power of Hadoop and separate the myths from reality.

Big Data Spain

December 19, 2013
Tweet

More Decks by Big Data Spain

Other Decks in Business

Transcript

  1. 1   The  Myths  &  Reali.es   Surrounding  Hadoop  

      Rob  Anderson   VP  Systems  Engineering  
  2. 2   Sales   SCM   CRM   Public  

    Web  Logs   Produc7on   Data   Sensor     Data   Click   Streams   Loca7on   Social   Media   Billing   Enterprise   Data  Hub   Hadoop  Changes  Analy.cs   “Simple  algorithms  and  lots  of  data  trump   complex  models  ”   Halevy,    Norvig,  and    Pereira,  Google   IEEE  Intelligent  Systems    
  3. 8   What  was  the  genius  of  Hadoop?   § 

    Fueling  an  industry  revolu7on   by  providing  infinite  capability   to  store  and  process  big  data   §  Expanding  analy7cs  across  data   types   §  Compelling  economics   –   20  to  100X  more  cost  effec7ve   than  alterna7ves  
  4. 10   Random  Wri.ng  in  MapR   S1 S2 S3

    S5 S4 S1, S2, S4 S1, S3 S1, S4, S5 S2, S4, S5 S3 Client   wri.ng   data   CLDB   Ask  for   64M  block   Create  cont.   Picks  master   and  2  replica  slaves   Write   next  chunk   to  S2   S2, S3, S5 aZach  
  5. 12   MapR   Spout   TwiZer   TwiZer  

       API   TwiZerLogger   Storm         MapR   Op7onal   MapReduce   DFS  
  6. MapR  Data  System   Architecture  Comparison   HBase   JVM

      HDFS   JVM   ext3/ext4   Disks   Other  Distribu7ons   Disks   MapR  M7