Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Big Data and Data Virtualization

Big Data and Data Virtualization

Gain Better insight from Big Data Using JBoss Data Virtualization

Kenneth Peeples

May 28, 2014
Tweet

More Decks by Kenneth Peeples

Other Decks in Technology

Transcript

  1. GAIN BETTER INSIGHT FROM BIG DATA USING JBOSS DATA VIRTUALIZATION

    Kenneth Peeples, JBoss Technology Evangelist [email protected] www.ossmentor.com
  2. AGENDA  Demystifying Big Data  Data Virtualization: Making Big

    Data Available to Everyone  Red Hat Big Data Strategy and Platform  Real World Customer Example using Red Hat Big Data Platform  Data Virtualization Roadmap  Demo  Q&A
  3. IT’S ALL ABOUT GAINING BUSINESS INSIGHTS  Improve product development

     Optimize business processes  Improve customer care  Improve customer lifetime value  Personalize products  Competitive intelligence  …
  4. BENEFITS OF DATA VIRTUALIZATION ON BIG DATA  Enterprise democratization

    of big data  Any reporting or analytical tool can be used  Easy access to big data  Seamless integration of big data and existing data assets  Sharing of integration specifications  Collaborative development on big data  Fine-grained of security big data  Increased time-to-market of reports on big data
  5. ABOUT LUCIDWORKS Employs 40% of the “committers” for Lucene/Solr Makes

    50% - 70% of the enhancements to each release of Lucene/Solr Only company to offer Open Source and Open Core Search Solutions
  6. LUCIDWORKS DEMONSTRATION • LucidWorks/Solr to provide full text search and

    statistics • Data Virtualization provides the data through Teiid JDBC driver and pulls the data from Hive/Hadoop, CSV File, XML File • Red Hat Storage provides the Enterprise Data Repository https://drive.google.com/file/d/0B5kKwcd4kOq9VDNPbjlqX25XN1E/edit?usp=sharing
  7. ABOUT HORTONWORKS  Founded in 2011 by 24 engineers from

    the original Yahoo! Hadoop development and operations team  Hortonworks drive innovation in the open exclusively via the Apache Software Foundation process  Hortonworks is responsible for around 50% of core code base advances to Apache Hadoop
  8. HORTONWORKS DATA PLATFORM 2 SANDBOX  Enterprise Ready YARN, the

    Hadoop Operating System  Stinger Phase 2; Interactive SQL Queries at Petabyte Scale  Reliable NoSQL IN Hadoop with Hbase  Technical Specs Component Version Apache Hadoop 2.2.0 Apache Hive 0.12.0 Apache HCatalog 0.12.0 Apache HBase 0.96.0 Apache ZooKeeper 3.4.5 Apache Pig 0.12.0 Apache Sqoop 1.4.4 Apache Flume 1.4.0 Apache Oozie 4.0.0 Apache Ambari 1.4.1 Apache Mahout 0.8.0 Hue 2.3.0