
Big Data and Data Virtualization


Gain Better Insight from Big Data Using JBoss Data Virtualization


Kenneth Peeples

May 28, 2014

Transcript

  1. GAIN BETTER INSIGHT FROM BIG DATA USING JBOSS DATA VIRTUALIZATION

    Kenneth Peeples, JBoss Technology Evangelist, kpeeples@redhat.com, www.ossmentor.com
  2. AGENDA

    • Demystifying Big Data
    • Data Virtualization: Making Big Data Available to Everyone
    • Red Hat Big Data Strategy and Platform
    • Real World Customer Example using Red Hat Big Data Platform
    • Data Virtualization Roadmap
    • Demo
    • Q&A
  3. DO WE AGREE ON WHAT BIG DATA IS?

  4. Source: http://blogs.ifsworld.com/2013/02/how-will-big-data-influence-your-finance-team/

  5. IT’S ALL ABOUT GAINING BUSINESS INSIGHTS

    • Improve product development
    • Optimize business processes
    • Improve customer care
    • Improve customer lifetime value
    • Personalize products
    • Competitive intelligence
    • …
  6. None
  7. None
  8. None
  9. BIG DATA FOR EVERYONE

  10. None
  11. None
  12. BENEFITS OF DATA VIRTUALIZATION ON BIG DATA

    • Enterprise democratization of big data
    • Any reporting or analytical tool can be used
    • Easy access to big data
    • Seamless integration of big data and existing data assets
    • Sharing of integration specifications
    • Collaborative development on big data
    • Fine-grained security of big data
    • Improved time-to-market of reports on big data
  13. None
  14. None
  15. None
  16. EXAMPLES: RED HAT BIG DATA PLATFORM IN THE REAL WORLD

  17. None
  18. None
  19. None
  20. JBOSS DATA VIRTUALIZATION PRODUCT ROADMAP AND BIG DATA

  21. WHAT’S COMING: JBOSS DATA VIRTUALIZATION 6.1

  22. DEMOS: LUCIDWORKS, JBOSS DATA VIRTUALIZATION AND RED HAT STORAGE

  23. ABOUT LUCIDWORKS

    • Employs 40% of the “committers” for Lucene/Solr
    • Makes 50% - 70% of the enhancements to each release of Lucene/Solr
    • Only company to offer Open Source and Open Core Search Solutions
  24. LUCENE/SOLR: ENABLING BETTER, DATA-DRIVEN DECISIONS

  25. LUCIDWORKS DEMONSTRATION

    • LucidWorks/Solr provides full-text search and statistics
    • Data Virtualization provides the data through the Teiid JDBC driver and pulls it from Hive/Hadoop, a CSV file, and an XML file
    • Red Hat Storage provides the Enterprise Data Repository

    https://drive.google.com/file/d/0B5kKwcd4kOq9VDNPbjlqX25XN1E/edit?usp=sharing
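
    The idea behind the demonstration above (one SQL interface over a Hive/Hadoop source, a CSV file, and an XML file) can be illustrated with a toy sketch. This is not Teiid or JBoss Data Virtualization: it uses only the Python standard library, an in-memory SQLite database stands in for Hive, and unlike a real virtualization layer it copies the data rather than querying the sources in place. All table names, column names, and sample values are hypothetical.

```python
import csv
import io
import sqlite3
import xml.etree.ElementTree as ET

# Hypothetical sample data; none of it comes from the demo itself.
CSV_TEXT = "customer_id,name\n1,Alice\n2,Bob\n"
XML_TEXT = (
    "<regions>"
    "<region customer_id='1'>East</region>"
    "<region customer_id='2'>West</region>"
    "</regions>"
)


def build_virtual_view():
    """Expose three differently shaped sources behind one SQL connection."""
    conn = sqlite3.connect(":memory:")

    # Stand-in for the Hive/Hadoop source: a table of order facts.
    conn.execute("CREATE TABLE orders (customer_id INTEGER, amount REAL)")
    conn.executemany(
        "INSERT INTO orders VALUES (?, ?)",
        [(1, 10.0), (1, 15.0), (2, 7.5)],
    )

    # CSV source: customer names.
    conn.execute("CREATE TABLE customers (customer_id INTEGER, name TEXT)")
    reader = csv.DictReader(io.StringIO(CSV_TEXT))
    conn.executemany(
        "INSERT INTO customers VALUES (?, ?)",
        [(int(row["customer_id"]), row["name"]) for row in reader],
    )

    # XML source: customer regions.
    conn.execute("CREATE TABLE regions (customer_id INTEGER, region TEXT)")
    root = ET.fromstring(XML_TEXT)
    conn.executemany(
        "INSERT INTO regions VALUES (?, ?)",
        [(int(el.get("customer_id")), el.text) for el in root.findall("region")],
    )

    # The unified layer: one view that joins all three sources.
    conn.execute(
        """
        CREATE VIEW customer_totals AS
        SELECT c.name, r.region, SUM(o.amount) AS total
        FROM orders o
        JOIN customers c ON c.customer_id = o.customer_id
        JOIN regions r ON r.customer_id = o.customer_id
        GROUP BY c.name, r.region
        """
    )
    return conn


def query_totals(conn):
    """Query the unified view as a reporting or search tool would."""
    return conn.execute(
        "SELECT name, region, total FROM customer_totals ORDER BY name"
    ).fetchall()


if __name__ == "__main__":
    for row in query_totals(build_virtual_view()):
        print(row)
```

    A consumer only sees the `customer_totals` view and does not need to know which rows came from which source, which is the role the slide assigns to the data virtualization layer.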
  26. DEMONSTRATION ARCHITECTURE

  27. DEMOS: HORTONWORKS AND JBOSS DATA VIRTUALIZATION

  28. ABOUT HORTONWORKS

    • Founded in 2011 by 24 engineers from the original Yahoo! Hadoop development and operations team
    • Hortonworks drives innovation in the open exclusively via the Apache Software Foundation process
    • Hortonworks is responsible for around 50% of core code base advances to Apache Hadoop
  29. HORTONWORKS DATA PLATFORM 2 SANDBOX

    • Enterprise Ready YARN, the Hadoop Operating System
    • Stinger Phase 2; Interactive SQL Queries at Petabyte Scale
    • Reliable NoSQL in Hadoop with HBase

    Technical Specs

    Component           Version
    Apache Hadoop       2.2.0
    Apache Hive         0.12.0
    Apache HCatalog     0.12.0
    Apache HBase        0.96.0
    Apache ZooKeeper    3.4.5
    Apache Pig          0.12.0
    Apache Sqoop        1.4.4
    Apache Flume        1.4.0
    Apache Oozie        4.0.0
    Apache Ambari       1.4.1
    Apache Mahout       0.8.0
    Hue                 2.3.0
  30. Currently processing; to be published soon: https://vimeo.com/97126145

  31. (Contains Video as well) http://www.slideshare.net/opensourcementor/strata-conference

  32. None
  33. None
  34. None
  35. None
  36. None
  37. None
  38. None
  39. THANK YOU Q & A