What is Apache Hive in terms of big data and Hadoop?
How does it relate to business intelligence and
management reporting? Can it be used with Business
Objects?
for Hadoop • Open source, written in Java • Holds metadata in a relational database • Allows SQL-like queries • Supports “big data” data sets • Offers built-in and user-defined functions • Has indexing
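As a rough illustration of the features listed above, the following HiveQL sketch defines a partitioned table and queries it with built-in functions. The table name customer, the age column and the sdate partition column are taken from the queries in this deck; the name column and the comma delimiter are assumptions made only for this example.

```sql
-- Hypothetical table definition. Hive records this schema in its
-- metastore, which is held in a relational database.
CREATE TABLE IF NOT EXISTS customer (
  name STRING,   -- assumed column, for illustration only
  age  INT
)
PARTITIONED BY (sdate STRING)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ','   -- assumed file layout
STORED AS TEXTFILE;

-- SQL-like query using the built-in functions count() and avg(),
-- restricted to a single partition.
SELECT count(*) AS row_count, avg(a.age) AS avg_age
FROM customer a
WHERE a.sdate = '2008-08-15';
```

Filtering on the partition column lets Hive read only the files for that partition rather than the whole table.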
cluster • Then add Hive and the metadata structure • Use Flume and Sqoop to move data • Use the Hive LOAD DATA command to load from flat files • Use ODBC for connectivity to your BI layer
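The LOAD DATA step above might look like the following sketch. It assumes a customer table partitioned by sdate already exists, as in the queries in this deck; the file path is a hypothetical placeholder.

```sql
-- Load a flat file from the local file system into one partition.
-- '/tmp/customer_2008-08-15.csv' is a hypothetical path.
LOAD DATA LOCAL INPATH '/tmp/customer_2008-08-15.csv'
OVERWRITE INTO TABLE customer
PARTITION (sdate = '2008-08-15');

-- Quick check that the partition now contains rows.
SELECT count(*) FROM customer a WHERE a.sdate = '2008-08-15';
```

LOAD DATA copies or moves the file into the table's storage location without parsing it, so the file layout has to match the table definition. Without the LOCAL keyword the path refers to HDFS and the file is moved rather than copied.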
a.age FROM customer a WHERE a.sdate = '2008-08-15';
selects the listed columns from all rows of the sdate = '2008-08-15' partition; the result is displayed but not stored
hive> INSERT OVERWRITE DIRECTORY '/data/hdfs_file' SELECT a.* FROM customer a WHERE a.sdate = '2008-08-15';
writes all columns of the customer rows in that partition to an HDFS directory
to your BI layer • Now you can use BI tools like Business Objects – Create a universe over the Hive instance – Create reports against the universe – Create ad hoc queries against the universe
www.semtech-solutions.co.nz • We offer IT project consultancy • We are happy to hear about your problems • You pay only for the hours you need to solve your problems