Adopting a Big-Data mindset and establishing a capable data infrastructure to support it is a challenge. This session, hosted by iHeartRadio, will introduce current technologies (Hadoop/Hive/Impala/Parquet, Luigi, Kafka/Flume, ElasticSearch/Kibana), along with a few examples of data and machine learning projects that leverage them. We’ll leave the realm of theory and also go into operationalization strategies, discuss cluster configurations and automated deployment using Chef.
Presented by Pasha Katsev