world o No Hadoop • Hired as analytics developer at Polar Mobile in 2009 o Designed Hive-based mobile analytics platform o Ended up as Director of Engineering • Engineer at Kontagent o Yellow elephant tamer o Working on new analytics platform based on Hadoop Who is this guy? Data Processing with Hive and Cascading
insight into some aspect of your business and produce actionable results. What is Analytics? Data Processing with Hive and Cascading 0 50 100 150 200 250 300 0 10 20 30 40 50 60 70 80 90 100 Cumulative Spend Days Since Install
more natural for humans o Hive o Pig o Cascading • Frameworks provide alternative computational models o Declarative – Let the planner do the work o Imperative but not MapReduce – Some input over execution Hadoop MapReduce Frameworks Data Processing with Hive and Cascading
runs on MapReduce • Many Domain-Specific Languages available o Cascalog o Scalding o Lingual o PyCascading o Cascading.JRuby Cascading Data Processing with Hive and Cascading