A short introduction to Apache Falcon: what is it and what is it used for?
How can it help with Hadoop-based data life cycle management? What is its
architecture and what are the benefits of using it?
life cycle management framework
• Created for Hadoop
• Logic based in Falcon rather than apps
• Simplifies data management
• Developed by InMobi and Hortonworks
• Falcon can manage
  – Workflows
  – Replication
  – Provides data abstraction
www.semtech-solutions.co.nz [email protected]
time
• Reduce costs
• No need to re-implement functionality
  – Already in Falcon
  – Already tested
• Use a single Falcon configuration file to
  – Define replication points
  – Define data processing pipeline
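As a rough illustration of defining replication points in configuration rather than in application code, the Python sketch below builds a simplified, Falcon-style feed definition as XML. The element and attribute names (feed, clusters, cluster with type source or target, retention, locations) are written from memory of Falcon's feed entity format and are deliberately simplified, so check them against the Falcon documentation before use. The function build_feed, the feed name, the cluster names and the HDFS path are hypothetical.

```python
import xml.etree.ElementTree as ET


def build_feed(name, path, source, target, retention_days):
    """Build a simplified Falcon-style feed definition (illustrative only).

    The layout approximates a Falcon feed entity; it omits required parts
    of the real schema such as the namespace, validity windows and ACLs.
    """
    feed = ET.Element("feed", {"name": name})
    ET.SubElement(feed, "frequency").text = "hours(1)"

    # One source cluster and one target cluster: data landing on the
    # source is what a replication policy would copy to the target.
    clusters = ET.SubElement(feed, "clusters")
    for cluster_name, role in ((source, "source"), (target, "target")):
        cluster = ET.SubElement(
            clusters, "cluster", {"name": cluster_name, "type": role}
        )
        ET.SubElement(
            cluster,
            "retention",
            {"limit": f"days({retention_days})", "action": "delete"},
        )

    # Where the feed's data lives on each cluster.
    locations = ET.SubElement(feed, "locations")
    ET.SubElement(locations, "location", {"type": "data", "path": path})
    return feed


# Hypothetical names, used only for illustration.
feed = build_feed(
    name="presentedData",
    path="/data/presented",
    source="primaryCluster",
    target="backupCluster",
    retention_days=7,
)
feed_xml = ET.tostring(feed, encoding="unicode")
print(feed_xml)
```

The point of the sketch is that the replication source, target and retention period are plain data in one definition, so changing them does not require touching the applications that read or write the feed.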
work flow
• Falcon used to manage cluster data replication
• BI example
  – Staged and presented data replicated
  – Presented data visible for
    • Reporting
    • Analytics
• See next slide
www.semtech-solutions.co.nz – [email protected]
• We offer IT project consultancy
• We are happy to hear about your problems
• You pay only for the hours you need to solve your problems