Slide 1

Slide 1 text

Apache Falcon
● What is it?
● Benefits
● Architecture
● Example
www.semtech-solutions.co.nz [email protected]

Slide 2

Slide 2 text

Apache Falcon – What is it?
● A data lifecycle management framework
● Created for Hadoop
● Data management logic lives in Falcon rather than in individual apps
● Simplifies data management
● Developed by InMobi and Hortonworks
● Falcon can manage
  – Workflows
  – Replication
● Provides data abstraction

Slide 3

Slide 3 text

Apache Falcon – What is it?
● Falcon provides services
  – Data import / replication
  – Scheduling / coordination
  – Lifecycle policies
  – Cluster management
  – SLA management
● An enterprise solution for data lifecycle management
● Currently an Apache incubator project
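As an illustration of the "lifecycle policies" service listed above, here is a minimal Python sketch of the idea behind a retention policy: given the timestamps of stored data instances and a retention window, pick the instances that have aged out. The function name `instances_to_evict` and the sample dates are hypothetical and are not part of Falcon; Falcon expresses such policies declaratively in its own configuration rather than in application code.

```python
from datetime import datetime, timedelta

def instances_to_evict(instance_times, now, retention):
    """Return the instance timestamps older than the retention window.

    Hypothetical helper for illustration only; not a Falcon API.
    """
    cutoff = now - retention
    return sorted(t for t in instance_times if t < cutoff)

# Sample data: four daily instances and a three-day retention window.
now = datetime(2014, 6, 10)
instances = [datetime(2014, 6, d) for d in (1, 5, 8, 9)]
expired = instances_to_evict(instances, now, timedelta(days=3))
print([t.strftime("%Y-%m-%d") for t in expired])
```

Keeping this rule in one place, rather than re-coding it in every application, is the point the slides make about moving logic into Falcon.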

Slide 4

Slide 4 text

Apache Falcon – Benefits
● Reduce workflow / ETL development time
● Reduce costs
● No need to reimplement functionality
  – Already in Falcon
  – Already tested
● Use a single Falcon configuration file to
  – Define replication points
  – Define data processing pipeline
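To make the configuration idea more concrete, the Python sketch below generates a small XML document modelled on a Falcon feed entity, with a source cluster, a target cluster for replication, and a retention limit. Treat it as an approximation: the element and attribute names follow the Falcon feed schema as best understood here, a real feed needs further elements (for example ACL and schema) that are omitted, and the feed name, cluster names, dates, and path are all made-up values. Check the Falcon entity specification for your version before relying on it.

```python
import xml.etree.ElementTree as ET

# Namespace believed to be used by Falcon feed entities; verify for your version.
FEED_NS = "uri:falcon:feed:0.1"

def build_feed(name, path, source, target, retention_limit):
    """Build a simplified, feed-like XML element (illustrative, incomplete)."""
    feed = ET.Element("feed", {"name": name, "xmlns": FEED_NS})
    ET.SubElement(feed, "frequency").text = "hours(1)"

    # One source cluster and one target cluster: the "replication points".
    clusters = ET.SubElement(feed, "clusters")
    for cluster_name, cluster_type in ((source, "source"), (target, "target")):
        cluster = ET.SubElement(
            clusters, "cluster", {"name": cluster_name, "type": cluster_type}
        )
        ET.SubElement(
            cluster, "validity",
            {"start": "2014-01-01T00:00Z", "end": "2015-01-01T00:00Z"},
        )
        # Retention: delete instances older than the given limit.
        ET.SubElement(
            cluster, "retention", {"limit": retention_limit, "action": "delete"}
        )

    locations = ET.SubElement(feed, "locations")
    ET.SubElement(locations, "location", {"type": "data", "path": path})
    return feed

# All values below are hypothetical examples.
feed = build_feed(
    name="presentedData",
    path="/data/presented/${YEAR}-${MONTH}-${DAY}",
    source="primaryCluster",
    target="backupCluster",
    retention_limit="days(90)",
)
print(ET.tostring(feed, encoding="unicode"))
```

A definition of this kind is submitted to Falcon, which then takes care of scheduling the replication and applying the retention rule, so neither has to be written into the applications themselves.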

Slide 5

Slide 5 text

Apache Falcon – Architecture

Slide 6

Slide 6 text

Apache Falcon – BI Example
● Falcon used to manage workflow
● Falcon used to manage cluster data replication
● BI example
  – Staged and presented data replicated
  – Presented data visible for
    ● Reporting
    ● Analytics
● See next slide

Slide 7

Slide 7 text

Apache Falcon – BI Example

Slide 8

Slide 8 text

Contact Us
● Feel free to contact us at
  – www.semtech-solutions.co.nz
  – [email protected]
● We offer IT project consultancy
● We are happy to hear about your problems
● You pay only for the hours you need to solve your problems