Upgrade to Pro — share decks privately, control downloads, hide ads and more …

AppsFlyer Data Architecture

Sponsored · SiteGround - Reliable hosting with speed, security, and support you can count on.

AppsFlyer Data Architecture

Avatar for AppsFlyer

AppsFlyer

May 21, 2015
Tweet

More Decks by AppsFlyer

Other Decks in Technology

Transcript

  1. Dis$lling  insights  @              

                              Arnon  Rotem-­‐Gal-­‐Oz   Chief  Data  Officer  
  2. Kafka Columnar Database (Redshift- evaluating Vertica) IMDG (Ignite - evaluating

    Geode) Secor Spark Aggregations SparkSQL (evaluating Drill, Presto) SQL SQL Raw (sequence files) DW (parquet files) DM (Aggregations) Application dashboard Self-serve BI (TBD) Spark ETL Spark Spark ML Latest Events Scoring exploration Agg. logic Internal tools installs clicks inapp launches Accounts
  3. Data’s  hierarchy  of  needs*   *With  apologies  to  Maslow  

    Acted   upon   presented   Dis$lled   Usable   Accessible   Exist  
  4. Kafka Columnar Database (Redshift- evaluating Vertica) IMDG (Ignite - evaluating

    Geode) Secor Spark Aggregations SparkSQL (evaluating Drill, Presto) SQL SQL Raw (sequence files) DW (parquet files) DM (Aggregations) Application dashboard Self-serve BI (TBD) Spark ETL Spark Spark ML Latest Events Scoring exploration Agg. logic Internal tools installs clicks inapp launches Accounts
  5. Kafka Columnar Database (Redshift- evaluating Vertica) IMDG (Ignite - evaluating

    Geode) Secor Spark Aggregations SparkSQL (evaluating Drill, Presto) SQL SQL Raw (sequence files) DW (parquet files) DM (Aggregations) Application dashboard Self-serve BI (TBD) Spark ETL Spark Spark ML Latest Events Scoring exploration Agg. logic Internal tools installs clicks inapp launches Accounts
  6. Kafka Columnar Database (Redshift- evaluating Vertica) IMDG (Ignite - evaluating

    Geode) Secor Spark Aggregations SparkSQL (evaluating Drill, Presto) SQL SQL Raw (sequence files) DW (parquet files) DM (Aggregations) Application dashboard Self-serve BI (TBD) Spark ETL Spark Spark ML Latest Events Scoring exploration Agg. logic Internal tools installs clicks inapp launches Accounts
  7. Kafka Columnar Database (Redshift- evaluating Vertica) IMDG (Ignite - evaluating

    Geode) Secor Spark Aggregations SparkSQL (evaluating Drill, Presto) SQL SQL Raw (sequence files) DW (parquet files) DM (Aggregations) Application dashboard Self-serve BI (TBD) Spark ETL Spark Spark ML Latest Events Scoring exploration Agg. logic Internal tools installs clicks inapp launches Accounts
  8. Kafka Columnar Database (Redshift- evaluating Vertica) IMDG (Ignite - evaluating

    Geode) Secor Spark Aggregations SparkSQL (evaluating Drill, Presto) SQL SQL Raw (sequence files) DW (parquet files) DM (Aggregations) Application dashboard Self-serve BI (TBD) Spark ETL Spark Spark ML Latest Events Scoring exploration Agg. logic Internal tools installs clicks inapp launches Accounts
  9. RT  insights     Predic$ve       Prescrip$ve  

      Dashboards     whatnot     presented