Upgrade to Pro — share decks privately, control downloads, hide ads and more …

AppsFlyer Data Architecture

AppsFlyer Data Architecture

AppsFlyer

May 21, 2015
Tweet

More Decks by AppsFlyer

Other Decks in Technology

Transcript

  1. Dis$lling  insights  @              

                              Arnon  Rotem-­‐Gal-­‐Oz   Chief  Data  Officer  
  2. Kafka Columnar Database (Redshift- evaluating Vertica) IMDG (Ignite - evaluating

    Geode) Secor Spark Aggregations SparkSQL (evaluating Drill, Presto) SQL SQL Raw (sequence files) DW (parquet files) DM (Aggregations) Application dashboard Self-serve BI (TBD) Spark ETL Spark Spark ML Latest Events Scoring exploration Agg. logic Internal tools installs clicks inapp launches Accounts
  3. Data’s  hierarchy  of  needs*   *With  apologies  to  Maslow  

    Acted   upon   presented   Dis$lled   Usable   Accessible   Exist  
  4. Kafka Columnar Database (Redshift- evaluating Vertica) IMDG (Ignite - evaluating

    Geode) Secor Spark Aggregations SparkSQL (evaluating Drill, Presto) SQL SQL Raw (sequence files) DW (parquet files) DM (Aggregations) Application dashboard Self-serve BI (TBD) Spark ETL Spark Spark ML Latest Events Scoring exploration Agg. logic Internal tools installs clicks inapp launches Accounts
  5. Kafka Columnar Database (Redshift- evaluating Vertica) IMDG (Ignite - evaluating

    Geode) Secor Spark Aggregations SparkSQL (evaluating Drill, Presto) SQL SQL Raw (sequence files) DW (parquet files) DM (Aggregations) Application dashboard Self-serve BI (TBD) Spark ETL Spark Spark ML Latest Events Scoring exploration Agg. logic Internal tools installs clicks inapp launches Accounts
  6. Kafka Columnar Database (Redshift- evaluating Vertica) IMDG (Ignite - evaluating

    Geode) Secor Spark Aggregations SparkSQL (evaluating Drill, Presto) SQL SQL Raw (sequence files) DW (parquet files) DM (Aggregations) Application dashboard Self-serve BI (TBD) Spark ETL Spark Spark ML Latest Events Scoring exploration Agg. logic Internal tools installs clicks inapp launches Accounts
  7. Kafka Columnar Database (Redshift- evaluating Vertica) IMDG (Ignite - evaluating

    Geode) Secor Spark Aggregations SparkSQL (evaluating Drill, Presto) SQL SQL Raw (sequence files) DW (parquet files) DM (Aggregations) Application dashboard Self-serve BI (TBD) Spark ETL Spark Spark ML Latest Events Scoring exploration Agg. logic Internal tools installs clicks inapp launches Accounts
  8. Kafka Columnar Database (Redshift- evaluating Vertica) IMDG (Ignite - evaluating

    Geode) Secor Spark Aggregations SparkSQL (evaluating Drill, Presto) SQL SQL Raw (sequence files) DW (parquet files) DM (Aggregations) Application dashboard Self-serve BI (TBD) Spark ETL Spark Spark ML Latest Events Scoring exploration Agg. logic Internal tools installs clicks inapp launches Accounts
  9. RT  insights     Predic$ve       Prescrip$ve  

      Dashboards     whatnot     presented