End-to-end serverless data pipeline on AWS

Use case powered by Amazon Kinesis, Lambda, Athena, and QuickSight.

Alex Casalboni

October 16, 2017

Transcript

  1. About Me (@alex_casalboni, clda.co/sls-milano-data-pipeline): Computer Science background; Master in Sound & Music Engineering; Sr. Software Engineer & Web Developer; Cloud Evangelist @ Cloud Academy
  2. Amazon Athena: Interactive SQL queries over S3; transparent compute provisioning; "Serverless Database"; results are stored on S3 too
  3. Amazon QuickSight: Business Intelligence (BI); powered by SPICE (in-memory engine); can read from Redshift, RDS, Aurora, Athena, S3, EMR, etc.; monthly subscription (not really PAYG)
  4. Real-time Fraud Detection: Real-time ingestion of credit card transactions; stream processing, data validation, and fraud detection; secure storage of transactions; real-time analysis and reporting
  5. Additional Requirements: Many heterogeneous event producers (temporary credentials); elastic architecture (no upfront costs & easy to scale up); extensible architecture (plug-and-play components); a lot of data (cheap storage, please!)
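
The stream processing, data validation, and fraud detection step from the use case could look roughly like the Lambda handler below. This is only a sketch under stated assumptions: the slides do not show the actual function, so the transaction fields (`transaction_id`, `card_id`, `amount`), the amount threshold, and the shape of the returned summary are all hypothetical. The one thing taken from AWS itself is the event layout, where each Kinesis record arrives base64-encoded under `Records[i].kinesis.data`.

```python
import base64
import json

# Hypothetical rule: the deck does not describe its detection logic,
# so a simple amount threshold stands in for it here.
FRAUD_AMOUNT_THRESHOLD = 10_000.0

# Hypothetical schema for a credit card transaction event.
REQUIRED_FIELDS = ("transaction_id", "card_id", "amount")


def decode_record(record):
    """Decode one Kinesis record from a Lambda event into a Python object."""
    payload = base64.b64decode(record["kinesis"]["data"])
    return json.loads(payload)


def is_valid(tx):
    """Return True if tx is a dict with all required fields and a non-negative numeric amount."""
    if not isinstance(tx, dict):
        return False
    if not all(field in tx for field in REQUIRED_FIELDS):
        return False
    amount = tx["amount"]
    if isinstance(amount, bool) or not isinstance(amount, (int, float)):
        return False
    return amount >= 0


def handler(event, context):
    """Lambda entry point: split a batch into valid, invalid, and suspicious transactions."""
    result = {"valid": [], "invalid": 0, "suspicious": []}
    for record in event.get("Records", []):
        try:
            tx = decode_record(record)
        except (KeyError, TypeError, ValueError):
            # Missing keys, bad base64, or malformed JSON all count as invalid.
            result["invalid"] += 1
            continue
        if not is_valid(tx):
            result["invalid"] += 1
            continue
        result["valid"].append(tx)
        # Suspicious transactions are a subset of the valid ones.
        if tx["amount"] > FRAUD_AMOUNT_THRESHOLD:
            result["suspicious"].append(tx)
    return result
```

In a real pipeline the valid transactions would presumably be written to S3 for Athena to query, and the suspicious ones forwarded to an alerting component; both steps are left out here because the slides above do not specify how they are wired.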