End-to-end serverless data pipeline on AWS
Alex Casalboni
October 16, 2017
Use case powered by Amazon Kinesis, Lambda, Athena, and QuickSight.
Transcript
End-to-end serverless data pipeline on AWS. Powered by Amazon Kinesis, Lambda, Athena, and QuickSight. 10/19/2017. clda.co/sls-milano-data-pipeline
About Me: @alex_casalboni; Computer Science background; Master in Sound & Music Engineering; Sr. Software Engineer & Web Developer; Cloud Evangelist @ Cloud Academy
Call for Speakers! clda.co/serverless-italy
Agenda: Prerequisites; Use Case / Requirements; Serverless Architecture; Hands-on Demo
Prerequisites: a brief introduction to each service
AWS Lambda: FaaS (Function as a Service); Node.js, Python, Java, C#, etc.; event-driven
Amazon S3: PB-scale object storage; 99.999999999% durability; SSE support; can trigger Lambda (per-object)
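The per-object trigger mentioned on this slide can be sketched as a Lambda handler that reads the bucket name and object key from an S3 event notification. This is a minimal illustration, not code from the talk: the handler name, the bucket, and the key in `sample_event` are assumptions made for the example.

```python
from urllib.parse import unquote_plus


def handler(event, context):
    """Return a (bucket, key) pair for every object in an S3 notification."""
    objects = []
    for record in event.get("Records", []):
        bucket = record["s3"]["bucket"]["name"]
        # Object keys arrive URL-encoded in S3 event notifications.
        key = unquote_plus(record["s3"]["object"]["key"])
        objects.append((bucket, key))
    return objects


# Hypothetical event with a single uploaded object (names are made up).
sample_event = {
    "Records": [
        {
            "s3": {
                "bucket": {"name": "example-bucket"},
                "object": {"key": "transactions/2017/10/file+1.json"},
            }
        }
    ]
}

print(handler(sample_event, None))
```

Running the block locally only exercises the parsing logic; in AWS the same function would be invoked by the S3 notification itself.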
Amazon Kinesis (Streams): real-time data ingestion; shards scale up/down; almost serverless; can trigger Lambda (batch)
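A minimal sketch of the batch trigger: Lambda receives a list of Kinesis records whose payloads are base64-encoded, so the handler decodes each one before processing. The JSON payload shape, the `make_record` helper, and the sample values are assumptions for illustration; the deck does not show the demo's record format.

```python
import base64
import json


def handler(event, context):
    """Decode every Kinesis record in the batch into a Python dict."""
    transactions = []
    for record in event.get("Records", []):
        # Kinesis delivers the record payload base64-encoded.
        payload = base64.b64decode(record["kinesis"]["data"])
        transactions.append(json.loads(payload))
    return transactions


def make_record(obj):
    """Hypothetical helper: build a Kinesis-style record for local testing."""
    data = base64.b64encode(json.dumps(obj).encode("utf-8")).decode("ascii")
    return {"kinesis": {"data": data, "partitionKey": "demo"}}


sample_event = {
    "Records": [
        make_record({"id": 1, "amount": 12.5}),
        make_record({"id": 2, "amount": 990.0}),
    ]
}

print(handler(sample_event, None))
```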
Amazon Athena: interactive SQL queries over S3; transparent compute provisioning; "Serverless Database"; results are stored on S3 too
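As a rough sketch of how a query could be submitted, the block below builds the arguments for the boto3 Athena client's `start_query_execution` call, including the S3 location where Athena writes results. The table, database, and bucket names are placeholders, not values from the talk, and `run_query` needs boto3 and AWS credentials to actually execute.

```python
def build_athena_request(sql, database, output_location):
    """Assemble keyword arguments for Athena's start_query_execution."""
    return {
        "QueryString": sql,
        "QueryExecutionContext": {"Database": database},
        # Athena writes query results to this S3 location.
        "ResultConfiguration": {"OutputLocation": output_location},
    }


def run_query(request):
    """Submit the query and return its execution id (requires AWS access)."""
    import boto3  # imported lazily so the builder works without boto3 installed

    athena = boto3.client("athena")
    return athena.start_query_execution(**request)["QueryExecutionId"]


# Placeholder names, assumed for this example only.
request = build_athena_request(
    "SELECT card_id, COUNT(*) AS n FROM transactions GROUP BY card_id",
    "fraud_demo",
    "s3://example-athena-results/",
)

print(request)
```

Splitting the request builder from the call keeps the query parameters easy to inspect without touching AWS.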
Amazon QuickSight: Business Intelligence (BI); powered by SPICE (in-memory engine); can read from Redshift, RDS, Aurora, Athena, S3, EMR, etc.; monthly subscription (not really PAYG)
AWS IAM: Identity & Access Management; very granular permissions; free service (use it!)
Amazon Cognito: federated identities; role-based temporary credentials; Data Sync and User Management (Pools); complex but useful (use it!)
Use Case / Requirements: description of the use case
Real-time Fraud Detection: real-time ingestion of credit card transactions; stream processing, data validation, and fraud detection; secure storage of transactions; real-time analysis and reporting
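The validation and fraud-detection step could look roughly like the sketch below. The deck does not describe the actual detection logic, so the required fields, the amount threshold, and the single-rule check are all placeholder assumptions standing in for whatever the demo uses.

```python
# Hypothetical schema and threshold, assumed for illustration only.
REQUIRED_FIELDS = ("card_id", "amount", "timestamp")
AMOUNT_THRESHOLD = 1000.0


def validate(tx):
    """A transaction is valid if it has all fields and a positive numeric amount."""
    if not all(field in tx for field in REQUIRED_FIELDS):
        return False
    amount = tx["amount"]
    is_number = isinstance(amount, (int, float)) and not isinstance(amount, bool)
    return is_number and amount > 0


def is_suspicious(tx):
    """Placeholder rule: flag any transaction at or above the threshold."""
    return tx["amount"] >= AMOUNT_THRESHOLD


def process(batch):
    """Split a batch into (valid, flagged, rejected) transactions."""
    valid, flagged, rejected = [], [], []
    for tx in batch:
        if not validate(tx):
            rejected.append(tx)
        elif is_suspicious(tx):
            flagged.append(tx)
        else:
            valid.append(tx)
    return valid, flagged, rejected


batch = [
    {"card_id": "c1", "amount": 25.0, "timestamp": "2017-10-19T10:00:00Z"},
    {"card_id": "c2", "amount": 5000.0, "timestamp": "2017-10-19T10:00:01Z"},
    {"card_id": "c3", "amount": -3.0, "timestamp": "2017-10-19T10:00:02Z"},
    {"card_id": "c4"},
]

valid, flagged, rejected = process(batch)
print(len(valid), len(flagged), len(rejected))
```

A real detector would likely use per-card history or a trained model rather than one fixed threshold; the structure of validate, classify, and route is the part this sketch is meant to show.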
Additional Requirements: many heterogeneous event producers (temporary credentials); elastic architecture (no upfront costs & easy to scale up); extensible architecture (plug-and-play components); a lot of data (cheap storage, please!)
Serverless Architecture: how do we implement it without managing servers?
Architecture Diagram
Ingestion & Processing (simple)
Ingestion & Processing (optimized)
Ingestion & Processing (real-time analytics)
Serverless Data Analysis
Hands-on Demo: let's get our hands a little dirty
< demo >
Thank you =) Questions? 10/19/2017