Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
End-to-end serverless data pipeline on AWS
Search
Alex Casalboni
October 16, 2017
Technology
1
95
End-to-end serverless data pipeline on AWS
Use case powered by Amazon Kinesis, Lambda, Athena, and QuickSight.
Alex Casalboni
October 16, 2017
Tweet
Share
More Decks by Alex Casalboni
See All by Alex Casalboni
Data-driven & cost-aware performance tuning for AWS Lambda
alexcasalboni
0
200
AWS Lambda function performance tuning - AWS re:Invent 2019
alexcasalboni
1
1.1k
Supercharge product development with cloud best practices - Web Summit 2019
alexcasalboni
0
86
AWS data services for machine learning - AWS Innovate Online
alexcasalboni
0
54
Advanced Serverless Architectural Patterns on AWS [Devoxx Poland]
alexcasalboni
0
230
Building a Modern Data Platform in the Cloud [AWS Dev Day @ Kyiv]
alexcasalboni
0
110
How to build a global serverless service [AWS Dev Day @ Kyiv]
alexcasalboni
0
52
Become a machine learning developer using AWS Machine Learning Services [AWS Summit @ Warsaw]
alexcasalboni
0
110
Getting started with AWS for Developers [Collision @ Toronto]
alexcasalboni
0
71
Other Decks in Technology
See All in Technology
エムスリーマネジメントチーム紹介資料 / Introduction of M3 Management Team
m3_engineering
0
270
15 JSON serializers for Ruby
okuramasafumi
2
100
テストコードの品質を客観的な数値で担保しよう〜Mutation Testのすすめ〜
ysknsid25
11
3.1k
エムスリー全チーム紹介資料 / Introduction of M3 All Teams
m3_engineering
1
280
HashHub会社案内「なぜ今、パブリックブロックチェーンに賭けるのか」
hashhub
3
75k
Pythonを活用したLLMによる構造的データ生成の手法と実践
brainpadpr
3
310
Webセキュリティのあるきかた
akiym
30
9.4k
【shownet.conf_】コンピューティング資源を統合した分散コンテナ基盤の進化
shownet
PRO
0
390
O'Reilly Superstream: Building a RAG App to Chat with Your Data
pamelafox
0
120
Case Study: Concurrent Counting
ennael
PRO
0
100
Hazard pointers with reference counter
ennael
PRO
0
120
OPENLOGI Company Profile
hr01
0
54k
Featured
See All Featured
How GitHub (no longer) Works
holman
311
140k
Six Lessons from altMBA
skipperchong
26
3.4k
The Psychology of Web Performance [Beyond Tellerrand 2023]
tammyeverts
38
2.1k
Designing Dashboards & Data Visualisations in Web Apps
destraynor
228
52k
RailsConf & Balkan Ruby 2019: The Past, Present, and Future of Rails at GitHub
eileencodes
131
32k
Designing on Purpose - Digital PM Summit 2013
jponch
114
6.9k
The Art of Programming - Codeland 2020
erikaheidi
51
13k
Practical Orchestrator
shlominoach
186
10k
Robots, Beer and Maslow
schacon
PRO
157
8.2k
Become a Pro
speakerdeck
PRO
24
4.9k
BBQ
matthewcrist
85
9.2k
Product Roadmaps are Hard
iamctodd
PRO
48
10k
Transcript
clda.co/sls-milano-data-pipeline End-to-end serverless data pipeline on AWS Powered by Amazon
Kinesis, Lambda, Athena, and QuickSight 10/19/2017
About Me @alex_casalboni clda.co/sls-milano-data-pipeline Computer Science Background Master in Sound
& Music Engineering Sr. SoBware Engineer & Web Developer Cloud Evangelist @ Cloud Academy
clda.co/serverless-italy Call for Speakers! clda.co/sls-milano-data-pipeline
Agenda Prerequisites Use Case / Requirements Serverless Architecture Hands-on Demo
clda.co/sls-milano-data-pipeline
Prerequisites Breve introduzione dei singoli servizi clda.co/sls-milano-data-pipeline
AWS Lambda clda.co/sls-milano-data-pipeline FaaS (FuncOon as a Service) NodeJs, Python,
Java, C#, etc. Event-driven
Amazon S3 clda.co/sls-milano-data-pipeline PB-scale Object-storage 99.999999999% durability SSE Support Can
trigger Lambda (per-object)
Amazon Kinesis (Streams) clda.co/sls-milano-data-pipeline Real-Ome data ingesOon Shards scale up/down
Almost Serverless Can trigger Lambda (batch)
Amazon Athena clda.co/sls-milano-data-pipeline InteracOve SQL queries over S3 Transparent compute
provisioning “Serverless Database” Results are stored on S3 too
Amazon QuickSight clda.co/sls-milano-data-pipeline Business Intelligence (BI) Powered by SPICE (in-memory
engine) Can read from RedshiB, RDS, Aurora, Athena, S3, EMR, etc. Monthly subscripOon (not really PAYG)
Amazon IAM clda.co/sls-milano-data-pipeline IdenOty & Access Management Very granular permissions
Free service (use it!)
Amazon Cognito clda.co/sls-milano-data-pipeline Federated idenOOes Role-based temporary credenOals Data Sync
and User Management (Pools) Complex but useful (use it!)
Use Case / Requirements Descrizione del caso d’uso clda.co/sls-milano-data-pipeline
Real-Ome Fraud DetecOon clda.co/sls-milano-data-pipeline Real-0me ingesOon of credit card transacOons
Stream processing, data validaOon, and fraud detecOon Secure storage of transacOons Real-0me analysis and reporOng
AddiOonal Requirements clda.co/sls-milano-data-pipeline Many heterogeneous event producers (temporary credenOals) Elas0c
architecture (no upfront costs & easy to scale up) Extensible architecture (plug-and-play components) A lot of data (cheap storage, please!)
Serverless Architecture Come lo implemen7amo senza ges7re server? clda.co/sls-milano-data-pipeline
Architecture Diagram clda.co/sls-milano-data-pipeline
IngesOon & Processing (simple) clda.co/sls-milano-data-pipeline
IngesOon & Processing (opOmized) clda.co/sls-milano-data-pipeline
IngesOon & Processing (real-Ome analyOcs)
Serverless Data Analysis clda.co/sls-milano-data-pipeline
Hands-on Demo Sporchiamoci un po’ le mani clda.co/sls-milano-data-pipeline
clda.co/sls-milano-data-pipeline < demo >
Grazie =) Domande? 10/19/2017 clda.co/sls-milano-data-pipeline