Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
End-to-end serverless data pipeline on AWS
Search
Alex Casalboni
October 16, 2017
Technology
1
95
End-to-end serverless data pipeline on AWS
Use case powered by Amazon Kinesis, Lambda, Athena, and QuickSight.
Alex Casalboni
October 16, 2017
Tweet
Share
More Decks by Alex Casalboni
See All by Alex Casalboni
Data-driven & cost-aware performance tuning for AWS Lambda
alexcasalboni
0
220
AWS Lambda function performance tuning - AWS re:Invent 2019
alexcasalboni
1
1.1k
Supercharge product development with cloud best practices - Web Summit 2019
alexcasalboni
0
92
AWS data services for machine learning - AWS Innovate Online
alexcasalboni
0
61
Advanced Serverless Architectural Patterns on AWS [Devoxx Poland]
alexcasalboni
0
240
Building a Modern Data Platform in the Cloud [AWS Dev Day @ Kyiv]
alexcasalboni
0
120
How to build a global serverless service [AWS Dev Day @ Kyiv]
alexcasalboni
0
58
Become a machine learning developer using AWS Machine Learning Services [AWS Summit @ Warsaw]
alexcasalboni
0
120
Getting started with AWS for Developers [Collision @ Toronto]
alexcasalboni
0
75
Other Decks in Technology
See All in Technology
pg_bigmをRustで実装する(第50回PostgreSQLアンカンファレンス@オンライン 発表資料)
shinyakato_
0
150
スケールし続ける事業とサービスを支える組織とアーキテクチャの生き残り戦略 / The survival strategy for Money Forward’s engineering.
moneyforward
0
240
終了の危機にあった15年続くWebサービスを全力で存続させる - phpcon2024
yositosi
28
25k
大規模言語モデルとそのソフトウェア開発に向けた応用 (2024年版)
kazato
2
450
20241220_S3 tablesの使い方を検証してみた
handy
4
870
エンジニアリングマネージャー視点での、自律的なスケーリングを実現するFASTという選択肢 / RSGT2025
yoshikiiida
4
2.9k
PHPerのための計算量入門/Complexity101 for PHPer
hanhan1978
6
1.5k
Fearsome File Formats
ange
0
550
完全自律型AIエージェントとAgentic Workflow〜ワークフロー構築という現実解
pharma_x_tech
0
110
.NET AspireでAzure Functionsやクラウドリソースを統合する
tsubakimoto_s
0
140
.NET 9 のパフォーマンス改善
nenonaninu
0
2.2k
Storage Browser for Amazon S3
miu_crescent
1
350
Featured
See All Featured
Music & Morning Musume
bryan
46
6.3k
The Cost Of JavaScript in 2023
addyosmani
46
7.2k
CSS Pre-Processors: Stylus, Less & Sass
bermonpainter
356
29k
Six Lessons from altMBA
skipperchong
27
3.5k
Fantastic passwords and where to find them - at NoRuKo
philnash
50
2.9k
Practical Orchestrator
shlominoach
186
10k
VelocityConf: Rendering Performance Case Studies
addyosmani
327
24k
GraphQLとの向き合い方2022年版
quramy
44
13k
The Psychology of Web Performance [Beyond Tellerrand 2023]
tammyeverts
45
2.3k
What's in a price? How to price your products and services
michaelherold
244
12k
Large-scale JavaScript Application Architecture
addyosmani
510
110k
YesSQL, Process and Tooling at Scale
rocio
170
14k
Transcript
clda.co/sls-milano-data-pipeline End-to-end serverless data pipeline on AWS Powered by Amazon
Kinesis, Lambda, Athena, and QuickSight 10/19/2017
About Me @alex_casalboni clda.co/sls-milano-data-pipeline Computer Science Background Master in Sound
& Music Engineering Sr. SoBware Engineer & Web Developer Cloud Evangelist @ Cloud Academy
clda.co/serverless-italy Call for Speakers! clda.co/sls-milano-data-pipeline
Agenda Prerequisites Use Case / Requirements Serverless Architecture Hands-on Demo
clda.co/sls-milano-data-pipeline
Prerequisites Breve introduzione dei singoli servizi clda.co/sls-milano-data-pipeline
AWS Lambda clda.co/sls-milano-data-pipeline FaaS (FuncOon as a Service) NodeJs, Python,
Java, C#, etc. Event-driven
Amazon S3 clda.co/sls-milano-data-pipeline PB-scale Object-storage 99.999999999% durability SSE Support Can
trigger Lambda (per-object)
Amazon Kinesis (Streams) clda.co/sls-milano-data-pipeline Real-Ome data ingesOon Shards scale up/down
Almost Serverless Can trigger Lambda (batch)
Amazon Athena clda.co/sls-milano-data-pipeline InteracOve SQL queries over S3 Transparent compute
provisioning “Serverless Database” Results are stored on S3 too
Amazon QuickSight clda.co/sls-milano-data-pipeline Business Intelligence (BI) Powered by SPICE (in-memory
engine) Can read from RedshiB, RDS, Aurora, Athena, S3, EMR, etc. Monthly subscripOon (not really PAYG)
Amazon IAM clda.co/sls-milano-data-pipeline IdenOty & Access Management Very granular permissions
Free service (use it!)
Amazon Cognito clda.co/sls-milano-data-pipeline Federated idenOOes Role-based temporary credenOals Data Sync
and User Management (Pools) Complex but useful (use it!)
Use Case / Requirements Descrizione del caso d’uso clda.co/sls-milano-data-pipeline
Real-Ome Fraud DetecOon clda.co/sls-milano-data-pipeline Real-0me ingesOon of credit card transacOons
Stream processing, data validaOon, and fraud detecOon Secure storage of transacOons Real-0me analysis and reporOng
AddiOonal Requirements clda.co/sls-milano-data-pipeline Many heterogeneous event producers (temporary credenOals) Elas0c
architecture (no upfront costs & easy to scale up) Extensible architecture (plug-and-play components) A lot of data (cheap storage, please!)
Serverless Architecture Come lo implemen7amo senza ges7re server? clda.co/sls-milano-data-pipeline
Architecture Diagram clda.co/sls-milano-data-pipeline
IngesOon & Processing (simple) clda.co/sls-milano-data-pipeline
IngesOon & Processing (opOmized) clda.co/sls-milano-data-pipeline
IngesOon & Processing (real-Ome analyOcs)
Serverless Data Analysis clda.co/sls-milano-data-pipeline
Hands-on Demo Sporchiamoci un po’ le mani clda.co/sls-milano-data-pipeline
clda.co/sls-milano-data-pipeline < demo >
Grazie =) Domande? 10/19/2017 clda.co/sls-milano-data-pipeline