Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
End-to-end automated data science process using...
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Keerthi
October 31, 2018
Education
2
260
End-to-end automated data science process using Airflow.
End-to-end automated data science process using Airflow.
Keerthi
October 31, 2018
Tweet
Share
Other Decks in Education
See All in Education
次期バージョン 14.5.1 Early Access Program が始まりました
harunakano
1
120
自己紹介 / who-am-i
yasulab
6
6.4k
2025-12-19-LT
takesection
0
110
【ベテランCTOからのメッセージ】AIとか組織とかキャリアとか気になることはあるけどさ、個人の技術力から目を背けないでやっていきましょうよ
netmarkjp
2
3.9k
Future Trends and Review - Lecture 12 - Web Technologies (1019888BNR)
signer
PRO
0
3.3k
くまのココロンともぐらのロジ
frievea
0
190
AIで日本はどう進化する? 〜キミが生きる2035年の地図〜
behomazn
0
130
IHLヘルスケアリーダーシップ研究会17期説明資料
ihlhealthcareleadership
0
2k
TinyGoをWebブラウザで動かすための方法+アルファ_20260201
masakiokuda
2
280
Adobe Express
matleenalaakso
2
8.2k
滑空スポーツ講習会2025(実技講習)EMFT講習 実施要領/JSA EMFT 2025 procedure
jsaseminar
0
140
多様なメンター、多様な基準
yasulab
6
19k
Featured
See All Featured
Claude Code のすすめ
schroneko
67
220k
The Invisible Side of Design
smashingmag
302
51k
Embracing the Ebb and Flow
colly
88
5k
Practical Tips for Bootstrapping Information Extraction Pipelines
honnibal
25
1.8k
GraphQLの誤解/rethinking-graphql
sonatard
75
11k
Money Talks: Using Revenue to Get Sh*t Done
nikkihalliwell
0
180
Optimising Largest Contentful Paint
csswizardry
37
3.6k
Future Trends and Review - Lecture 12 - Web Technologies (1019888BNR)
signer
PRO
0
3.3k
Leading Effective Engineering Teams in the AI Era
addyosmani
9
1.7k
Docker and Python
trallard
47
3.8k
Facilitating Awesome Meetings
lara
57
6.8k
Chasing Engaging Ingredients in Design
codingconduct
0
130
Transcript
End-to-end automated data science process using Airflow. Evive
About Evive • Data Driven benefit navigator • Founded in
2006 • 400 + employees
Evive Data 15 2.5M 400 Data team Evive Employee Total
Active members
Data Usage 500+GB 50+ 30+ Total data per day Number
of data channels Number of models running daily
Why Airflow THE WORKFLOW Ingestion Merge data from multiple sources
Standardise Verify Publish
Airflow workers Data Sources Scheduler Database
Airflow Architecture
Functionalities • Scheduling • Dependency management • Error recovery •
Monitoring • Versioning • Mailing and alerting
Creating a dag and an operator
Scheduling tasks
File sensor • Operator that listens to a particular directory
and triggers the downstream task once the file lands on the corresponding directory. • Pynotify as operator.
Monitoring using airflow dashboard
Versioning • Versioning can be easily incorporated in airflow as
the entire dag execution happens as one instance. • You can version your data as well as model outputs.
Mailing and alerting system
Future work • Integrating with the existing database architecture and
ETL pipeline • Airflow Kubernetes executors