Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Back To The Future: Emerging Trends in Data Eng...
Search
Ananth Packkildurai
October 08, 2021
Technology
0
1.3k
Back To The Future: Emerging Trends in Data Engineering
Data Engineering Trends 2021 from
www.dataengineeringweekly.com
Ananth Packkildurai
October 08, 2021
Tweet
Share
More Decks by Ananth Packkildurai
See All by Ananth Packkildurai
Data Contracts & Domain Ownership
vananth22
0
130
Data Catalogs - Rebuild the Broken Promise
vananth22
0
88
Functional Data Engineering - A Blueprint for adopting functional principles in data pipeline
vananth22
0
590
Murron: A Reliable Monitoring Pipeline
vananth22
0
420
The_journey_towards_Pinot.pdf
vananth22
0
240
Reliable_Event_Pipeline___scale.pdf
vananth22
0
220
Operating Data Pipeline with Airflow @ Slack
vananth22
1
2.6k
Streaming data pipelines @ Slack
vananth22
2
2.5k
measuring api performance using druid
vananth22
0
1.7k
Other Decks in Technology
See All in Technology
AI開発の落とし穴 〜馬には乗ってみよAIには添うてみよ〜
sansantech
PRO
10
5.4k
入社1ヶ月でデータパイプライン講座を作った話
waiwai2111
1
180
システムのアラート調査をサポートするAI Agentの紹介/Introduction to an AI Agent for System Alert Investigation
taddy_919
0
450
Mosaic AI Gatewayでコーディングエージェントを配るための運用Tips / JEDAI 2026 新春 Meetup! AIコーディング特集
genda
0
130
【NGK2026S】日本株のシステムトレードに入門してみた
kazuhitotakahashi
0
220
しろおびセキュリティへ ようこそ
log0417
0
190
Sansan Engineering Unit 紹介資料
sansan33
PRO
1
3.8k
GCASアップデート(202510-202601)
techniczna
0
200
エンジニアとマネジメントの距離/Engineering and Management
ikuodanaka
3
680
Zephyr RTOS の発表をOpen Source Summit Japan 2025で行った件
iotengineer22
0
290
AI開発をスケールさせるデータ中心の仕組みづくり
kzykmyzw
0
180
【インシデント入門】サイバー攻撃を受けた現場って何してるの?
shumei_ito
0
1.2k
Featured
See All Featured
I Don’t Have Time: Getting Over the Fear to Launch Your Podcast
jcasabona
34
2.6k
Darren the Foodie - Storyboard
khoart
PRO
2
2.3k
Context Engineering - Making Every Token Count
addyosmani
9
630
The untapped power of vector embeddings
frankvandijk
1
1.6k
A Tale of Four Properties
chriscoyier
162
24k
Code Reviewing Like a Champion
maltzj
527
40k
Lightning talk: Run Django tests with GitHub Actions
sabderemane
0
110
Reality Check: Gamification 10 Years Later
codingconduct
0
2k
Imperfection Machines: The Place of Print at Facebook
scottboms
269
14k
YesSQL, Process and Tooling at Scale
rocio
174
15k
XXLCSS - How to scale CSS and keep your sanity
sugarenia
249
1.3M
How to Think Like a Performance Engineer
csswizardry
28
2.4k
Transcript
Emerging Trends in Data Engineering
Principal Data Engineer @ Zendesk www.dataengineeringweekly.com - weekly data engineering
newsletter @ananthdurai
Data Practitioners life Infinite Loop of Sadness
#1: Data Discovery & Metadata Management
Open Source Data Discovery Tools ➔ Amundsen - https://www.amundsen.io/ ➔
Marquez - https://marquezproject.github.io/ marquez/ ➔ DataHub - https://github.com/linkedin/datah ub
#2 Data Mesh & Domain Ownership
None
Catalog The Mes(h)s
#3 Data Observability
Data Observability ➔ DBT ➔ Great Expectations ➔ Deeque ➔
Airflow ➔ Dagster ➔ Prefect
#4 Data LakeHouse
Data LakeHouse ➔ Apache Iceberg ➔ Delta Lake ➔ Apache
Hudi
#5 Modern Data Stack
Modern Data Stack ➔ Extraction & Load: AirByte, FiveTran, RudderStack
etc., ➔ Data Transformation: DBT, Dataform ➔ Data Warehouse: BigQuery, Redshift, Snowflake etc., ➔ Data Governance: Acryl data, Stemma, Atlan etc., ➔ BI: Looker, Mode, Metabase etc.,
#6 Industrialized ML
Industrialized ML ➔ Tensorflow ➔ PyTorch ➔ Transformer Neural Network
& Trunk Model ➔ TPU
#7 Diversity, Privacy, AI Ethics
Diversity, Privacy & AI Ethics ➔ Explainable AI ➔ Privacy
preserve modeling ➔ AI model bias.
Emerging trends in Data Engineering 1. Data Discovery & Metadata
Management 2. Data Mesh & Domain Ownership 3. Data Observability 4. Data LakeHouse 5. Modern Data Stack 6. Industrialized ML 7. Diversity, Privacy & AI Ethics
Thank You!