Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Data & AI Day 2025 - You Created a Pipeline, No...
Search
Sponsored
·
SiteGround - Reliable hosting with speed, security, and support you can count on.
→
Karn Wong
October 27, 2025
Technology
110
0
Share
Data & AI Day 2025 - You Created a Pipeline, Now What?
Karn Wong
October 27, 2025
More Decks by Karn Wong
See All by Karn Wong
AgentCon Bangkok 2026 - How to Stay Sane in the Age of Agents
kahnwong
0
49
National Coding Day 2026 - Software Evolution: The Complete Lifecycle
kahnwong
0
49
Microsoft Ignite After Party 2025 - Azure Infrastructure for Cloud Native Solutions
kahnwong
0
34
AI Community Day Bangkok 2025 - In-Browser ML/LLM Inference Ecosystem
kahnwong
0
41
Pycon Thailand 2025 - ML Model Serving Optimization with ONNX
kahnwong
0
49
MFEC x Google Cloud Thailand: Betagro Bootcamp - IaC Adoption
kahnwong
0
58
{{Ops}Ver.se - Infrastructure as Code and Business Values
kahnwong
0
100
BKK.JS #23 - Intro to WASM
kahnwong
0
50
FossAsia 2025 - Take Control of Your Own Data via Self-Hosting Through Open Source Software
kahnwong
0
120
Other Decks in Technology
See All in Technology
"SQLは書けません"から始まる データドリブン
kubell_hr
0
190
プロンプトエンジニアリングを超えて:自由と統制のあいだでつくる Platform × Context Engineering
yuriemori
0
170
Introduction to Bill One Development Engineer
sansan33
PRO
0
400
DIPS2.0データに基づく森林管理における無人航空機の利用状況
naokimuroki
0
190
LLM とプロンプトエンジニアリング/チューターを定義する / LLMs and Prompt Engineering, and Defining Tutors
ks91
PRO
0
330
GitHub Copilotを極める会 - 開発者のための活用術
findy_eventslides
6
4k
新メンバーのために、シニアエンジニアが環境を作る時代
puku0x
0
670
BIツール「Omni」の紹介 @Snowflake中部UG
sagara
0
270
🀄️ on swiftc
giginet
PRO
0
320
ふりかえりを 「あそび」にしたら、 学習が勝手に進んだ / Playful Retros Drive Learning
katoaz
0
450
AIがコードを書く時代の ジェネレーティブプログラミング
polidog
PRO
3
670
Databricksで構築するログ検索基盤とアーキテクチャ設計
cscengineer
0
150
Featured
See All Featured
The untapped power of vector embeddings
frankvandijk
2
1.7k
No one is an island. Learnings from fostering a developers community.
thoeni
21
3.7k
Stewardship and Sustainability of Urban and Community Forests
pwiseman
0
170
GraphQLとの向き合い方2022年版
quramy
50
14k
Building the Perfect Custom Keyboard
takai
2
720
ReactJS: Keep Simple. Everything can be a component!
pedronauck
666
130k
Sam Torres - BigQuery for SEOs
techseoconnect
PRO
0
240
WCS-LA-2024
lcolladotor
0
520
How to make the Groovebox
asonas
2
2.1k
ピンチをチャンスに:未来をつくるプロダクトロードマップ #pmconf2020
aki_iinuma
128
55k
Sharpening the Axe: The Primacy of Toolmaking
bcantrill
46
2.8k
The Limits of Empathy - UXLibs8
cassininazir
1
290
Transcript
You Created a Data Pipeline Now What? Data + AI
Day 2025-10-25
Karn Wong Loves optimization Has too much fun cranking out
benchmarks HashiCorp Ambassador & AWS Community Builder Blog & Portfolio karnwong.me Say hi at Bluesky @karnwong.me Independent Consultant
Data Pipeline Anatomy Source Transformation Output
Source Data Owner Access Location Updates Format / Schema
Data Transformation Author Read / Write Access Deployment Updates Active
Revision in Production Reproducibility
Output Data Destination Access Devs Systems Users Versioning Development vs
Production
Key Themes Role-Based Access Control (RBAC) Versioning Reproducibility
RBAC Assign permissions to groups Attach users to groups Easier
to setup and maintain Can assign ad-hoc & special permissions on a case-by-case basis
Monitoring & Observability Logs Visibility & Access Pipeline Failures When
Time to Resolution Business Impact
Monitoring & Observability Service-Level Agreement Pipeline Runtimes Disaster Recovery Server
Crashes Resources Consumption Future Scaling
Schema Evolution Lead Time Data Team in the Loop? Rollout
Strategy Deprecation To Rename or Not to Rename
Data Requests Time to Complete Slow Completion Time Shadow Data
Team Silos Governance: #@%#$%!@#!!!
Data Mesh & Data Stack Migration Reach out to me
during networking
Thank you 🙏 Download slides at: karnwong.me