Data & AI Day 2025 - You Created a Pipeline, Now What?
Karn Wong
October 27, 2025
Transcript
You Created a Data Pipeline, Now What? Data + AI Day, 2025-10-25
Karn Wong: Loves optimization. Has too much fun cranking out benchmarks. HashiCorp Ambassador & AWS Community Builder. Blog & Portfolio: karnwong.me. Say hi at Bluesky: @karnwong.me. Independent Consultant.
Data Pipeline Anatomy: Source, Transformation, Output
Source Data: Owner, Access, Location, Updates, Format / Schema
Data Transformation: Author, Read / Write Access, Deployment, Updates, Active Revision in Production, Reproducibility
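One way to make "Active Revision in Production" and "Reproducibility" checkable is to fingerprint every run from the code revision, the parameters, and a hash of the input. The sketch below is only an illustration under that assumption; the function name `run_fingerprint` and its arguments are hypothetical and do not come from the talk.

```python
import hashlib
import json


def run_fingerprint(revision: str, params: dict, input_bytes: bytes) -> str:
    """Return a stable hash identifying one pipeline run (hypothetical helper).

    Two runs with the same code revision, parameters, and input data get the
    same fingerprint, so a differing output points at a reproducibility problem.
    """
    payload = json.dumps(
        {
            "revision": revision,  # e.g. a git commit hash
            "params": params,
            "input_sha256": hashlib.sha256(input_bytes).hexdigest(),
        },
        sort_keys=True,  # key order must not change the fingerprint
    )
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()
```

Storing this value next to each output makes it possible to answer which revision produced a given dataset.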
Output Data: Destination, Access (Devs, Systems, Users), Versioning, Development vs Production
Key Themes: Role-Based Access Control (RBAC), Versioning, Reproducibility
RBAC: Assign permissions to groups. Attach users to groups. Easier to set up and maintain. Can assign ad-hoc & special permissions on a case-by-case basis.
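The RBAC points above can be sketched in a few lines: permissions attach to groups, users attach to groups, and ad-hoc grants are kept as a separate per-user exception list. This is a minimal in-memory illustration, not any particular platform's API; the class, group names, and permission strings are all made up for the example.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class AccessPolicy:
    """Group-based permissions with optional per-user exceptions (illustrative)."""

    group_permissions: dict[str, set[str]] = field(default_factory=dict)
    user_groups: dict[str, set[str]] = field(default_factory=dict)
    adhoc_permissions: dict[str, set[str]] = field(default_factory=dict)

    def grant_to_group(self, group: str, permission: str) -> None:
        self.group_permissions.setdefault(group, set()).add(permission)

    def add_user_to_group(self, user: str, group: str) -> None:
        self.user_groups.setdefault(user, set()).add(group)

    def grant_adhoc(self, user: str, permission: str) -> None:
        # Case-by-case exception, tracked apart from group grants
        self.adhoc_permissions.setdefault(user, set()).add(permission)

    def permissions_for(self, user: str) -> set[str]:
        # Effective permissions = ad-hoc grants plus everything from the user's groups
        perms = set(self.adhoc_permissions.get(user, set()))
        for group in self.user_groups.get(user, set()):
            perms |= self.group_permissions.get(group, set())
        return perms

    def is_allowed(self, user: str, permission: str) -> bool:
        return permission in self.permissions_for(user)


# Hypothetical groups, users, and permission names
policy = AccessPolicy()
policy.grant_to_group("analysts", "read:output")
policy.grant_to_group("data_engineers", "read:source")
policy.grant_to_group("data_engineers", "write:output")
policy.add_user_to_group("alice", "analysts")
policy.add_user_to_group("bob", "data_engineers")
policy.grant_adhoc("alice", "read:source")  # special case for one user
```

Keeping ad-hoc grants in their own mapping makes exceptions easy to list and review later.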
Monitoring & Observability: Logs (Visibility & Access), Pipeline Failures (When, Time to Resolution, Business Impact)
Monitoring & Observability: Service-Level Agreement, Pipeline Runtimes, Disaster Recovery, Server Crashes, Resources Consumption, Future Scaling
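As a small illustration of tying pipeline runtimes to a service-level agreement, the sketch below flags pipelines whose latest runtime exceeds an agreed limit. The function name, the pipeline names, and the two-hour threshold are assumptions made for the example, not figures from the talk.

```python
from __future__ import annotations

from datetime import timedelta


def sla_breaches(runtimes: dict[str, timedelta], sla: timedelta) -> list[str]:
    """Return the names of pipelines whose runtime exceeds the SLA, sorted by name."""
    return sorted(name for name, runtime in runtimes.items() if runtime > sla)


# Hypothetical runtimes from the latest scheduled runs
latest_runtimes = {
    "daily_sales": timedelta(minutes=45),
    "customer_360": timedelta(hours=3),
    "inventory_sync": timedelta(hours=2),
}
breached = sla_breaches(latest_runtimes, sla=timedelta(hours=2))
```

A runtime exactly at the limit is treated as within the SLA here; whether that is the right boundary depends on how the agreement is worded.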
Schema Evolution: Lead Time, Data Team in the Loop?, Rollout Strategy, Deprecation, To Rename or Not to Rename
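One possible rollout strategy for a rename is to publish both the old and the new field name for a deprecation window, then drop the old one once consumers have migrated. The sketch below assumes records are plain dictionaries; the field names and the `apply_renames` helper are hypothetical.

```python
from __future__ import annotations

# Hypothetical mapping of deprecated field name -> new field name
RENAMES = {"cust_id": "customer_id"}


def apply_renames(record: dict, renames: dict[str, str], keep_deprecated: bool = True) -> dict:
    """Return a copy of the record with renamed fields added.

    With keep_deprecated=True both names are emitted (deprecation window).
    With keep_deprecated=False only the new names remain (after cutover).
    """
    out = dict(record)
    for old, new in renames.items():
        if old in out:
            out.setdefault(new, out[old])  # do not overwrite a value already under the new name
            if not keep_deprecated:
                del out[old]
    return out


during_window = apply_renames({"cust_id": 7, "amount": 120}, RENAMES)
after_cutover = apply_renames({"cust_id": 7, "amount": 120}, RENAMES, keep_deprecated=False)
```

Emitting both names costs some storage but gives downstream teams lead time to switch before the old field disappears.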
Data Requests: Time to Complete, Slow Completion Time, Shadow Data Team, Silos, Governance: #@%#$%!@#!!!
Data Mesh & Data Stack Migration: Reach out to me during networking
Thank you 🙏 Download slides at: karnwong.me