Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Grill the data 2022 - The unsung hero behind Be...
Search
Karn Wong
October 01, 2022
Technology
0
140
Grill the data 2022 - The unsung hero behind Bestimate: data platform
Karn Wong
October 01, 2022
Tweet
Share
More Decks by Karn Wong
See All by Karn Wong
Data & AI Day 2025 - You Created a Pipeline, Now What?
kahnwong
0
79
Pycon Thailand 2025 - ML Model Serving Optimization with ONNX
kahnwong
0
30
MFEC x Google Cloud Thailand: Betagro Bootcamp - IaC Adoption
kahnwong
0
28
{{Ops}Ver.se - Infrastructure as Code and Business Values
kahnwong
0
72
BKK.JS #23 - Intro to WASM
kahnwong
0
32
FossAsia 2025 - Take Control of Your Own Data via Self-Hosting Through Open Source Software
kahnwong
0
84
Technologista 2024 - Rust for Data - What Works and What Doesn't
kahnwong
0
180
HashiCorp User Group Thailand Meetup - Self-hosting Kubernetes at Home with Terraform
kahnwong
0
120
HashiCorp User Hub Thailand #2 - Simplify Proxmox VM Management with Terraform
kahnwong
0
110
Other Decks in Technology
See All in Technology
AI連携の新常識! 話題のMCPをはじめて学ぶ!
makoakiba
0
180
AIがコードを書いてくれるなら、新米エンジニアは何をする? / komekaigi2025
nkzn
25
17k
AIエージェントは「使う」だけじゃなくて「作る」時代! 〜最新フレームワークで楽しく開発入門しよう〜
minorun365
PRO
3
200
可観測性は開発環境から、開発環境にもオブザーバビリティ導入のススメ
layerx
PRO
4
2.7k
最近読んで良かった本 / Yokohama North Meetup #10
mktakuya
0
670
プロダクト開発と社内データ活用での、BI×AIの現在地 / Data_Findy
sansan_randd
1
810
DSPy入門
tomehirata
6
890
The Twin Mandate of Observability
charity
0
150
[Journal club] Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces
keio_smilab
PRO
0
110
累計5000万DLサービスの裏側 – LINEマンガのKotlinで挑む大規模 Server-side ETLの最適化
ldf_tech
0
180
Amazon Athena で JSON・Parquet・Iceberg のデータを検索し、性能を比較してみた
shigeruoda
1
300
AIでデータ活用を加速させる取り組み / Leveraging AI to accelerate data utilization
okiyuki99
6
1.7k
Featured
See All Featured
A better future with KSS
kneath
239
18k
The Language of Interfaces
destraynor
162
25k
Into the Great Unknown - MozCon
thekraken
40
2.1k
How GitHub (no longer) Works
holman
315
140k
Creating an realtime collaboration tool: Agile Flush - .NET Oxford
marcduiker
34
2.3k
4 Signs Your Business is Dying
shpigford
186
22k
Statistics for Hackers
jakevdp
799
220k
Raft: Consensus for Rubyists
vanstee
140
7.2k
"I'm Feeling Lucky" - Building Great Search Experiences for Today's Users (#IAC19)
danielanewman
231
22k
GitHub's CSS Performance
jonrohan
1032
470k
Making Projects Easy
brettharned
120
6.4k
Git: the NoSQL Database
bkeepers
PRO
431
66k
Transcript
The unsung hero behind Bestimate: data platform Grill the data
2022
➔ Who is Baania? ➔ What is Bestimate? ➔ What
data platform is right for Bestimate? ➔ What components do we need? Agenda
Intro We are a proptech company 🏡 😎 Our goal
is to provide data-centric solutions to all parties in Thailand’s property industry Marketplace Data Center Home Flipping
Intro
None
None
None
Why we need data platform? Project vs product development in
a nutshell 🥜 🐿 01 | Add new columns 02 | Update feature engineering transformations 03 | Roll out new model version 04 | Oops I made a mistake, how do I roll back?
Data platform 101 • Automated workflows • Maintainability • Collaboration
Data lake 01 • Unlimited storage • Big data friendly
• Cheaper than maintaining a database • Single source of truth Image: AWS
Data catalog 02 • Find which tables have attributes you
want! • Yay data lineage 😄 Image: Amundsen Image: Marquez
Task orchestrator 03 • Say goodbye to running stuff manually!
• Not enough RAM? Cloud compute to the rescue! • Status reports are bae Image: Fivetran
Model registry 04 Image: Databricks
CI/CD 05 Image: Squadcast
Monitoring 06 Image: Banzai Cloud
Wrap up Data platform: • Reduces onboarding time to develop
features • Enables fast updates and deployment • Provides service status monitoring • Leaves trails for auditing
We are hiring! https://baaniathailand.com/career/
Thank you! Karn Wong Head of Platform Engineering at Baania
@kahnwong