Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Machine Learning on Production
Search
Sponsored
·
Ship Features Fearlessly
Turn features on and off without deploys. Used by thousands of Ruby developers.
→
Eko Kurniawan Khannedy
March 18, 2016
Technology
0
130
Machine Learning on Production
Machine Learning on Production
Eko Kurniawan Khannedy
March 18, 2016
Tweet
Share
More Decks by Eko Kurniawan Khannedy
See All by Eko Kurniawan Khannedy
Monolith to Event-Driven Microservices
khannedy
1
270
Refactoring
khannedy
0
350
Multi-Datacenter Kafka at Blibli.com
khannedy
2
1.5k
QA Tools - Research and Development
khannedy
0
290
Reactive Puzzle
khannedy
0
210
Event-Driven Architecture
khannedy
1
2k
Resilience Engineering with Hystrix and Spring
khannedy
1
570
Mocking for Unit Test using Mockito
khannedy
1
340
Centralized Configuration using Consul and Spring Cloud
khannedy
2
710
Other Decks in Technology
See All in Technology
入社1ヶ月でデータパイプライン講座を作った話
waiwai2111
1
210
Data Hubグループ 紹介資料
sansan33
PRO
0
2.7k
データ民主化のための LLM 活用状況と課題紹介(IVRy の場合)
wxyzzz
2
600
2026年、サーバーレスの現在地 -「制約と戦う技術」から「当たり前の実行基盤」へ- /serverless2026
slsops
2
150
KubeCon + CloudNativeCon NA ‘25 Recap, Extensibility: Gateway API / NRI
ladicle
0
170
予期せぬコストの急増を障害のように扱う――「コスト版ポストモーテム」の導入とその後の改善
muziyoshiz
0
770
What happened to RubyGems and what can we learn?
mikemcquaid
0
200
【5分でわかる】セーフィー エンジニア向け会社紹介
safie_recruit
0
41k
日本語テキストと音楽の対照学習の技術とその応用
lycorptech_jp
PRO
1
400
あたらしい上流工程の形。 0日導入からはじめるAI駆動PM
kumaiu
4
700
プロダクト成長を支える開発基盤とスケールに伴う課題
yuu26
1
280
2人で作ったAIダッシュボードが、開発組織の次の一手を照らした話― Cursor × SpecKit × 可視化の実践 ― Qiita AI Summit
noalisaai
1
340
Featured
See All Featured
WCS-LA-2024
lcolladotor
0
440
Building Adaptive Systems
keathley
44
2.9k
Distributed Sagas: A Protocol for Coordinating Microservices
caitiem20
333
22k
Effective software design: The role of men in debugging patriarchy in IT @ Voxxed Days AMS
baasie
0
220
Reflections from 52 weeks, 52 projects
jeffersonlam
356
21k
The SEO identity crisis: Don't let AI make you average
varn
0
62
Build The Right Thing And Hit Your Dates
maggiecrowley
38
3k
16th Malabo Montpellier Forum Presentation
akademiya2063
PRO
0
46
Design and Strategy: How to Deal with People Who Don’t "Get" Design
morganepeng
133
19k
Ethics towards AI in product and experience design
skipperchong
2
190
AI in Enterprises - Java and Open Source to the Rescue
ivargrimstad
0
1.1k
Navigating the moral maze — ethical principles for Al-driven product design
skipperchong
2
240
Transcript
MACHINE LEARNING ON PRODUCTION EKO KURNIAWAN KHANNEDY
MACHINE LEARNING ON PRODUCTION EKO KURNIAWAN KHANNEDY ▸ Principal Software
Development Engineer at blibli.com ▸ Part of Research and Development Team ▸
[email protected]
HAL YANG PALING SULIT ITU ADALAH MEMBAWA MACHINE LEARNING KE
PRODUCTION …. MACHINE LEARNING ON PRODUCTION
MACHINE LEARNING ON PRODUCTION AGENDA ▸ The Hard Part ▸
Best Practice ▸ Machine Learning in blibli.com
THE HARD PART MACHINE LEARNING ON PRODUCTION
MACHINE LEARNING ON PRODUCTION DATA ▸ Data Too Big ▸
Unstructured Data ▸ Document Oriented and Master Detail Data ▸ Continuous Data ▸ Imbalance Data ▸ Wild Data
MACHINE LEARNING ON PRODUCTION PREPROCESSING ▸ Feature Extraction ▸ Too
Many Features Extraction Makes Process Too Long
MACHINE LEARNING ON PRODUCTION TRAINING ▸ Batch Training ▸ Sequential
Algorithm ▸ Validation
BEST PRACTICE MACHINE LEARNING ON PRODUCTION
DATA
MACHINE LEARNING ON PRODUCTION DATA TOO BIG ▸ Load data
to memory. ▸ Streaming the datasource. ▸ Split data into multiple nodes. ▸ Use memory-file database.
MACHINE LEARNING ON PRODUCTION UNSTRUCTURED DATA ▸ Analyse Your Data
▸ Find Characteristic of Your Data ▸ Find Best Approachment for that case.
MACHINE LEARNING ON PRODUCTION DOCUMENT ORIENTED AND MASTER DETAIL DATA
▸ Analyse Your Data ▸ Find the Best Way to Treat The Data
MACHINE LEARNING ON PRODUCTION CONTINUOUS DATA ▸ Wide the range
that use in normalization process. ▸ Consider it as a missing value.
MACHINE LEARNING ON PRODUCTION IMBALANCE DATA ▸ Down Sampling. ▸
Up Sampling.
MACHINE LEARNING ON PRODUCTION WILD DATA ▸ Use Default Value.
▸ Use Average Value. ▸ Use Machine Learning to Predict Missing Value.
PREPROCESSING
MACHINE LEARNING ON PRODUCTION FEATURE EXTRACTION ▸ Add as Many
Facts as Possible ▸ Remove Irrelevant Feature
MACHINE LEARNING ON PRODUCTION TOO MANY FEATURES EXTRACTION MAKES PROCESS
TOO LONG ▸ Use Non-Blocking Process ▸ Use Event Driven Process ▸ Use Parallel Process
TRAINING
MACHINE LEARNING ON PRODUCTION BATCH TRAINING ▸ Use Real Time
Training ▸ Scheduled Training
MACHINE LEARNING ON PRODUCTION SEQUENTIAL ALGORITHM ▸ Distributed The Data
▸ Parallel The Algorithm
MACHINE LEARNING ON PRODUCTION VALIDATION ▸ Split Validation ▸ Cross
Validation ▸ Parallel The Validation
MACHINE LEARNING IN BLIBLI.COM MACHINE LEARNING ON PRODUCTION
MACHINE LEARNING ON PRODUCTION FRAUD PREVENTION PLATFORM RESTFULL MASTER DATA
CLIENT MACHINE LEARNING ENGINE PREPROCESSING ENGINE THIRD PARTY SERVICE
MACHINE LEARNING ON PRODUCTION MACHINE LEARNING ENGINE RESTFULL METADATA DATA
CLIENT TRAINING ENGINE TRAINING DATA CLASSIFICATION ENGINE
THANKS