Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Machine Learning on Production
Search
Eko Kurniawan Khannedy
March 18, 2016
Technology
0
130
Machine Learning on Production
Machine Learning on Production
Eko Kurniawan Khannedy
March 18, 2016
Tweet
Share
More Decks by Eko Kurniawan Khannedy
See All by Eko Kurniawan Khannedy
Monolith to Event-Driven Microservices
khannedy
1
250
Refactoring
khannedy
0
320
Multi-Datacenter Kafka at Blibli.com
khannedy
2
1.5k
QA Tools - Research and Development
khannedy
0
280
Reactive Puzzle
khannedy
0
200
Event-Driven Architecture
khannedy
1
1.9k
Resilience Engineering with Hystrix and Spring
khannedy
1
560
Mocking for Unit Test using Mockito
khannedy
1
340
Centralized Configuration using Consul and Spring Cloud
khannedy
2
680
Other Decks in Technology
See All in Technology
プロダクトエンジニアリング組織への歩み、その現在地 / Our journey to becoming a product engineering organization
hiro_torii
0
110
キャディでのApache Iceberg, Trino採用事例 -Apache Iceberg and Trino Usecase in CADDi--
caddi_eng
0
170
In Praise of "Normal" Engineers (LDX3)
charity
3
1.2k
GeminiとNotebookLMによる金融実務の業務革新
abenben
0
170
CSS、JSをHTMLテンプレートにまとめるフロントエンド戦略
d120145
0
240
標準技術と独自システムで作る「つらくない」SaaS アカウント管理 / Effortless SaaS Account Management with Standard Technologies & Custom Systems
yuyatakeyama
2
1.1k
2年でここまで成長!AWSで育てたAI Slack botの軌跡
iwamot
PRO
2
460
Amazon ECS & AWS Fargate 運用アーキテクチャ2025 / Amazon ECS and AWS Fargate Ops Architecture 2025
iselegant
16
4.8k
“社内”だけで完結していた私が、AWS Community Builder になるまで
nagisa53
1
240
OpenHands🤲にContributeしてみた
kotauchisunsun
0
250
Wasm元年
askua
0
110
低レイヤを知りたいPHPerのためのCコンパイラ作成入門 完全版 / Building a C Compiler for PHPers Who Want to Dive into Low-Level Programming - Expanded
tomzoh
2
580
Featured
See All Featured
JavaScript: Past, Present, and Future - NDC Porto 2020
reverentgeek
48
5.4k
Designing for Performance
lara
609
69k
How to train your dragon (web standard)
notwaldorf
92
6.1k
CSS Pre-Processors: Stylus, Less & Sass
bermonpainter
357
30k
Designing Dashboards & Data Visualisations in Web Apps
destraynor
231
53k
Optimising Largest Contentful Paint
csswizardry
37
3.3k
Learning to Love Humans: Emotional Interface Design
aarron
273
40k
A better future with KSS
kneath
239
17k
The Cost Of JavaScript in 2023
addyosmani
51
8.4k
Building an army of robots
kneath
306
45k
Music & Morning Musume
bryan
46
6.6k
The Psychology of Web Performance [Beyond Tellerrand 2023]
tammyeverts
48
2.8k
Transcript
MACHINE LEARNING ON PRODUCTION EKO KURNIAWAN KHANNEDY
MACHINE LEARNING ON PRODUCTION EKO KURNIAWAN KHANNEDY ▸ Principal Software
Development Engineer at blibli.com ▸ Part of Research and Development Team ▸
[email protected]
HAL YANG PALING SULIT ITU ADALAH MEMBAWA MACHINE LEARNING KE
PRODUCTION …. MACHINE LEARNING ON PRODUCTION
MACHINE LEARNING ON PRODUCTION AGENDA ▸ The Hard Part ▸
Best Practice ▸ Machine Learning in blibli.com
THE HARD PART MACHINE LEARNING ON PRODUCTION
MACHINE LEARNING ON PRODUCTION DATA ▸ Data Too Big ▸
Unstructured Data ▸ Document Oriented and Master Detail Data ▸ Continuous Data ▸ Imbalance Data ▸ Wild Data
MACHINE LEARNING ON PRODUCTION PREPROCESSING ▸ Feature Extraction ▸ Too
Many Features Extraction Makes Process Too Long
MACHINE LEARNING ON PRODUCTION TRAINING ▸ Batch Training ▸ Sequential
Algorithm ▸ Validation
BEST PRACTICE MACHINE LEARNING ON PRODUCTION
DATA
MACHINE LEARNING ON PRODUCTION DATA TOO BIG ▸ Load data
to memory. ▸ Streaming the datasource. ▸ Split data into multiple nodes. ▸ Use memory-file database.
MACHINE LEARNING ON PRODUCTION UNSTRUCTURED DATA ▸ Analyse Your Data
▸ Find Characteristic of Your Data ▸ Find Best Approachment for that case.
MACHINE LEARNING ON PRODUCTION DOCUMENT ORIENTED AND MASTER DETAIL DATA
▸ Analyse Your Data ▸ Find the Best Way to Treat The Data
MACHINE LEARNING ON PRODUCTION CONTINUOUS DATA ▸ Wide the range
that use in normalization process. ▸ Consider it as a missing value.
MACHINE LEARNING ON PRODUCTION IMBALANCE DATA ▸ Down Sampling. ▸
Up Sampling.
MACHINE LEARNING ON PRODUCTION WILD DATA ▸ Use Default Value.
▸ Use Average Value. ▸ Use Machine Learning to Predict Missing Value.
PREPROCESSING
MACHINE LEARNING ON PRODUCTION FEATURE EXTRACTION ▸ Add as Many
Facts as Possible ▸ Remove Irrelevant Feature
MACHINE LEARNING ON PRODUCTION TOO MANY FEATURES EXTRACTION MAKES PROCESS
TOO LONG ▸ Use Non-Blocking Process ▸ Use Event Driven Process ▸ Use Parallel Process
TRAINING
MACHINE LEARNING ON PRODUCTION BATCH TRAINING ▸ Use Real Time
Training ▸ Scheduled Training
MACHINE LEARNING ON PRODUCTION SEQUENTIAL ALGORITHM ▸ Distributed The Data
▸ Parallel The Algorithm
MACHINE LEARNING ON PRODUCTION VALIDATION ▸ Split Validation ▸ Cross
Validation ▸ Parallel The Validation
MACHINE LEARNING IN BLIBLI.COM MACHINE LEARNING ON PRODUCTION
MACHINE LEARNING ON PRODUCTION FRAUD PREVENTION PLATFORM RESTFULL MASTER DATA
CLIENT MACHINE LEARNING ENGINE PREPROCESSING ENGINE THIRD PARTY SERVICE
MACHINE LEARNING ON PRODUCTION MACHINE LEARNING ENGINE RESTFULL METADATA DATA
CLIENT TRAINING ENGINE TRAINING DATA CLASSIFICATION ENGINE
THANKS