Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Machine Learning on Production
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Eko Kurniawan Khannedy
March 18, 2016
Technology
130
0
Share
Machine Learning on Production
Machine Learning on Production
Eko Kurniawan Khannedy
March 18, 2016
More Decks by Eko Kurniawan Khannedy
See All by Eko Kurniawan Khannedy
Monolith to Event-Driven Microservices
khannedy
1
270
Refactoring
khannedy
0
360
Multi-Datacenter Kafka at Blibli.com
khannedy
2
1.5k
QA Tools - Research and Development
khannedy
0
300
Reactive Puzzle
khannedy
0
220
Event-Driven Architecture
khannedy
1
2k
Resilience Engineering with Hystrix and Spring
khannedy
1
580
Mocking for Unit Test using Mockito
khannedy
1
350
Centralized Configuration using Consul and Spring Cloud
khannedy
2
720
Other Decks in Technology
See All in Technology
2026年度新卒技術研修 サイバーエージェントのデータベース 活用事例とパフォーマンス調査入門
cyberagentdevelopers
PRO
1
170
「決め方」の渡し方 / How to hand over the "decision-making process"
pauli
7
1.2k
Podcast配信で広がったアウトプットの輪~70人と音声発信してきた7年間~/outputconf_01
fortegp05
0
230
Even G2 クイックスタートガイド(日本語版)
vrshinobi1
0
210
AIを活用したアクセシビリティ改善フロー
degudegu2510
1
150
「活動」は激変する。「ベース」は変わらない ~ 4つの軸で捉える_AI時代ソフトウェア開発マネジメント
sentokun
0
150
【PHPカンファレンス小田原2026】Webアプリケーションエンジニアにも知ってほしい オブザーバビリティ の本質
fendo181
0
210
今年60歳のおっさんCBになる
kentapapa
1
190
スクラムを支える内部品質の話
iij_pr
0
280
Cortex Code君、今日から内製化支援担当ね。
coco_se
0
280
最大のアウトプット術は問題を作ること
ryoaccount
0
300
Oracle AI Database@Azure:サービス概要のご紹介
oracle4engineer
PRO
6
1.4k
Featured
See All Featured
No one is an island. Learnings from fostering a developers community.
thoeni
21
3.7k
HDC tutorial
michielstock
1
600
Utilizing Notion as your number one productivity tool
mfonobong
4
280
Impact Scores and Hybrid Strategies: The future of link building
tamaranovitovic
0
250
Reality Check: Gamification 10 Years Later
codingconduct
0
2.1k
Helping Users Find Their Own Way: Creating Modern Search Experiences
danielanewman
31
3.1k
Crafting Experiences
bethany
1
110
Speed Design
sergeychernyshev
33
1.6k
Agile Actions for Facilitating Distributed Teams - ADO2019
mkilby
0
170
Reflections from 52 weeks, 52 projects
jeffersonlam
356
21k
How to optimise 3,500 product descriptions for ecommerce in one day using ChatGPT
katarinadahlin
PRO
1
3.5k
AI Search: Implications for SEO and How to Move Forward - #ShenzhenSEOConference
aleyda
1
1.2k
Transcript
MACHINE LEARNING ON PRODUCTION EKO KURNIAWAN KHANNEDY
MACHINE LEARNING ON PRODUCTION EKO KURNIAWAN KHANNEDY ▸ Principal Software
Development Engineer at blibli.com ▸ Part of Research and Development Team ▸
[email protected]
HAL YANG PALING SULIT ITU ADALAH MEMBAWA MACHINE LEARNING KE
PRODUCTION …. MACHINE LEARNING ON PRODUCTION
MACHINE LEARNING ON PRODUCTION AGENDA ▸ The Hard Part ▸
Best Practice ▸ Machine Learning in blibli.com
THE HARD PART MACHINE LEARNING ON PRODUCTION
MACHINE LEARNING ON PRODUCTION DATA ▸ Data Too Big ▸
Unstructured Data ▸ Document Oriented and Master Detail Data ▸ Continuous Data ▸ Imbalance Data ▸ Wild Data
MACHINE LEARNING ON PRODUCTION PREPROCESSING ▸ Feature Extraction ▸ Too
Many Features Extraction Makes Process Too Long
MACHINE LEARNING ON PRODUCTION TRAINING ▸ Batch Training ▸ Sequential
Algorithm ▸ Validation
BEST PRACTICE MACHINE LEARNING ON PRODUCTION
DATA
MACHINE LEARNING ON PRODUCTION DATA TOO BIG ▸ Load data
to memory. ▸ Streaming the datasource. ▸ Split data into multiple nodes. ▸ Use memory-file database.
MACHINE LEARNING ON PRODUCTION UNSTRUCTURED DATA ▸ Analyse Your Data
▸ Find Characteristic of Your Data ▸ Find Best Approachment for that case.
MACHINE LEARNING ON PRODUCTION DOCUMENT ORIENTED AND MASTER DETAIL DATA
▸ Analyse Your Data ▸ Find the Best Way to Treat The Data
MACHINE LEARNING ON PRODUCTION CONTINUOUS DATA ▸ Wide the range
that use in normalization process. ▸ Consider it as a missing value.
MACHINE LEARNING ON PRODUCTION IMBALANCE DATA ▸ Down Sampling. ▸
Up Sampling.
MACHINE LEARNING ON PRODUCTION WILD DATA ▸ Use Default Value.
▸ Use Average Value. ▸ Use Machine Learning to Predict Missing Value.
PREPROCESSING
MACHINE LEARNING ON PRODUCTION FEATURE EXTRACTION ▸ Add as Many
Facts as Possible ▸ Remove Irrelevant Feature
MACHINE LEARNING ON PRODUCTION TOO MANY FEATURES EXTRACTION MAKES PROCESS
TOO LONG ▸ Use Non-Blocking Process ▸ Use Event Driven Process ▸ Use Parallel Process
TRAINING
MACHINE LEARNING ON PRODUCTION BATCH TRAINING ▸ Use Real Time
Training ▸ Scheduled Training
MACHINE LEARNING ON PRODUCTION SEQUENTIAL ALGORITHM ▸ Distributed The Data
▸ Parallel The Algorithm
MACHINE LEARNING ON PRODUCTION VALIDATION ▸ Split Validation ▸ Cross
Validation ▸ Parallel The Validation
MACHINE LEARNING IN BLIBLI.COM MACHINE LEARNING ON PRODUCTION
MACHINE LEARNING ON PRODUCTION FRAUD PREVENTION PLATFORM RESTFULL MASTER DATA
CLIENT MACHINE LEARNING ENGINE PREPROCESSING ENGINE THIRD PARTY SERVICE
MACHINE LEARNING ON PRODUCTION MACHINE LEARNING ENGINE RESTFULL METADATA DATA
CLIENT TRAINING ENGINE TRAINING DATA CLASSIFICATION ENGINE
THANKS