Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Machine Learning on Production
Search
Eko Kurniawan Khannedy
March 18, 2016
Technology
0
130
Machine Learning on Production
Machine Learning on Production
Eko Kurniawan Khannedy
March 18, 2016
Tweet
Share
More Decks by Eko Kurniawan Khannedy
See All by Eko Kurniawan Khannedy
Monolith to Event-Driven Microservices
khannedy
1
240
Refactoring
khannedy
0
300
Multi-Datacenter Kafka at Blibli.com
khannedy
2
1.5k
QA Tools - Research and Development
khannedy
0
270
Reactive Puzzle
khannedy
0
190
Event-Driven Architecture
khannedy
1
1.8k
Resilience Engineering with Hystrix and Spring
khannedy
1
550
Mocking for Unit Test using Mockito
khannedy
1
320
Centralized Configuration using Consul and Spring Cloud
khannedy
2
630
Other Decks in Technology
See All in Technology
SREが投資するAIOps ~ペアーズにおけるLLM for Developerへの取り組み~
takumiogawa
1
380
適材適所の技術選定 〜GraphQL・REST API・tRPC〜 / Optimal Technology Selection
kakehashi
1
680
AWS Lambdaと歩んだ“サーバーレス”と今後 #lambda_10years
yoshidashingo
1
180
個人でもIAM Identity Centerを使おう!(アクセス管理編)
ryder472
4
230
Can We Measure Developer Productivity?
ewolff
1
150
VideoMamba: State Space Model for Efficient Video Understanding
chou500
0
190
Why App Signing Matters for Your Android Apps - Android Bangkok Conference 2024
akexorcist
0
130
DynamoDB でスロットリングが発生したとき_大盛りver/when_throttling_occurs_in_dynamodb_long
emiki
1
420
インフラとバックエンドとフロントエンドをくまなく調べて遅いアプリを早くした件
tubone24
1
430
Shopifyアプリ開発における Shopifyの機能活用
sonatard
4
250
ドメインの本質を掴む / Get the essence of the domain
sinsoku
2
160
[CV勉強会@関東 ECCV2024 読み会] オンラインマッピング x トラッキング MapTracker: Tracking with Strided Memory Fusion for Consistent Vector HD Mapping (Chen+, ECCV24)
abemii
0
220
Featured
See All Featured
Designing the Hi-DPI Web
ddemaree
280
34k
Practical Tips for Bootstrapping Information Extraction Pipelines
honnibal
PRO
10
720
Java REST API Framework Comparison - PWX 2021
mraible
PRO
28
8.2k
Learning to Love Humans: Emotional Interface Design
aarron
273
40k
How to Think Like a Performance Engineer
csswizardry
20
1.1k
The Art of Delivering Value - GDevCon NA Keynote
reverentgeek
8
890
Facilitating Awesome Meetings
lara
50
6.1k
The Success of Rails: Ensuring Growth for the Next 100 Years
eileencodes
44
6.8k
XXLCSS - How to scale CSS and keep your sanity
sugarenia
246
1.3M
Mobile First: as difficult as doing things right
swwweet
222
8.9k
Save Time (by Creating Custom Rails Generators)
garrettdimon
PRO
27
840
Design and Strategy: How to Deal with People Who Don’t "Get" Design
morganepeng
126
18k
Transcript
MACHINE LEARNING ON PRODUCTION EKO KURNIAWAN KHANNEDY
MACHINE LEARNING ON PRODUCTION EKO KURNIAWAN KHANNEDY ▸ Principal Software
Development Engineer at blibli.com ▸ Part of Research and Development Team ▸
[email protected]
HAL YANG PALING SULIT ITU ADALAH MEMBAWA MACHINE LEARNING KE
PRODUCTION …. MACHINE LEARNING ON PRODUCTION
MACHINE LEARNING ON PRODUCTION AGENDA ▸ The Hard Part ▸
Best Practice ▸ Machine Learning in blibli.com
THE HARD PART MACHINE LEARNING ON PRODUCTION
MACHINE LEARNING ON PRODUCTION DATA ▸ Data Too Big ▸
Unstructured Data ▸ Document Oriented and Master Detail Data ▸ Continuous Data ▸ Imbalance Data ▸ Wild Data
MACHINE LEARNING ON PRODUCTION PREPROCESSING ▸ Feature Extraction ▸ Too
Many Features Extraction Makes Process Too Long
MACHINE LEARNING ON PRODUCTION TRAINING ▸ Batch Training ▸ Sequential
Algorithm ▸ Validation
BEST PRACTICE MACHINE LEARNING ON PRODUCTION
DATA
MACHINE LEARNING ON PRODUCTION DATA TOO BIG ▸ Load data
to memory. ▸ Streaming the datasource. ▸ Split data into multiple nodes. ▸ Use memory-file database.
MACHINE LEARNING ON PRODUCTION UNSTRUCTURED DATA ▸ Analyse Your Data
▸ Find Characteristic of Your Data ▸ Find Best Approachment for that case.
MACHINE LEARNING ON PRODUCTION DOCUMENT ORIENTED AND MASTER DETAIL DATA
▸ Analyse Your Data ▸ Find the Best Way to Treat The Data
MACHINE LEARNING ON PRODUCTION CONTINUOUS DATA ▸ Wide the range
that use in normalization process. ▸ Consider it as a missing value.
MACHINE LEARNING ON PRODUCTION IMBALANCE DATA ▸ Down Sampling. ▸
Up Sampling.
MACHINE LEARNING ON PRODUCTION WILD DATA ▸ Use Default Value.
▸ Use Average Value. ▸ Use Machine Learning to Predict Missing Value.
PREPROCESSING
MACHINE LEARNING ON PRODUCTION FEATURE EXTRACTION ▸ Add as Many
Facts as Possible ▸ Remove Irrelevant Feature
MACHINE LEARNING ON PRODUCTION TOO MANY FEATURES EXTRACTION MAKES PROCESS
TOO LONG ▸ Use Non-Blocking Process ▸ Use Event Driven Process ▸ Use Parallel Process
TRAINING
MACHINE LEARNING ON PRODUCTION BATCH TRAINING ▸ Use Real Time
Training ▸ Scheduled Training
MACHINE LEARNING ON PRODUCTION SEQUENTIAL ALGORITHM ▸ Distributed The Data
▸ Parallel The Algorithm
MACHINE LEARNING ON PRODUCTION VALIDATION ▸ Split Validation ▸ Cross
Validation ▸ Parallel The Validation
MACHINE LEARNING IN BLIBLI.COM MACHINE LEARNING ON PRODUCTION
MACHINE LEARNING ON PRODUCTION FRAUD PREVENTION PLATFORM RESTFULL MASTER DATA
CLIENT MACHINE LEARNING ENGINE PREPROCESSING ENGINE THIRD PARTY SERVICE
MACHINE LEARNING ON PRODUCTION MACHINE LEARNING ENGINE RESTFULL METADATA DATA
CLIENT TRAINING ENGINE TRAINING DATA CLASSIFICATION ENGINE
THANKS