Slide 7
Some Data
training time: 20 min to 2 hours
training samples: 3-30 million
DAG tasks: 20-50+
model configuration: {"numIterations": 200, "maxDepth": 5, "maxBins": 28}
daily requests: 10 million to 30+ million
single prediction time / response time: 2 ms / 5-12 ms
serialised model size: 50 KB to 1.5 MB
feature count: 40-100
Spark version: 2.1.0
alternative frameworks: XGBoost, TensorFlow DNN, Facebook GBDT+LR
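
To put the daily request volume and the 2 ms single-prediction time in perspective, the figures can be converted to per-second rates. This is only a back-of-the-envelope sketch: it assumes traffic is spread evenly over the day (real peaks will be higher) and that one worker scores requests strictly one after another. The function names are illustrative and do not come from the slide.

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400 seconds


def average_qps(daily_requests: int) -> float:
    """Average requests per second, assuming perfectly uniform traffic."""
    return daily_requests / SECONDS_PER_DAY


def serial_predictions_per_second(latency_ms: float) -> float:
    """Upper bound on predictions/s for one worker scoring serially."""
    return 1000.0 / latency_ms


# Figures taken from the slide: 10-30+ million requests/day, 2 ms per prediction.
low_qps = average_qps(10_000_000)
high_qps = average_qps(30_000_000)
per_worker = serial_predictions_per_second(2.0)

print(f"average load: {low_qps:.0f}-{high_qps:.0f} requests/s")
# prints "average load: 116-347 requests/s"
print(f"one serial worker at 2 ms: {per_worker:.0f} predictions/s")
# prints "one serial worker at 2 ms: 500 predictions/s"
```

Under these assumptions the average load is on the order of a few hundred requests per second. Since the slide gives the upper volume as "30+ million", the 347 requests/s figure is a floor for the high end, not a ceiling.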