Slide 66
KFServing
Overview
● Provides inference features (defined with the InferenceService custom resource)
● Abstracts model serving behind a common interface
(compatible with TensorFlow, PyTorch, XGBoost, etc.)
● Manages serving containers (Knative)
● Manages traffic routing (Istio)
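To make the overview concrete, here is a minimal sketch of what an InferenceService resource might look like, built as a plain Python dictionary. It assumes the v1alpha2 API group (`serving.kubeflow.org/v1alpha2`) that the kubeflow/kfserving repository used; newer releases lay the spec out differently. The model name and storage URI are hypothetical placeholders, and the helper `build_inference_service` is an illustrative name, not part of KFServing.

```python
import json


def build_inference_service(name, framework, storage_uri):
    """Return a KFServing InferenceService manifest as a dict.

    Assumes the v1alpha2 schema: spec.default.predictor.<framework>.storageUri.
    `framework` is the predictor key, e.g. "tensorflow", "pytorch", "xgboost".
    """
    return {
        "apiVersion": "serving.kubeflow.org/v1alpha2",
        "kind": "InferenceService",
        "metadata": {"name": name},
        "spec": {
            "default": {
                "predictor": {
                    # Only the framework key changes between model types;
                    # the rest of the resource stays the same.
                    framework: {"storageUri": storage_uri}
                }
            }
        },
    }


if __name__ == "__main__":
    # Hypothetical model name and bucket, for illustration only.
    manifest = build_inference_service(
        "example-model", "tensorflow", "gs://example-bucket/models/example"
    )
    print(json.dumps(manifest, indent=2))
```

The printed JSON could be saved to a file and applied with `kubectl apply -f`. Swapping `"tensorflow"` for another supported framework key is what the slide means by abstracting model serving.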
Special features
● Auto scaling
● Canary rollout, A/B tests
● Prediction, pre-processing, post-processing, etc.
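The canary rollout and prediction features listed above can be sketched as follows. This is an illustrative example under stated assumptions, not the project's reference usage: it assumes the v1alpha2 schema, where a `canary` spec sits next to `default` and `canaryTrafficPercent` sets the share of traffic sent to the canary, and it assumes the TensorFlow-Serving-style V1 prediction protocol (`POST /v1/models/<name>:predict` with an `instances` list). The helper names, host, model name, and storage URIs are hypothetical.

```python
import json


def build_canary_service(name, framework, default_uri, canary_uri, percent):
    """Return a v1alpha2 InferenceService dict with a canary revision.

    `percent` is the share of traffic (0-100) routed to the canary model;
    the remainder goes to the default model.
    """
    if not 0 <= percent <= 100:
        raise ValueError("percent must be between 0 and 100")
    return {
        "apiVersion": "serving.kubeflow.org/v1alpha2",
        "kind": "InferenceService",
        "metadata": {"name": name},
        "spec": {
            "default": {"predictor": {framework: {"storageUri": default_uri}}},
            "canary": {"predictor": {framework: {"storageUri": canary_uri}}},
            "canaryTrafficPercent": percent,
        },
    }


def build_predict_request(host, model_name, instances):
    """Return (url, json_body) for a V1-protocol prediction call.

    Nothing is sent over the network; the caller can POST the body
    with any HTTP client.
    """
    url = f"http://{host}/v1/models/{model_name}:predict"
    body = json.dumps({"instances": instances})
    return url, body


if __name__ == "__main__":
    # Hypothetical names and URIs, for illustration only.
    manifest = build_canary_service(
        "example-model",
        "tensorflow",
        "gs://example-bucket/models/v1",
        "gs://example-bucket/models/v2",
        10,
    )
    print(json.dumps(manifest, indent=2))
    url, body = build_predict_request(
        "example-model.default.example.com", "example-model", [[1.0, 2.0]]
    )
    print(url)
    print(body)
```

Raising `canaryTrafficPercent` step by step toward 100 is one way to promote the new model gradually; the traffic split itself is carried out by the routing layer rather than by the model server.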
https://github.com/kubeflow/kfserving/blob/master/docs/diagrams/kfserving.png