Slide 12
Slide 12 text
Ray Systems Complexity and FinOps
The recommended way of running Ray on Kubernetes requires using multiple layers of resource
managers:
● RayCluster autoscaler - for managing Serve Actors.
● Kuberay - for managing k8s Pods.
● EKS, GKE, AKS - for managing compute instances.
Introducing this level of complexity must be justified:
● Costly hardware.
● Deep inference graphs.
● Multiplicity of served models.