.. _deploying_with_kubeai: Deploying with KubeAI ===================== `KubeAI `_ is a Kubernetes operator that enables you to deploy and manage AI models on Kubernetes. It provides a simple and scalable way to deploy vLLM in production. Functionality such as scale-from-zero, load based autoscaling, model caching, and much more is provided out of the box with zero external dependencies. Please see the Installation Guides for environment specific instructions: * `Any Kubernetes Cluster `_ * `EKS `_ * `GKE `_ Once you have KubeAI installed, you can `configure text generation models `_ using vLLM.