(deploying-with-kserve)=
# Deploying with KServe
vLLM can be deployed with KServe on Kubernetes for highly scalable distributed model serving.
Please see this guide for more details on using vLLM with KServe.
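
As a rough illustration of what such a deployment can look like, the sketch below builds a KServe `InferenceService` manifest in Python and prints it as JSON. It rests on several assumptions that are not stated on this page and should be checked against the KServe guide: that the cluster has KServe installed with its Hugging Face serving runtime (which can use vLLM as a backend), that the `huggingface` model format and the `--model_name` / `--model_id` arguments apply to your KServe version, and that a GPU is available. The helper `build_inference_service`, the service name `opt-125m`, and the model `facebook/opt-125m` are hypothetical placeholders.

```python
import json


def build_inference_service(name: str, model_id: str, gpus: int = 1) -> dict:
    """Build a KServe InferenceService manifest as a plain dict.

    Hypothetical helper for illustration only; the field values below are
    assumptions to verify against the KServe documentation for your version.
    """
    return {
        "apiVersion": "serving.kserve.io/v1beta1",
        "kind": "InferenceService",
        "metadata": {"name": name},
        "spec": {
            "predictor": {
                "model": {
                    # Assumed: the Hugging Face runtime, which can serve via vLLM.
                    "modelFormat": {"name": "huggingface"},
                    # Assumed runtime arguments: served name and Hugging Face model ID.
                    "args": [
                        f"--model_name={name}",
                        f"--model_id={model_id}",
                    ],
                    # Request GPUs through the standard NVIDIA device plugin resource.
                    "resources": {"limits": {"nvidia.com/gpu": str(gpus)}},
                }
            }
        },
    }


if __name__ == "__main__":
    manifest = build_inference_service("opt-125m", "facebook/opt-125m")
    # kubectl accepts JSON as well as YAML manifests.
    print(json.dumps(manifest, indent=2))
```

The printed JSON can be saved to a file and applied with `kubectl apply -f <file>`. Resource limits, the model ID, and any additional runtime arguments will depend on your cluster and model.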