Unverified Commit 74062740 authored by Sam Stoelinga's avatar Sam Stoelinga Committed by GitHub
Browse files

[Doc] add KubeAI to serving integrations (#10837)


Signed-off-by: default avatarSam Stoelinga <sammiestoel@gmail.com>
parent 8b596318
.. _deploying_with_kubeai:
Deploying with KubeAI
=====================
`KubeAI <https://github.com/substratusai/kubeai>`_ is a Kubernetes operator that enables you to deploy and manage AI models on Kubernetes. It provides a simple and scalable way to deploy vLLM in production. Functionality such as scale-from-zero, load based autoscaling, model caching, and much more is provided out of the box with zero external dependencies.
Please see the Installation Guides for environment specific instructions:
* `Any Kubernetes Cluster <https://www.kubeai.org/installation/any/>`_
* `EKS <https://www.kubeai.org/installation/eks/>`_
* `GKE <https://www.kubeai.org/installation/gke/>`_
Once you have KubeAI installed, you can
`configure text generation models <https://www.kubeai.org/how-to/configure-text-generation-models/>`_
using vLLM.
\ No newline at end of file
...@@ -6,6 +6,7 @@ Integrations ...@@ -6,6 +6,7 @@ Integrations
run_on_sky run_on_sky
deploying_with_kserve deploying_with_kserve
deploying_with_kubeai
deploying_with_triton deploying_with_triton
deploying_with_bentoml deploying_with_bentoml
deploying_with_cerebrium deploying_with_cerebrium
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment