Unverified Commit a5e1335c authored by dagil-nvidia's avatar dagil-nvidia Committed by GitHub
Browse files

docs: use default dynamo queue in KAI-Scheduler example (#4625)


Signed-off-by: default avatarDan Gil <dagil@nvidia.com>
parent 6cd6033c
...@@ -51,7 +51,7 @@ These systems provide enhanced scheduling capabilities including topology-aware ...@@ -51,7 +51,7 @@ These systems provide enhanced scheduling capabilities including topology-aware
##### Prerequisites ##### Prerequisites
- [Grove](https://github.com/NVIDIA/grove/blob/main/docs/installation.md) installed on the cluster - [Grove](https://github.com/NVIDIA/grove/blob/main/docs/installation.md) installed on the cluster
- (Optional) [KAI-Scheduler](https://github.com/NVIDIA/KAI-Scheduler) installed on the cluster with default queue name `dynamo` created. You can use a different queue name by setting the `nvidia.com/kai-scheduler-queue` annotation on the DGD resource. - (Optional) [KAI-Scheduler](https://github.com/NVIDIA/KAI-Scheduler) installed on the cluster with the default queue name `dynamo` created. If no queue annotation is specified on the DGD resource, the operator uses the `dynamo` queue by default. Custom queue names can be specified via the `nvidia.com/kai-scheduler-queue` annotation, but the queue must exist in the cluster before deployment.
KAI-Scheduler is optional but recommended for advanced scheduling capabilities. KAI-Scheduler is optional but recommended for advanced scheduling capabilities.
...@@ -94,11 +94,13 @@ kind: DynamoGraphDeployment ...@@ -94,11 +94,13 @@ kind: DynamoGraphDeployment
metadata: metadata:
name: my-multinode-deployment name: my-multinode-deployment
annotations: annotations:
nvidia.com/kai-scheduler-queue: "gpu-intensive" # Optional: defaults to "dynamo" nvidia.com/kai-scheduler-queue: "dynamo"
spec: spec:
# ... your deployment spec # ... your deployment spec
``` ```
> **Note:** The `nvidia.com/kai-scheduler-queue` annotation defaults to `"dynamo"`. If you specify a custom queue name, ensure the queue exists in your cluster before deploying. You can verify available queues with `kubectl get queues`.
**Force LWS usage:** **Force LWS usage:**
```yaml ```yaml
apiVersion: nvidia.com/v1alpha1 apiVersion: nvidia.com/v1alpha1
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment