Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
dynamo
Commits
c183aa31
"...git@developer.sourcefind.cn:2222/OpenDAS/vllm_cscc.git" did not exist on "12449f9492c5deddb0dc262ef9bfc34d44404ec0"
Unverified
Commit
c183aa31
authored
Jul 08, 2025
by
Biswa Panda
Committed by
GitHub
Jul 08, 2025
Browse files
fix: fix gpu resource spec in llm deployments (#1812)
parent
24bede9b
Changes
10
Show whitespace changes
Inline
Side-by-side
Showing
10 changed files
with
32 additions
and
32 deletions
+32
-32
examples/llm/deploy/agg.yaml
examples/llm/deploy/agg.yaml
+2
-2
examples/llm/deploy/agg_router.yaml
examples/llm/deploy/agg_router.yaml
+2
-2
examples/llm/deploy/disagg.yaml
examples/llm/deploy/disagg.yaml
+4
-4
examples/llm/deploy/disagg_router.yaml
examples/llm/deploy/disagg_router.yaml
+4
-4
examples/vllm_v0/deploy/agg.yaml
examples/vllm_v0/deploy/agg.yaml
+2
-2
examples/vllm_v0/deploy/disagg.yaml
examples/vllm_v0/deploy/disagg.yaml
+4
-4
examples/vllm_v0/deploy/disagg_planner.yaml
examples/vllm_v0/deploy/disagg_planner.yaml
+4
-4
examples/vllm_v1/deploy/agg.yaml
examples/vllm_v1/deploy/agg.yaml
+2
-2
examples/vllm_v1/deploy/disagg.yaml
examples/vllm_v1/deploy/disagg.yaml
+4
-4
examples/vllm_v1/deploy/disagg_planner.yaml
examples/vllm_v1/deploy/disagg_planner.yaml
+4
-4
No files found.
examples/llm/deploy/agg.yaml
View file @
c183aa31
...
@@ -79,11 +79,11 @@ spec:
...
@@ -79,11 +79,11 @@ spec:
requests
:
requests
:
cpu
:
"
10"
cpu
:
"
10"
memory
:
"
20Gi"
memory
:
"
20Gi"
nvidia.com/
gpu
:
"
1"
gpu
:
"
1"
limits
:
limits
:
cpu
:
"
10"
cpu
:
"
10"
memory
:
"
20Gi"
memory
:
"
20Gi"
nvidia.com/
gpu
:
"
1"
gpu
:
"
1"
extraPodSpec
:
extraPodSpec
:
mainContainer
:
mainContainer
:
image
:
nvcr.io/nvidia/ai-dynamo/vllm-runtime:latest
image
:
nvcr.io/nvidia/ai-dynamo/vllm-runtime:latest
...
...
examples/llm/deploy/agg_router.yaml
View file @
c183aa31
...
@@ -104,11 +104,11 @@ spec:
...
@@ -104,11 +104,11 @@ spec:
requests
:
requests
:
cpu
:
"
10"
cpu
:
"
10"
memory
:
"
20Gi"
memory
:
"
20Gi"
nvidia.com/
gpu
:
"
1"
gpu
:
"
1"
limits
:
limits
:
cpu
:
"
10"
cpu
:
"
10"
memory
:
"
20Gi"
memory
:
"
20Gi"
nvidia.com/
gpu
:
"
1"
gpu
:
"
1"
extraPodSpec
:
extraPodSpec
:
mainContainer
:
mainContainer
:
image
:
nvcr.io/nvidia/ai-dynamo/vllm-runtime:latest
image
:
nvcr.io/nvidia/ai-dynamo/vllm-runtime:latest
...
...
examples/llm/deploy/disagg.yaml
View file @
c183aa31
...
@@ -79,11 +79,11 @@ spec:
...
@@ -79,11 +79,11 @@ spec:
requests
:
requests
:
cpu
:
"
10"
cpu
:
"
10"
memory
:
"
20Gi"
memory
:
"
20Gi"
nvidia.com/
gpu
:
"
1"
gpu
:
"
1"
limits
:
limits
:
cpu
:
"
10"
cpu
:
"
10"
memory
:
"
20Gi"
memory
:
"
20Gi"
nvidia.com/
gpu
:
"
1"
gpu
:
"
1"
extraPodSpec
:
extraPodSpec
:
mainContainer
:
mainContainer
:
image
:
nvcr.io/nvidia/ai-dynamo/vllm-runtime:latest
image
:
nvcr.io/nvidia/ai-dynamo/vllm-runtime:latest
...
@@ -106,11 +106,11 @@ spec:
...
@@ -106,11 +106,11 @@ spec:
requests
:
requests
:
cpu
:
"
10"
cpu
:
"
10"
memory
:
"
20Gi"
memory
:
"
20Gi"
nvidia.com/
gpu
:
"
1"
gpu
:
"
1"
limits
:
limits
:
cpu
:
"
10"
cpu
:
"
10"
memory
:
"
20Gi"
memory
:
"
20Gi"
nvidia.com/
gpu
:
"
1"
gpu
:
"
1"
extraPodSpec
:
extraPodSpec
:
mainContainer
:
mainContainer
:
image
:
nvcr.io/nvidia/ai-dynamo/vllm-runtime:latest
image
:
nvcr.io/nvidia/ai-dynamo/vllm-runtime:latest
...
...
examples/llm/deploy/disagg_router.yaml
View file @
c183aa31
...
@@ -104,11 +104,11 @@ spec:
...
@@ -104,11 +104,11 @@ spec:
requests
:
requests
:
cpu
:
"
10"
cpu
:
"
10"
memory
:
"
20Gi"
memory
:
"
20Gi"
nvidia.com/
gpu
:
"
1"
gpu
:
"
1"
limits
:
limits
:
cpu
:
"
10"
cpu
:
"
10"
memory
:
"
20Gi"
memory
:
"
20Gi"
nvidia.com/
gpu
:
"
1"
gpu
:
"
1"
extraPodSpec
:
extraPodSpec
:
mainContainer
:
mainContainer
:
image
:
nvcr.io/nvidia/ai-dynamo/vllm-runtime:latest
image
:
nvcr.io/nvidia/ai-dynamo/vllm-runtime:latest
...
@@ -131,11 +131,11 @@ spec:
...
@@ -131,11 +131,11 @@ spec:
requests
:
requests
:
cpu
:
"
10"
cpu
:
"
10"
memory
:
"
20Gi"
memory
:
"
20Gi"
nvidia.com/
gpu
:
"
1"
gpu
:
"
1"
limits
:
limits
:
cpu
:
"
10"
cpu
:
"
10"
memory
:
"
20Gi"
memory
:
"
20Gi"
nvidia.com/
gpu
:
"
1"
gpu
:
"
1"
extraPodSpec
:
extraPodSpec
:
mainContainer
:
mainContainer
:
image
:
nvcr.io/nvidia/ai-dynamo/vllm-runtime:latest
image
:
nvcr.io/nvidia/ai-dynamo/vllm-runtime:latest
...
...
examples/vllm_v0/deploy/agg.yaml
View file @
c183aa31
...
@@ -54,11 +54,11 @@ spec:
...
@@ -54,11 +54,11 @@ spec:
requests
:
requests
:
cpu
:
"
10"
cpu
:
"
10"
memory
:
"
20Gi"
memory
:
"
20Gi"
nvidia.com/
gpu
:
"
1"
gpu
:
"
1"
limits
:
limits
:
cpu
:
"
10"
cpu
:
"
10"
memory
:
"
20Gi"
memory
:
"
20Gi"
nvidia.com/
gpu
:
"
1"
gpu
:
"
1"
extraPodSpec
:
extraPodSpec
:
mainContainer
:
mainContainer
:
image
:
nvcr.io/nvidia/ai-dynamo/vllm-runtime:latest
image
:
nvcr.io/nvidia/ai-dynamo/vllm-runtime:latest
...
...
examples/vllm_v0/deploy/disagg.yaml
View file @
c183aa31
...
@@ -54,11 +54,11 @@ spec:
...
@@ -54,11 +54,11 @@ spec:
requests
:
requests
:
cpu
:
"
10"
cpu
:
"
10"
memory
:
"
20Gi"
memory
:
"
20Gi"
nvidia.com/
gpu
:
"
1"
gpu
:
"
1"
limits
:
limits
:
cpu
:
"
10"
cpu
:
"
10"
memory
:
"
20Gi"
memory
:
"
20Gi"
nvidia.com/
gpu
:
"
1"
gpu
:
"
1"
extraPodSpec
:
extraPodSpec
:
mainContainer
:
mainContainer
:
image
:
nvcr.io/nvidia/ai-dynamo/vllm-runtime:latest
image
:
nvcr.io/nvidia/ai-dynamo/vllm-runtime:latest
...
@@ -81,11 +81,11 @@ spec:
...
@@ -81,11 +81,11 @@ spec:
requests
:
requests
:
cpu
:
"
10"
cpu
:
"
10"
memory
:
"
20Gi"
memory
:
"
20Gi"
nvidia.com/
gpu
:
"
1"
gpu
:
"
1"
limits
:
limits
:
cpu
:
"
10"
cpu
:
"
10"
memory
:
"
20Gi"
memory
:
"
20Gi"
nvidia.com/
gpu
:
"
1"
gpu
:
"
1"
extraPodSpec
:
extraPodSpec
:
mainContainer
:
mainContainer
:
image
:
nvcr.io/nvidia/ai-dynamo/vllm-runtime:latest
image
:
nvcr.io/nvidia/ai-dynamo/vllm-runtime:latest
...
...
examples/vllm_v0/deploy/disagg_planner.yaml
View file @
c183aa31
...
@@ -55,11 +55,11 @@ spec:
...
@@ -55,11 +55,11 @@ spec:
requests
:
requests
:
cpu
:
"
20"
cpu
:
"
20"
memory
:
"
40Gi"
memory
:
"
40Gi"
nvidia.com/
gpu
:
"
2"
gpu
:
"
2"
limits
:
limits
:
cpu
:
"
20"
cpu
:
"
20"
memory
:
"
40Gi"
memory
:
"
40Gi"
nvidia.com/
gpu
:
"
2"
gpu
:
"
2"
extraPodSpec
:
extraPodSpec
:
mainContainer
:
mainContainer
:
image
:
nvcr.io/nvidia/ai-dynamo/vllm-runtime:latest
image
:
nvcr.io/nvidia/ai-dynamo/vllm-runtime:latest
...
@@ -83,11 +83,11 @@ spec:
...
@@ -83,11 +83,11 @@ spec:
requests
:
requests
:
cpu
:
"
20"
cpu
:
"
20"
memory
:
"
40Gi"
memory
:
"
40Gi"
nvidia.com/
gpu
:
"
2"
gpu
:
"
2"
limits
:
limits
:
cpu
:
"
20"
cpu
:
"
20"
memory
:
"
40Gi"
memory
:
"
40Gi"
nvidia.com/
gpu
:
"
2"
gpu
:
"
2"
extraPodSpec
:
extraPodSpec
:
mainContainer
:
mainContainer
:
image
:
nvcr.io/nvidia/ai-dynamo/vllm-runtime:latest
image
:
nvcr.io/nvidia/ai-dynamo/vllm-runtime:latest
...
...
examples/vllm_v1/deploy/agg.yaml
View file @
c183aa31
...
@@ -79,11 +79,11 @@ spec:
...
@@ -79,11 +79,11 @@ spec:
requests
:
requests
:
cpu
:
"
10"
cpu
:
"
10"
memory
:
"
20Gi"
memory
:
"
20Gi"
nvidia.com/
gpu
:
"
1"
gpu
:
"
1"
limits
:
limits
:
cpu
:
"
10"
cpu
:
"
10"
memory
:
"
20Gi"
memory
:
"
20Gi"
nvidia.com/
gpu
:
"
1"
gpu
:
"
1"
extraPodSpec
:
extraPodSpec
:
mainContainer
:
mainContainer
:
image
:
nvcr.io/nvidia/ai-dynamo/vllm-runtime:latest
image
:
nvcr.io/nvidia/ai-dynamo/vllm-runtime:latest
...
...
examples/vllm_v1/deploy/disagg.yaml
View file @
c183aa31
...
@@ -79,11 +79,11 @@ spec:
...
@@ -79,11 +79,11 @@ spec:
requests
:
requests
:
cpu
:
"
10"
cpu
:
"
10"
memory
:
"
20Gi"
memory
:
"
20Gi"
nvidia.com/
gpu
:
"
1"
gpu
:
"
1"
limits
:
limits
:
cpu
:
"
10"
cpu
:
"
10"
memory
:
"
20Gi"
memory
:
"
20Gi"
nvidia.com/
gpu
:
"
1"
gpu
:
"
1"
extraPodSpec
:
extraPodSpec
:
mainContainer
:
mainContainer
:
image
:
nvcr.io/nvidia/ai-dynamo/vllm-runtime:latest
image
:
nvcr.io/nvidia/ai-dynamo/vllm-runtime:latest
...
@@ -106,11 +106,11 @@ spec:
...
@@ -106,11 +106,11 @@ spec:
requests
:
requests
:
cpu
:
"
10"
cpu
:
"
10"
memory
:
"
20Gi"
memory
:
"
20Gi"
nvidia.com/
gpu
:
"
1"
gpu
:
"
1"
limits
:
limits
:
cpu
:
"
10"
cpu
:
"
10"
memory
:
"
20Gi"
memory
:
"
20Gi"
nvidia.com/
gpu
:
"
1"
gpu
:
"
1"
extraPodSpec
:
extraPodSpec
:
mainContainer
:
mainContainer
:
image
:
nvcr.io/nvidia/ai-dynamo/vllm-runtime:latest
image
:
nvcr.io/nvidia/ai-dynamo/vllm-runtime:latest
...
...
examples/vllm_v1/deploy/disagg_planner.yaml
View file @
c183aa31
...
@@ -81,11 +81,11 @@ spec:
...
@@ -81,11 +81,11 @@ spec:
requests
:
requests
:
cpu
:
"
20"
cpu
:
"
20"
memory
:
"
40Gi"
memory
:
"
40Gi"
nvidia.com/
gpu
:
"
2"
gpu
:
"
2"
limits
:
limits
:
cpu
:
"
20"
cpu
:
"
20"
memory
:
"
40Gi"
memory
:
"
40Gi"
nvidia.com/
gpu
:
"
2"
gpu
:
"
2"
extraPodSpec
:
extraPodSpec
:
mainContainer
:
mainContainer
:
image
:
nvcr.io/nvidia/ai-dynamo/vllm-runtime:latest
image
:
nvcr.io/nvidia/ai-dynamo/vllm-runtime:latest
...
@@ -109,11 +109,11 @@ spec:
...
@@ -109,11 +109,11 @@ spec:
requests
:
requests
:
cpu
:
"
20"
cpu
:
"
20"
memory
:
"
40Gi"
memory
:
"
40Gi"
nvidia.com/
gpu
:
"
2"
gpu
:
"
2"
limits
:
limits
:
cpu
:
"
20"
cpu
:
"
20"
memory
:
"
40Gi"
memory
:
"
40Gi"
nvidia.com/
gpu
:
"
2"
gpu
:
"
2"
extraPodSpec
:
extraPodSpec
:
mainContainer
:
mainContainer
:
image
:
nvcr.io/nvidia/ai-dynamo/vllm-runtime:latest
image
:
nvcr.io/nvidia/ai-dynamo/vllm-runtime:latest
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment