OpenDAS / dynamo
Commit 2c642fd0 (unverified)
Authored Jul 22, 2025 by Biswa Panda; committed by GitHub, Jul 22, 2025

fix: vllm deployment examples (#2062)

Parent: 1958b3aa

Changes: 3 changed files with 5 additions and 5 deletions

components/backends/vllm/deploy/agg_router.yaml (+1 -1)
components/backends/vllm/deploy/disagg.yaml (+2 -2)
components/backends/vllm/deploy/disagg_planner.yaml (+2 -2)

components/backends/vllm/deploy/agg_router.yaml
@@ -80,4 +80,4 @@ spec:
           image: nvcr.io/nvidian/nim-llm-dev/vllm_v1-runtime:dep-216.4
           workingDir: /workspace/components/backends/vllm
           args:
-            - "python3 components/main.py --model Qwen/Qwen3-0.6B --enforce-eager 2>&1 | tee /tmp/vllm.log"
+            - "python3 -m dynamo.vllm --model Qwen/Qwen3-0.6B --enforce-eager 2>&1 | tee /tmp/vllm.log"

components/backends/vllm/deploy/disagg.yaml
@@ -80,7 +80,7 @@ spec:
           image: nvcr.io/nvidian/nim-llm-dev/vllm_v1-runtime:dep-216.4
           workingDir: /workspace/components/backends/vllm
           args:
-            - "python3 components/main.py --model Qwen/Qwen3-0.6B --enforce-eager 2>&1 | tee /tmp/vllm.log"
+            - "python3 -m dynamo.vllm --model Qwen/Qwen3-0.6B --enforce-eager 2>&1 | tee /tmp/vllm.log"
     VllmPrefillWorker:
       dynamoNamespace: vllm-v1-disagg
       envFromSecret: hf-token-secret
@@ -119,4 +119,4 @@ spec:
           image: nvcr.io/nvidian/nim-llm-dev/vllm_v1-runtime:dep-216.4
           workingDir: /workspace/components/backends/vllm
           args:
-            - "python3 components/main.py --model Qwen/Qwen3-0.6B --enforce-eager --is-prefill-worker 2>&1 | tee /tmp/vllm.log"
+            - "python3 -m dynamo.vllm --model Qwen/Qwen3-0.6B --enforce-eager --is-prefill-worker 2>&1 | tee /tmp/vllm.log"

components/backends/vllm/deploy/disagg_planner.yaml
@@ -80,7 +80,7 @@ spec:
           image: nvcr.io/nvidian/nim-llm-dev/vllm_v1-runtime:dep-216.4
           workingDir: /workspace/components/backends/vllm
           args:
-            - "python3 components/main.py --model Qwen/Qwen3-0.6B --enforce-eager 2>&1 | tee /tmp/vllm.log"
+            - "python3 -m dynamo.vllm --model Qwen/Qwen3-0.6B --enforce-eager 2>&1 | tee /tmp/vllm.log"
     VllmPrefillWorker:
       dynamoNamespace: vllm-v1-disagg-planner
       envFromSecret: hf-token-secret
@@ -119,4 +119,4 @@ spec:
           image: nvcr.io/nvidian/nim-llm-dev/vllm_v1-runtime:dep-216.4
           workingDir: /workspace/components/backends/vllm
           args:
-            - "python3 components/main.py --model Qwen/Qwen3-0.6B --enforce-eager --is-prefill-worker 2>&1 | tee /tmp/vllm.log"
+            - "python3 -m dynamo.vllm --model Qwen/Qwen3-0.6B --enforce-eager --is-prefill-worker 2>&1 | tee /tmp/vllm.log"
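
Every hunk in this commit makes the same substitution: the script path `components/main.py` becomes the module invocation `-m dynamo.vllm`, and all other flags are left untouched. A minimal sketch of that rewrite follows, assuming one wanted to apply or check it on other command strings. The helper name `migrate_command` is hypothetical and not part of the commit.

```python
# Hypothetical helper, not part of the commit: it mirrors the substitution
# visible in the removed (-) and added (+) lines of the diff.
LEGACY_SCRIPT = "components/main.py"  # entrypoint in the removed lines
MODULE_NAME = "dynamo.vllm"           # module in the added lines


def migrate_command(command: str) -> str:
    """Swap the legacy script invocation for the module invocation.

    Flags such as --model, --enforce-eager, and --is-prefill-worker,
    and the trailing shell redirection, are left unchanged.
    """
    legacy = f"python3 {LEGACY_SCRIPT}"
    replacement = f"python3 -m {MODULE_NAME}"
    return command.replace(legacy, replacement)


def uses_legacy_entrypoint(command: str) -> bool:
    """Report whether a command string still runs the old script path."""
    return f"python3 {LEGACY_SCRIPT}" in command
```

Running `migrate_command` on a string that already uses `-m dynamo.vllm` returns it unchanged, so the helper is safe to apply repeatedly.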