OpenDAS / dynamo
Commit 412a12a8 (Unverified)
Authored Jul 25, 2025 by Biswa Panda; committed by GitHub, Jul 25, 2025

fix: rm enforce eager from vllm deploy - prefer perf over pod launch time (#2109)

Parent: 24cb926e

Showing 5 changed files with 8 additions and 8 deletions

components/backends/vllm/deploy/agg.yaml (+1 -1)
components/backends/vllm/deploy/agg_router.yaml (+1 -1)
components/backends/vllm/deploy/disagg.yaml (+2 -2)
components/backends/vllm/deploy/disagg_planner.yaml (+2 -2)
components/backends/vllm/deploy/disagg_router.yaml (+2 -2)

components/backends/vllm/deploy/agg.yaml
@@ -86,4 +86,4 @@ spec:
     - /bin/sh
     - -c
   args:
-    - python3 -m dynamo.vllm --model Qwen/Qwen3-0.6B --enforce-eager 2>&1 | tee /tmp/vllm.log
+    - python3 -m dynamo.vllm --model Qwen/Qwen3-0.6B 2>&1 | tee /tmp/vllm.log

components/backends/vllm/deploy/agg_router.yaml
@@ -86,4 +86,4 @@ spec:
     - /bin/sh
     - -c
   args:
-    - python3 -m dynamo.vllm --model Qwen/Qwen3-0.6B --enforce-eager 2>&1 | tee /tmp/vllm.log
+    - python3 -m dynamo.vllm --model Qwen/Qwen3-0.6B 2>&1 | tee /tmp/vllm.log

components/backends/vllm/deploy/disagg.yaml
@@ -86,7 +86,7 @@ spec:
     - /bin/sh
     - -c
   args:
-    - "python3 -m dynamo.vllm --model Qwen/Qwen3-0.6B --enforce-eager 2>&1 | tee /tmp/vllm.log"
+    - "python3 -m dynamo.vllm --model Qwen/Qwen3-0.6B 2>&1 | tee /tmp/vllm.log"
 VllmPrefillWorker:
   dynamoNamespace: vllm-disagg
   envFromSecret: hf-token-secret
@@ -128,4 +128,4 @@ spec:
     - /bin/sh
     - -c
   args:
-    - "python3 -m dynamo.vllm --model Qwen/Qwen3-0.6B --enforce-eager --is-prefill-worker 2>&1 | tee /tmp/vllm.log"
+    - "python3 -m dynamo.vllm --model Qwen/Qwen3-0.6B --is-prefill-worker 2>&1 | tee /tmp/vllm.log"

components/backends/vllm/deploy/disagg_planner.yaml
@@ -86,7 +86,7 @@ spec:
     - /bin/sh
     - -c
   args:
-    - "python3 -m dynamo.vllm --model Qwen/Qwen3-0.6B --enforce-eager 2>&1 | tee /tmp/vllm.log"
+    - "python3 -m dynamo.vllm --model Qwen/Qwen3-0.6B 2>&1 | tee /tmp/vllm.log"
 VllmPrefillWorker:
   dynamoNamespace: vllm-disagg-planner
   envFromSecret: hf-token-secret
@@ -128,4 +128,4 @@ spec:
     - /bin/sh
     - -c
   args:
-    - python3 -m dynamo.vllm --model Qwen/Qwen3-0.6B --enforce-eager --is-prefill-worker 2>&1 | tee /tmp/vllm.log
+    - python3 -m dynamo.vllm --model Qwen/Qwen3-0.6B --is-prefill-worker 2>&1 | tee /tmp/vllm.log

components/backends/vllm/deploy/disagg_router.yaml
@@ -86,7 +86,7 @@ spec:
     - /bin/sh
     - -c
   args:
-    - python3 -m dynamo.vllm --model Qwen/Qwen3-0.6B --enforce-eager 2>&1 | tee /tmp/vllm.log
+    - python3 -m dynamo.vllm --model Qwen/Qwen3-0.6B 2>&1 | tee /tmp/vllm.log
 VllmPrefillWorker:
   dynamoNamespace: vllm-v1-disagg-router
   envFromSecret: hf-token-secret
@@ -128,4 +128,4 @@ spec:
     - /bin/sh
     - -c
   args:
-    - python3 -m dynamo.vllm --model Qwen/Qwen3-0.6B --enforce-eager --is-prefill-worker 2>&1 | tee /tmp/vllm.log
+    - python3 -m dynamo.vllm --model Qwen/Qwen3-0.6B --is-prefill-worker 2>&1 | tee /tmp/vllm.log
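
Every hunk in this commit makes the same edit: the `--enforce-eager` token is dropped from the worker command and nothing else changes. In vLLM that flag forces eager execution and disables CUDA graph capture, which fits the commit title's trade of slower pod launch for better runtime performance. The sketch below is not part of the commit. It is a small, hypothetical helper (the names `uses_flag` and `drop_flag` are made up for illustration) that applies the same edit to a command string and could be used to check that a manifest's `args` entry no longer carries the flag.

```python
FLAG = "--enforce-eager"


def uses_flag(command: str, flag: str = FLAG) -> bool:
    """Return True if `flag` appears as a standalone token in `command`."""
    return flag in command.split()


def drop_flag(command: str, flag: str = FLAG) -> str:
    """Return `command` with every standalone occurrence of `flag` removed.

    The command is split on whitespace only, so shell operators such as
    `2>&1` and `|` pass through unchanged.
    """
    return " ".join(token for token in command.split() if token != flag)


# Command string taken from the prefill-worker hunks before the change.
before = (
    "python3 -m dynamo.vllm --model Qwen/Qwen3-0.6B "
    "--enforce-eager --is-prefill-worker 2>&1 | tee /tmp/vllm.log"
)
after = drop_flag(before)
print(after)
```

Splitting on whitespace rather than doing a substring replace avoids touching a longer option that merely starts with the same text, and it normalizes the spacing left behind where the flag used to be.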