Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
dynamo
Commits
60feb955
"examples/backends/vllm/deploy/agg_router_kv_approx.yaml" did not exist on "5bf23d54f3e46a15ff5000773a32d8829befa919"
Unverified
Commit
60feb955
authored
Dec 03, 2025
by
Karen Chung
Committed by
GitHub
Dec 03, 2025
Browse files
chore: bump vLLM to 0.11.2 (#4476)
parent
c5e8c4c2
Changes
23
Show whitespace changes
Inline
Side-by-side
Showing
3 changed files
with
2 additions
and
5 deletions
+2
-5
tests/fault_tolerance/deploy/container/Dockerfile.local_vllm
tests/fault_tolerance/deploy/container/Dockerfile.local_vllm
+2
-2
tests/fault_tolerance/deploy/templates/vllm/moe_agg.yaml
tests/fault_tolerance/deploy/templates/vllm/moe_agg.yaml
+0
-1
tests/fault_tolerance/deploy/templates/vllm/moe_disagg.yaml
tests/fault_tolerance/deploy/templates/vllm/moe_disagg.yaml
+0
-2
No files found.
tests/fault_tolerance/deploy/container/Dockerfile.local_vllm
View file @
60feb955
...
...
@@ -9,7 +9,7 @@
ARG LOCAL_VLLM_IMAGE="vllm-elastic-ep:latest_all2all_buffer_input"
ARG DYNAMO_BASE_IMAGE="dynamo:latest-none"
ARG RUNTIME_IMAGE="nvcr.io/nvidia/cuda"
ARG RUNTIME_IMAGE_TAG="12.
8.1
-runtime-ubuntu24.04"
ARG RUNTIME_IMAGE_TAG="12.
9.0
-runtime-ubuntu24.04"
# Other build arguments
ARG PYTHON_VERSION=3.12
...
...
@@ -57,7 +57,7 @@ RUN apt-get update && \
# prometheus dependencies
ca-certificates \
# DeepGemm uses 'cuobjdump' which does not come with CUDA image
cuda-command-line-tools-12-
8
&& \
cuda-command-line-tools-12-
9
&& \
rm -rf /var/lib/apt/lists/*
# Copy CUDA development tools from vLLM image (for JIT compilation)
...
...
tests/fault_tolerance/deploy/templates/vllm/moe_agg.yaml
View file @
60feb955
...
...
@@ -60,7 +60,6 @@ spec:
-
--model
-
deepseek-ai/DeepSeek-V2-Lite
-
--trust-remote-code
-
--disable-log-requests
-
--tensor-parallel-size
-
"
1"
-
--data-parallel-size
...
...
tests/fault_tolerance/deploy/templates/vllm/moe_disagg.yaml
View file @
60feb955
...
...
@@ -63,7 +63,6 @@ spec:
-
--model
-
deepseek-ai/DeepSeek-V2-Lite
-
--trust-remote-code
-
--disable-log-requests
-
--tensor-parallel-size
-
"
1"
-
--data-parallel-size
...
...
@@ -130,7 +129,6 @@ spec:
-
--model
-
deepseek-ai/DeepSeek-V2-Lite
-
--trust-remote-code
-
--disable-log-requests
-
--is-prefill-worker
-
--tensor-parallel-size
-
"
1"
...
...
Prev
1
2
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment