Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
dcfe9523
Unverified
Commit
dcfe9523
authored
May 17, 2025
by
Michael Goin
Committed by
GitHub
May 17, 2025
Browse files
Update Dockerfile to build for Blackwell (#18095)
parent
48ac2bed
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
3 additions
and
3 deletions
+3
-3
docker/Dockerfile
docker/Dockerfile
+3
-3
No files found.
docker/Dockerfile
View file @
dcfe9523
...
@@ -77,7 +77,7 @@ RUN --mount=type=cache,target=/root/.cache/uv \
...
@@ -77,7 +77,7 @@ RUN --mount=type=cache,target=/root/.cache/uv \
# can be useful for both `dev` and `test`
# can be useful for both `dev` and `test`
# explicitly set the list to avoid issues with torch 2.2
# explicitly set the list to avoid issues with torch 2.2
# see https://github.com/pytorch/pytorch/pull/123243
# see https://github.com/pytorch/pytorch/pull/123243
ARG
torch_cuda_arch_list='7.0 7.5 8.0 8.
6 8.9 9
.0+PTX'
ARG
torch_cuda_arch_list='7.0 7.5 8.0 8.
9 9.0 10
.0+PTX'
ENV
TORCH_CUDA_ARCH_LIST=${torch_cuda_arch_list}
ENV
TORCH_CUDA_ARCH_LIST=${torch_cuda_arch_list}
# Override the arch list for flash-attn to reduce the binary size
# Override the arch list for flash-attn to reduce the binary size
ARG
vllm_fa_cmake_gpu_arches='80-real;90-real'
ARG
vllm_fa_cmake_gpu_arches='80-real;90-real'
...
@@ -257,8 +257,8 @@ RUN --mount=type=cache,target=/root/.cache/uv \
...
@@ -257,8 +257,8 @@ RUN --mount=type=cache,target=/root/.cache/uv \
if
[
"
$TARGETPLATFORM
"
!=
"linux/arm64"
]
;
then
\
if
[
"
$TARGETPLATFORM
"
!=
"linux/arm64"
]
;
then
\
# uv pip install --system https://github.com/flashinfer-ai/flashinfer/releases/download/v0.2.4/flashinfer_python-0.2.4+cu124torch2.6-cp38-abi3-linux_x86_64.whl ;
\
# uv pip install --system https://github.com/flashinfer-ai/flashinfer/releases/download/v0.2.4/flashinfer_python-0.2.4+cu124torch2.6-cp38-abi3-linux_x86_64.whl ;
\
# TESTING: install FlashInfer from source to test 2.7.0 final RC
# TESTING: install FlashInfer from source to test 2.7.0 final RC
FLASHINFER_ENABLE_AOT=1 TORCH_CUDA_ARCH_LIST='7.5 8.0 8.
6 8.9 9
.0+PTX' \
FLASHINFER_ENABLE_AOT=1 TORCH_CUDA_ARCH_LIST='7.5 8.0 8.
9 9.0 10
.0+PTX' \
uv pip install --system --no-build-isolation "git+https://github.com/flashinfer-ai/flashinfer@
v0.2.4
" ; \
uv pip install --system --no-build-isolation "git+https://github.com/flashinfer-ai/flashinfer@
e00e8cedbfcb220f328fd36aa8f529f869b01e6b
" ; \
fi
fi
COPY
examples examples
COPY
examples examples
COPY
benchmarks benchmarks
COPY
benchmarks benchmarks
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment