Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
1645b601
Unverified
Commit
1645b601
authored
May 23, 2025
by
Huy Do
Committed by
GitHub
May 23, 2025
Browse files
Use prebuilt FlashInfer x86_64 PyTorch 2.7 CUDA 12.8 wheel for CI (#18537)
Signed-off-by:
Huy Do
<
huydhn@gmail.com
>
parent
2628a69e
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
8 additions
and
9 deletions
+8
-9
docker/Dockerfile
docker/Dockerfile
+8
-9
No files found.
docker/Dockerfile
View file @
1645b601
...
...
@@ -257,18 +257,17 @@ RUN --mount=type=bind,from=build,src=/workspace/dist,target=/vllm-workspace/dist
RUN
--mount
=
type
=
cache,target
=
/root/.cache/uv
\
.
/etc/environment
&&
\
if
[
"
$TARGETPLATFORM
"
!=
"linux/arm64"
]
;
then
\
# uv pip install --system https://github.com/flashinfer-ai/flashinfer/releases/download/v0.2.4/flashinfer_python-0.2.4+cu124torch2.6-cp38-abi3-linux_x86_64.whl ;
\
# TESTING: install FlashInfer from source to test 2.7.0 final RC
# FlashInfer alreary has a wheel for PyTorch 2.7.0 and CUDA 12.8. This is enough for CI use
if [[ "$CUDA_VERSION" == 12.8* ]]; then \
export TORCH_CUDA_ARCH_LIST='7.5 8.0 8.9 9.0 10.0+PTX'
; \
uv pip install --system https://download.pytorch.org/whl/cu128/flashinfer/flashinfer_python-0.2.5%2Bcu128torch2.7-cp38-abi3-linux_x86_64.whl
; \
else \
export TORCH_CUDA_ARCH_LIST='7.5 8.0 8.9 9.0+PTX'; \
fi
; \
CUDA_MAJOR="${CUDA_VERSION%%.*}";
\
if [ "$CUDA_MAJOR" -lt 12 ]; then
\
export FLASHINFER_ENABLE_SM90=0
; \
fi
; \
uv pip install --system --no-build-isolation "git+https://github.com/flashinfer-ai/flashinfer@21ea1d2545f74782b91eb8c08fd503ac4c0743fc" ;
\
CUDA_MAJOR="${CUDA_VERSION%%.*}"
; \
if [ "$CUDA_MAJOR" -lt 12 ]; then
\
export FLASHINFER_ENABLE_SM90=0;
\
fi
; \
uv pip install --system --no-build-isolation "git+https://github.com/flashinfer-ai/flashinfer@21ea1d2545f74782b91eb8c08fd503ac4c0743fc"
; \
fi
\
fi
COPY
examples examples
COPY
benchmarks benchmarks
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment