Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
4b9a9435
Unverified
Commit
4b9a9435
authored
Jul 11, 2025
by
Michael Goin
Committed by
GitHub
Jul 10, 2025
Browse files
Update Dockerfile FlashInfer to v0.2.8rc1 (#20718)
Signed-off-by:
mgoin
<
mgoin64@gmail.com
>
parent
3482fd7e
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
5 additions
and
2 deletions
+5
-2
docker/Dockerfile
docker/Dockerfile
+5
-2
No files found.
docker/Dockerfile
View file @
4b9a9435
...
...
@@ -379,12 +379,15 @@ RUN --mount=type=bind,from=build,src=/workspace/dist,target=/vllm-workspace/dist
ARG
FLASHINFER_CUDA128_INDEX_URL="https://download.pytorch.org/whl/cu128/flashinfer"
ARG
FLASHINFER_CUDA128_WHEEL="flashinfer_python-0.2.6.post1%2Bcu128torch2.7-cp39-abi3-linux_x86_64.whl"
ARG
FLASHINFER_GIT_REPO="https://github.com/flashinfer-ai/flashinfer.git"
ARG
FLASHINFER_GIT_REF="v0.2.6.post1"
ARG
FLASHINFER_GIT_REF="v0.2.8rc1"
# Flag to control whether to use pre-built FlashInfer wheels (set to false to force build from source)
# TODO: Currently disabled because the pre-built wheels are not available for FLASHINFER_GIT_REF
ARG
USE_FLASHINFER_PREBUILT_WHEEL=false
RUN
--mount
=
type
=
cache,target
=
/root/.cache/uv bash -
<<
'
BASH
'
. /etc/environment
if [ "$TARGETPLATFORM" != "linux/arm64" ]; then
# FlashInfer already has a wheel for PyTorch 2.7.0 and CUDA 12.8. This is enough for CI use
if [[ "$CUDA_VERSION" == 12.8* ]]; then
if [[ "$CUDA_VERSION" == 12.8*
]] && [[ "$USE_FLASHINFER_PREBUILT_WHEEL" == "true"
]]; then
uv pip install --system ${FLASHINFER_CUDA128_INDEX_URL}/${FLASHINFER_CUDA128_WHEEL}
else
# Exclude CUDA arches for older versions (11.x and 12.0-12.7)
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment