Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
9007bf57
Unverified
Commit
9007bf57
authored
Oct 28, 2025
by
Simon Mo
Committed by
GitHub
Oct 28, 2025
Browse files
Revert "Install pre-built xformers-0.0.32.post2 built with pt-2.9.0" (#27714)
parent
f2575447
Changes
2
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
8 additions
and
1 deletion
+8
-1
docker/Dockerfile
docker/Dockerfile
+7
-0
requirements/cuda.txt
requirements/cuda.txt
+1
-1
No files found.
docker/Dockerfile
View file @
9007bf57
...
@@ -361,6 +361,13 @@ RUN --mount=type=bind,from=build,src=/workspace/dist,target=/vllm-workspace/dist
...
@@ -361,6 +361,13 @@ RUN --mount=type=bind,from=build,src=/workspace/dist,target=/vllm-workspace/dist
&& uv pip install --system dist/*.whl --verbose \
&& uv pip install --system dist/*.whl --verbose \
--extra-index-url ${PYTORCH_CUDA_INDEX_BASE_URL}/cu$(echo $CUDA_VERSION | cut -d. -f1,2 | tr -d '.')
--extra-index-url ${PYTORCH_CUDA_INDEX_BASE_URL}/cu$(echo $CUDA_VERSION | cut -d. -f1,2 | tr -d '.')
# TODO (huydhn): Remove this once xformers is released for 2.9.0
RUN
--mount
=
type
=
cache,target
=
/root/.cache/uv bash -
<<
'
BASH
'
. /etc/environment
export TORCH_CUDA_ARCH_LIST='7.5 8.0+PTX 9.0a'
uv pip install --system --no-build-isolation "git+https://github.com/facebookresearch/xformers@v0.0.32.post2"
BASH
# Install FlashInfer pre-compiled kernel cache and binaries
# Install FlashInfer pre-compiled kernel cache and binaries
# https://docs.flashinfer.ai/installation.html
# https://docs.flashinfer.ai/installation.html
RUN
--mount
=
type
=
cache,target
=
/root/.cache/uv
\
RUN
--mount
=
type
=
cache,target
=
/root/.cache/uv
\
...
...
requirements/cuda.txt
View file @
9007bf57
...
@@ -10,7 +10,7 @@ torchaudio==2.9.0
...
@@ -10,7 +10,7 @@ torchaudio==2.9.0
# These must be updated alongside torch
# These must be updated alongside torch
torchvision==0.24.0 # Required for phi3v processor. See https://github.com/pytorch/vision?tab=readme-ov-file#installation for corresponding version
torchvision==0.24.0 # Required for phi3v processor. See https://github.com/pytorch/vision?tab=readme-ov-file#installation for corresponding version
# https://github.com/facebookresearch/xformers/releases/tag/v0.0.32.post1
# https://github.com/facebookresearch/xformers/releases/tag/v0.0.32.post1
xformers==0.0.3
3+5d4b92a5.d20251026
; platform_system == 'Linux' and platform_machine == 'x86_64' # Requires PyTorch >= 2.
9
#
xformers==0.0.3
2.post1
; platform_system == 'Linux' and platform_machine == 'x86_64' # Requires PyTorch >= 2.
8
# FlashInfer should be updated together with the Dockerfile
# FlashInfer should be updated together with the Dockerfile
flashinfer-python==0.4.1
flashinfer-python==0.4.1
# Triton Kernels are needed for mxfp4 fused moe. (Should be updated alongside torch)
# Triton Kernels are needed for mxfp4 fused moe. (Should be updated alongside torch)
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment