Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
59a0b855
Unverified
Commit
59a0b855
authored
Aug 06, 2025
by
youkaichao
Committed by
GitHub
Aug 06, 2025
Browse files
[bugfix] fix blackwell deepep installation (#22255)
parent
469b3ffa
Changes
2
Show whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
12 additions
and
6 deletions
+12
-6
tools/ep_kernels/README.md
tools/ep_kernels/README.md
+5
-5
tools/ep_kernels/install_python_libraries.sh
tools/ep_kernels/install_python_libraries.sh
+7
-1
No files found.
tools/ep_kernels/README.md
View file @
59a0b855
...
@@ -13,16 +13,16 @@ All scripts accept a positional argument as workspace path for staging the build
...
@@ -13,16 +13,16 @@ All scripts accept a positional argument as workspace path for staging the build
## Usage
## Usage
### Single-node
```
bash
```
bash
bash install_python_libraries.sh
# for hopper
TORCH_CUDA_ARCH_LIST
=
"9.0"
bash install_python_libraries.sh
# for blackwell
TORCH_CUDA_ARCH_LIST
=
"10.0"
bash install_python_libraries.sh
```
```
### Multi-node
Additional step for multi-node deployment:
```
bash
```
bash
bash install_python_libraries.sh
sudo
bash configure_system_drivers.sh
sudo
bash configure_system_drivers.sh
sudo
reboot
# Reboot is required to load the new driver
sudo
reboot
# Reboot is required to load the new driver
```
```
tools/ep_kernels/install_python_libraries.sh
View file @
59a0b855
...
@@ -29,6 +29,12 @@ if [ -z "$CUDA_HOME" ]; then
...
@@ -29,6 +29,12 @@ if [ -z "$CUDA_HOME" ]; then
exit
1
exit
1
fi
fi
# assume TORCH_CUDA_ARCH_LIST is set correctly
if
[
-z
"
$TORCH_CUDA_ARCH_LIST
"
]
;
then
echo
"TORCH_CUDA_ARCH_LIST is not set, please set it to your desired architecture."
exit
1
fi
# disable all features except IBGDA
# disable all features except IBGDA
export
NVSHMEM_IBGDA_SUPPORT
=
1
export
NVSHMEM_IBGDA_SUPPORT
=
1
...
@@ -95,7 +101,7 @@ clone_repo "https://github.com/ppl-ai/pplx-kernels" "pplx-kernels" "setup.py"
...
@@ -95,7 +101,7 @@ clone_repo "https://github.com/ppl-ai/pplx-kernels" "pplx-kernels" "setup.py"
cd
pplx-kernels
cd
pplx-kernels
# see https://github.com/pypa/pip/issues/9955#issuecomment-838065925
# see https://github.com/pypa/pip/issues/9955#issuecomment-838065925
# PIP_NO_BUILD_ISOLATION=0 disables build isolation
# PIP_NO_BUILD_ISOLATION=0 disables build isolation
PIP_NO_BUILD_ISOLATION
=
0
TORCH_CUDA_ARCH_LIST
=
9.0a+PTX
pip
install
-vvv
-e
.
PIP_NO_BUILD_ISOLATION
=
0 pip
install
-vvv
-e
.
popd
popd
# build and install deepep, require pytorch installed
# build and install deepep, require pytorch installed
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment