Unverified Commit 569c252d authored by Charlene Yang's avatar Charlene Yang Committed by GitHub
Browse files

[PyTorch] Reduce FA versions in L3 CI (#1775)



* reduce FA versions to make CI leaner
Signed-off-by: default avatarCharlene Yang <8636796+cyanguwa@users.noreply.github.com>

* improve build speed
Signed-off-by: default avatarCharlene Yang <8636796+cyanguwa@users.noreply.github.com>

* add FA env var for all archs
Signed-off-by: default avatarCharlene Yang <8636796+cyanguwa@users.noreply.github.com>

---------
Signed-off-by: default avatarCharlene Yang <8636796+cyanguwa@users.noreply.github.com>
parent cfbbfb89
......@@ -11,15 +11,17 @@ mkdir -p "$XML_LOG_DIR"
pip3 install pytest==8.2.1
# Limit parallel build jobs to avoid overwhelming system resources
export MAX_JOBS=4
export MAX_JOBS=32
# Iterate over Flash Attention versions
sm_arch=`python3 -c "import torch; sm = torch.cuda.get_device_capability(0); print(sm[0]*10+sm[1])"`
export FLASH_ATTN_CUDA_ARCHS=$sm_arch
if [ $sm_arch -gt 90 ]
then
FA_versions=(2.7.3)
else
FA_versions=(2.3.0 2.4.1 2.5.7 2.7.3 3.0.0b1)
elif [ $sm_arch -eq 90 ]
then
FA_versions=(2.5.7 2.7.3 3.0.0b1)
fi
for fa_version in "${FA_versions[@]}"
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment