"...git@developer.sourcefind.cn:chenpangpang/open-webui.git" did not exist on "587101da88fe59039882ff10835e6aa2fd2cb9f2"
Commit 116b05f9 authored by Tri Dao

[CI] Compile with pytorch 2.4.0.dev20240514

parent da11d1b8
@@ -44,7 +44,7 @@ jobs:
           # manylinux docker image, but I haven't figured out how to install CUDA on manylinux.
           os: [ubuntu-20.04]
           python-version: ['3.8', '3.9', '3.10', '3.11', '3.12']
-          torch-version: ['2.0.1', '2.1.2', '2.2.2', '2.3.1', '2.4.0.dev20240512']
+          torch-version: ['2.0.1', '2.1.2', '2.2.2', '2.3.1', '2.4.0.dev20240514']
           cuda-version: ['11.8.0', '12.2.2']
           # We need separate wheels that either uses C++11 ABI (-D_GLIBCXX_USE_CXX11_ABI) or not.
           # Pytorch wheels currently don't use it, but nvcr images have Pytorch compiled with C++11 ABI.
...
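For reference, the matrix above expands to the cross product of its axes; here is a quick sketch (plain Python, mine, not repo code) of the job count it implies, assuming one wheel per C++11-ABI setting as the comments describe and ignoring any matrix excludes the workflow may define:

# Sketch only (not from the repo): count the wheel-build combinations the
# matrix above expands to. The cxx11_abi axis is inferred from the ABI
# comments; real `exclude:` entries in the workflow would shrink this number.
from itertools import product

os_list = ["ubuntu-20.04"]
python_versions = ["3.8", "3.9", "3.10", "3.11", "3.12"]
torch_versions = ["2.0.1", "2.1.2", "2.2.2", "2.3.1", "2.4.0.dev20240514"]
cuda_versions = ["11.8.0", "12.2.2"]
cxx11_abi = ["FALSE", "TRUE"]

jobs = list(product(os_list, python_versions, torch_versions, cuda_versions, cxx11_abi))
print(len(jobs))  # 1 * 5 * 5 * 2 * 2 = 100 combinations before excludes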
__version__ = "2.6.0" __version__ = "2.6.0.post1"
from flash_attn.flash_attn_interface import ( from flash_attn.flash_attn_interface import (
flash_attn_func, flash_attn_func,
......
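The re-exported flash_attn_func is the package's main entry point; a minimal usage sketch (mine, not part of this diff), assuming the documented (batch, seqlen, nheads, headdim) layout, fp16/bf16 dtypes, and a CUDA device:

# Usage sketch (not from this commit). flash-attn requires a CUDA GPU and
# fp16/bf16 tensors shaped (batch, seqlen, nheads, headdim).
import torch
from flash_attn import flash_attn_func, __version__

print(__version__)  # "2.6.0.post1" once this release is installed

q = torch.randn(2, 1024, 16, 64, device="cuda", dtype=torch.float16)
k = torch.randn_like(q)
v = torch.randn_like(q)
out = flash_attn_func(q, k, v, causal=True)  # same shape as q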
@@ -85,7 +85,7 @@ RUN pip install transformers==4.25.1 datasets==2.8.0 pytorch-lightning==1.8.6 tr
 RUN pip install git+https://github.com/mlcommons/logging.git@2.1.0
 # Install FlashAttention
-RUN pip install flash-attn==2.6.0
+RUN pip install flash-attn==2.6.0.post1
 # Install CUDA extensions for fused dense
-RUN pip install git+https://github.com/HazyResearch/flash-attention@v2.6.0#subdirectory=csrc/fused_dense_lib
+RUN pip install git+https://github.com/HazyResearch/flash-attention@v2.6.0.post1#subdirectory=csrc/fused_dense_lib
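A hedged sanity check for the pins above (a sketch, not part of the Dockerfile), runnable inside the built image:

# Sketch (not in the Dockerfile): quick checks that both pinned installs
# resolved inside the built image.
import flash_attn
assert flash_attn.__version__ == "2.6.0.post1", flash_attn.__version__

# Module name assumed from csrc/fused_dense_lib's setup; verify against the repo.
import fused_dense_lib  # noqa: F401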