gaoqiong / flash-attention · Commits · bff31471

Commit bff31471, authored Sep 21, 2023 by Tri Dao

Re-enable compilation for Hopper

parent 187c2a06
Showing 3 changed files, with 7 additions and 7 deletions:

flash_attn/__init__.py   +1 -1
setup.py                 +4 -4
training/Dockerfile      +2 -2
flash_attn/__init__.py

-__version__ = "2.2.4"
+__version__ = "2.2.4.post1"
 from flash_attn.flash_attn_interface import (
     flash_attn_func,
 ...
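For context on the import shown above: flash_attn_func is the main attention entry point re-exported by this __init__.py. A minimal usage sketch, assuming fp16 tensors of shape (batch, seqlen, nheads, headdim) on a CUDA device as documented for the 2.2.x interface (shapes and defaults here are assumptions from that documentation, not from this diff):

    # Minimal sketch of calling flash_attn_func (flash-attn 2.2.x interface).
    import torch
    from flash_attn import flash_attn_func

    batch, seqlen, nheads, headdim = 2, 1024, 16, 64
    q = torch.randn(batch, seqlen, nheads, headdim, dtype=torch.float16, device="cuda")
    k = torch.randn_like(q)
    v = torch.randn_like(q)

    # Causal self-attention; the output has the same shape as q.
    out = flash_attn_func(q, k, v, dropout_p=0.0, causal=True)
    print(out.shape)  # torch.Size([2, 1024, 16, 64])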
setup.py

@@ -122,10 +122,10 @@ if not SKIP_CUDA_BUILD:
     # cc_flag.append("arch=compute_75,code=sm_75")
     cc_flag.append("-gencode")
     cc_flag.append("arch=compute_80,code=sm_80")
-    # if CUDA_HOME is not None:
-    #     if bare_metal_version >= Version("11.8"):
-    #         cc_flag.append("-gencode")
-    #         cc_flag.append("arch=compute_90,code=sm_90")
+    if CUDA_HOME is not None:
+        if bare_metal_version >= Version("11.8"):
+            cc_flag.append("-gencode")
+            cc_flag.append("arch=compute_90,code=sm_90")
     # HACK: The compiler flag -D_GLIBCXX_USE_CXX11_ABI is set to be the same as
     # torch._C._GLIBCXX_USE_CXX11_ABI
 ...
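The uncommented block is what re-enables Hopper: nvcc only emits sm_90 code when given the matching -gencode pair, and the Version("11.8") guard reflects that CUDA 11.8 is the first toolkit able to target sm_90. For readers unfamiliar with this build script, a sketch of how such flags are typically consumed (the guard and flag values come from the diff; the CUDAExtension wiring and the example_ext names are illustrative assumptions about the rest of setup.py, not shown in this hunk):

    # Sketch of how -gencode flags like those above typically reach nvcc;
    # the CUDAExtension wiring is an assumption about the rest of setup.py.
    from setuptools import setup
    from torch.utils.cpp_extension import BuildExtension, CUDAExtension

    cc_flag = []
    cc_flag.append("-gencode")
    cc_flag.append("arch=compute_80,code=sm_80")  # Ampere
    cc_flag.append("-gencode")
    cc_flag.append("arch=compute_90,code=sm_90")  # Hopper, re-enabled by this commit

    setup(
        name="example_ext",  # hypothetical name for illustration
        ext_modules=[
            CUDAExtension(
                name="example_ext",
                sources=["example.cpp", "example_kernel.cu"],  # hypothetical sources
                extra_compile_args={"cxx": ["-O3"], "nvcc": ["-O3"] + cc_flag},
            )
        ],
        cmdclass={"build_ext": BuildExtension},
    )

Since the flags are plain strings handed to nvcc, targeting another architecture is just a matter of appending another -gencode pair under the appropriate toolkit-version guard.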
training/Dockerfile

@@ -85,11 +85,11 @@ RUN pip install transformers==4.25.1 datasets==2.8.0 pytorch-lightning==1.8.6 tr
 RUN pip install git+https://github.com/mlcommons/logging.git@2.1.0

 # Install FlashAttention
-RUN pip install flash-attn==2.2.4
+RUN pip install flash-attn==2.2.4.post1

 # Install CUDA extensions for fused dense, layer norm
 RUN git clone https://github.com/HazyResearch/flash-attention \
-    && cd flash-attention && git checkout v2.2.4 \
+    && cd flash-attention && git checkout v2.2.4.post1 \
     && cd csrc/layer_norm && pip install . && cd ../../ \
     && cd csrc/fused_dense_lib && pip install . && cd ../../ \
     && cd .. && rm -rf flash-attention
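A quick post-build sanity check that the bumped wheel actually landed and that the visible GPU is covered by the architectures setup.py now targets (a hedged sketch using standard torch calls; it is not part of this Dockerfile):

    # Run inside the built image: confirm the .post1 wheel and a supported GPU.
    import torch
    import flash_attn

    print(flash_attn.__version__)  # expected: 2.2.4.post1

    # sm_80 cubins run on any sm_8x device; sm_90 covers Hopper.
    major, minor = torch.cuda.get_device_capability()
    assert major >= 8, f"build targets sm_80/sm_90, but this GPU is sm_{major}{minor}"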