gaoqiong / flash-attention · Commits
Commit d3e64409588a243393fa9b7be2045d604efbdfe0
Path: csrc/flash_attn/fmha_api.cpp
12 Jun, 2022 (2 commits)

Implement bwd for head dim 128 · d3e64409
Tri Dao authored Jun 11, 2022

Implement fwd for head dim 128 · 0d854692
Tri Dao authored Jun 05, 2022
04 Jun, 2022 (1 commit)

Set block size of SM75 fwd to 256 if there's no dropout · 321c57d0
Tri Dao authored Jun 04, 2022
This speeds up the fwd by 1.5x.
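For illustration, a minimal C++ sketch of the kind of launch-parameter dispatch this commit message describes. Only the SM75 / no-dropout / 256 case comes from the commit itself; the function name select_fwd_block_size and the 128 fallback are hypothetical, not the repository's actual code.

#include <cstdio>

// Hypothetical block-size picker: on SM75 (Turing), the forward pass uses a
// block size of 256 when dropout is disabled, which the commit reports as a
// ~1.5x fwd speedup. The 128 fallback is an assumption for other configs.
int select_fwd_block_size(int sm_arch, bool has_dropout) {
    if (sm_arch == 75 && !has_dropout) {
        return 256;  // larger tile when no dropout state must be kept
    }
    return 128;      // assumed default for other architectures / dropout on
}

int main() {
    std::printf("SM75, no dropout -> block %d\n", select_fwd_block_size(75, false));
    std::printf("SM75, dropout    -> block %d\n", select_fwd_block_size(75, true));
    std::printf("SM80, no dropout -> block %d\n", select_fwd_block_size(80, false));
    return 0;
}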
03 Jun, 2022 (1 commit)

Support Turing mma instructions · 2712aa4c
Tri Dao authored Jun 02, 2022
26 May, 2022 (1 commit)

Rename, add benchmarking script · 9dbc491a
Tri Dao authored May 26, 2022
20 May, 2022 (1 commit)

First release · 1fcbe6f0
Tri Dao authored May 20, 2022