gaoqiong / flash-attention · Commits · 71befc19e130ff65e9ad0f3113635c7a7ea9db60
History for flash_attn/modules/mha.py
28 Dec, 2022 (2 commits)

a6ec1782  Bump to v0.2.6 (Tri Dao, authored Dec 27, 2022)
63670fd8  Implement generation for GPT (Tri Dao, authored Dec 27, 2022)
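Commit 63670fd8 adds autoregressive decoding for the repo's GPT model. The commit's actual interface is not visible from this page; the snippet below is a minimal, hypothetical sketch of the core idea behind such a feature: caching each layer's keys and values so that every decoding step runs attention only for the newest token. All names here (`CachedSelfAttention`, `cache`) are illustrative, not the repo's API.

```python
# Hypothetical sketch of KV-cache decoding, not the repo's exact API.
import torch

class CachedSelfAttention(torch.nn.Module):
    def __init__(self, dim: int):
        super().__init__()
        self.qkv = torch.nn.Linear(dim, 3 * dim)

    def forward(self, x, cache=None):
        # x: (batch, new_len, dim); new_len is 1 during decoding.
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        if cache is not None:
            k = torch.cat([cache[0], k], dim=1)  # append new keys
            v = torch.cat([cache[1], v], dim=1)  # append new values
        # (causal masking during prefill omitted for brevity)
        attn = torch.softmax(q @ k.transpose(-2, -1) / q.shape[-1] ** 0.5, dim=-1)
        return attn @ v, (k, v)

layer = CachedSelfAttention(16)
x = torch.randn(2, 4, 16)        # prompt of length 4
out, cache = layer(x)            # prefill: cache now holds K/V for the prompt
step = torch.randn(2, 1, 16)     # one newly generated token
out, cache = layer(step, cache)  # decode step: attends over all 5 positions
```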
25 Dec, 2022 (1 commit)

1e712ea8  Implement TensorParallel for MHA (Tri Dao, authored Dec 24, 2022)
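Tensor parallelism for attention typically follows the Megatron-LM pattern: the Q/K/V projections are split column-wise across ranks so each rank owns a subset of heads, and the output projection is split row-wise, with an all-reduce summing the partial outputs. The sketch below simulates that split in a single process to show the partial sums recover the serial result; it assumes this standard layout rather than the commit's exact code.

```python
# Single-process simulation of Megatron-style tensor parallelism
# (illustrative; a real setup shards across torch.distributed ranks).
import torch

torch.manual_seed(0)
dim, world_size = 8, 2
x = torch.randn(3, dim)
w_v = torch.randn(dim, dim)    # value projection (Q/K shard the same way)
w_out = torch.randn(dim, dim)  # output projection

ref = (x @ w_v.t()) @ w_out.t()  # serial reference

# Column-parallel w_v (each rank owns a slice of heads) followed by
# row-parallel w_out; summing the partials plays the role of the all-reduce.
out = torch.zeros_like(ref)
shard = dim // world_size
for rank in range(world_size):
    v_r = x @ w_v[rank * shard:(rank + 1) * shard].t()          # (3, shard)
    out += v_r @ w_out[:, rank * shard:(rank + 1) * shard].t()  # partial sum
assert torch.allclose(out, ref, atol=1e-5)
```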
23 Dec, 2022 (1 commit)

e68ebbe8  Simplify FusedDense (Tri Dao, authored Dec 22, 2022)
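FusedDense-style layers fuse the matmul, bias add, and (in the GELU variant) the activation into fewer GPU kernel launches. As a reference point only, here is the unfused computation such a layer is expected to match, written in plain PyTorch; the function name and argument layout are assumptions for illustration.

```python
# Unfused reference for a fused dense -> GELU -> dense pair (sketch only;
# the fused kernel computes the same math with fewer kernel launches).
import torch
import torch.nn.functional as F

def dense_gelu_dense_ref(x, w1, b1, w2, b2):
    """y = GELU(x @ w1.T + b1) @ w2.T + b2."""
    return F.linear(F.gelu(F.linear(x, w1, b1)), w2, b2)

x = torch.randn(4, 16)
w1, b1 = torch.randn(64, 16), torch.randn(64)
w2, b2 = torch.randn(16, 64), torch.randn(16)
y = dense_gelu_dense_ref(x, w1, b1, w2, b2)  # (4, 16)
```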
21 Dec, 2022 (1 commit)

496e4f52  Implement XPos (Sun et al.) (Tri Dao, authored Dec 21, 2022)
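XPos (Sun et al., 2022, "A Length-Extrapolatable Transformer") extends rotary embeddings with a per-dimension exponential scale: queries are multiplied by the scale and keys by its reciprocal, so the q·k product picks up a decay that depends only on relative distance. The sketch below uses the constants (0.4, scale_base = 512) found in common open-source implementations; treat the exact values and normalization as assumptions, not the commit's verbatim code.

```python
# Sketch of the XPos scale on top of rotary embeddings (Sun et al., 2022).
# Constants follow common implementations; an assumption, not the repo's code.
import torch

def xpos_scale(seq_len: int, dim: int, scale_base: int = 512):
    # Per-dimension base scale in (0, 1]; lower dimensions decay faster.
    base = (torch.arange(0, dim, 2, dtype=torch.float32) + 0.4 * dim) / (1.4 * dim)
    power = torch.arange(seq_len, dtype=torch.float32) / scale_base
    return base[None, :] ** power[:, None]  # (seq_len, dim // 2)

scale = xpos_scale(seq_len=8, dim=16)
# After the rotary rotation, queries are multiplied by `scale` and keys by
# `1 / scale`; scale[m] / scale[n] depends only on (m - n), giving the
# relative-distance decay that helps length extrapolation.
```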
20 Dec, 2022 (1 commit)

13cdceb3  Implement last_layer_subset optimization for BERT (Tri Dao, authored Dec 19, 2022)
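The idea behind a last-layer-subset optimization for BERT masked-language-model pretraining: the MLM head only needs final hidden states at the masked positions, so the last layer can compute attention queries for just those tokens while keys and values still cover the full sequence. The sketch below illustrates the shapes with projections omitted; indices and dimensions are hypothetical, not the repo's interface.

```python
# Shape-level sketch of last-layer-subset attention (Q/K/V projections
# omitted for brevity; positions and sizes are hypothetical).
import torch

batch, seq_len, dim = 2, 128, 64
hidden = torch.randn(batch, seq_len, dim)        # input to the last layer
masked_idx = torch.tensor([[3, 17, 42], [5, 9, 88]])

# Queries only at masked positions: (batch, n_masked, dim).
q_subset = torch.gather(hidden, 1, masked_idx[..., None].expand(-1, -1, dim))

attn = torch.softmax(q_subset @ hidden.transpose(1, 2) / dim ** 0.5, dim=-1)
out = attn @ hidden                              # (batch, 3, dim)
# The MLM head then runs on `out` only: 3 rows per example instead of 128.
```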
19 Dec, 2022 (1 commit)

5fb6df0e  Implement BERT (Tri Dao, authored Dec 18, 2022)
14 Nov, 2022 (1 commit)

d4b320b3  Add MLP, MHA, Block, Embedding modules (Tri Dao, authored Nov 13, 2022)
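This initial commit introduces the building blocks (MLP, MHA, Block, Embedding) that the later model implementations compose. As a rough picture of how such pieces fit together, here is a generic pre-norm transformer block in plain PyTorch; the class names mirror the commit message, but the signatures and the use of nn.MultiheadAttention are stand-ins, not the actual flash_attn modules.

```python
# Hypothetical composition of Embedding -> Block(MHA + MLP); names mirror
# the commit, but signatures are stand-ins for the real flash_attn modules.
import torch
from torch import nn

class Block(nn.Module):
    def __init__(self, dim: int, num_heads: int):
        super().__init__()
        self.norm1 = nn.LayerNorm(dim)
        self.mha = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.norm2 = nn.LayerNorm(dim)
        self.mlp = nn.Sequential(
            nn.Linear(dim, 4 * dim), nn.GELU(), nn.Linear(4 * dim, dim)
        )

    def forward(self, x):
        h = self.norm1(x)
        x = x + self.mha(h, h, h, need_weights=False)[0]  # attention + residual
        return x + self.mlp(self.norm2(x))                # MLP + residual

emb = nn.Embedding(1000, 64)            # token embedding
block = Block(dim=64, num_heads=4)
tokens = torch.randint(0, 1000, (2, 16))
y = block(emb(tokens))                  # (2, 16, 64)
```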