Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
flash-attention
Commits
1e712ea8b07850b0875c12afcfa4ddb67ed08c43
Switch branch/tag
flash-attention
flash_attn
modules
mha.py
25 Dec, 2022
1 commit
Implement TensorParallel for MHA
· 1e712ea8
Tri Dao
authored
Dec 24, 2022
1e712ea8
23 Dec, 2022
1 commit
Simplify FusedDense
· e68ebbe8
Tri Dao
authored
Dec 22, 2022
e68ebbe8
21 Dec, 2022
1 commit
Implement XPos (Sun et al.)
· 496e4f52
Tri Dao
authored
Dec 21, 2022
496e4f52
20 Dec, 2022
1 commit
Implement last_layer_subset optimization for BERT
· 13cdceb3
Tri Dao
authored
Dec 19, 2022
13cdceb3
19 Dec, 2022
1 commit
Implement BERT
· 5fb6df0e
Tri Dao
authored
Dec 18, 2022
5fb6df0e
14 Nov, 2022
1 commit
Add MLP, MHA, Block, Embedding modules
· d4b320b3
Tri Dao
authored
Nov 13, 2022
d4b320b3