gaoqiong / flash-attention · Commits
History for tests/modules at commit 0e8c46ae082088de7815e22935d2fba41f15a92b
19 Aug, 2023 · 1 commit

Run isort and black on test files · 0e8c46ae
Tri Dao authored Aug 18, 2023
26 Jul, 2023 · 1 commit

Implement ParallelGatedMlp (#251) · 8ee62efc
Haodong Lyu authored Jul 27, 2023
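For context on what this commit adds: a gated MLP computes fc2(act(gate(x)) * up(x)), and a tensor-parallel variant shards the gate/up projection column-wise and fc2 row-wise across a process group. Below is a minimal PyTorch sketch of that idea; the class name, constructor arguments, and process-group handling are this example's assumptions, not the repo's actual ParallelGatedMlp API.

    import torch.nn as nn
    import torch.nn.functional as F
    import torch.distributed as dist

    class GatedMlpSketch(nn.Module):
        # Gated MLP: out = fc2(act(gate(x)) * up(x)).
        # Tensor-parallel variant: fc1 (gate + up, fused) is column-parallel,
        # fc2 is row-parallel and its partial sums are all-reduced.
        def __init__(self, dim, hidden_dim, process_group=None, act=F.silu):
            super().__init__()
            self.process_group = process_group
            world = 1 if process_group is None else process_group.size()
            assert hidden_dim % world == 0
            local = hidden_dim // world           # this rank's slice of the hidden dim
            self.fc1 = nn.Linear(dim, 2 * local)  # gate and up projections, fused
            self.fc2 = nn.Linear(local, dim)
            self.act = act

        def forward(self, x):
            gate, up = self.fc1(x).chunk(2, dim=-1)
            y = self.fc2(self.act(gate) * up)     # partial sum on each rank
            if self.process_group is not None:
                dist.all_reduce(y, group=self.process_group)
            return y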
18 Jan, 2023 · 1 commit

[FusedDense] Support relu, rename FusedDenseGeluDense -> FusedMLP · 88173a1a
Tri Dao authored Jan 17, 2023
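The fused module relies on custom GPU kernels that fold bias and activation into the matmul epilogue; functionally, the rename reflects that the block is now a generic MLP with a selectable activation rather than a GELU-only one. A plain PyTorch stand-in (ignoring the fusion; names invented for this sketch):

    import torch.nn as nn
    import torch.nn.functional as F

    class MlpSketch(nn.Module):
        # Functional equivalent of a fused dense-act-dense block; the real
        # module fuses bias + activation into the GEMM epilogue on GPU.
        def __init__(self, dim, hidden_dim, activation="gelu"):
            super().__init__()
            assert activation in ("gelu", "relu")
            self.fc1 = nn.Linear(dim, hidden_dim)
            self.fc2 = nn.Linear(hidden_dim, dim)
            self.act = F.gelu if activation == "gelu" else F.relu

        def forward(self, x):
            return self.fc2(self.act(self.fc1(x)))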
07 Jan, 2023 · 1 commit

[TP] Implement TensorParallel without sequence parallel · 93383bd5
Tri Dao authored Jan 07, 2023
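"Without sequence parallel" means activations stay fully replicated on every rank, so each row-parallel layer ends in a plain all-reduce rather than the reduce-scatter/all-gather pair that sequence parallelism would use. A hedged sketch of the two standard building blocks (names and signatures are this example's, not the repo's):

    import torch.nn as nn
    import torch.distributed as dist

    class ColumnParallelLinear(nn.Module):
        # Shards the output dimension; every rank sees the full input and
        # produces out_features // world_size of the output columns.
        def __init__(self, in_features, out_features, process_group=None):
            super().__init__()
            world = 1 if process_group is None else process_group.size()
            assert out_features % world == 0
            self.linear = nn.Linear(in_features, out_features // world)

        def forward(self, x):
            return self.linear(x)  # output sharded along the last dim

    class RowParallelLinear(nn.Module):
        # Shards the input dimension; each rank computes a partial sum and
        # a single all-reduce restores the full output on every rank.
        def __init__(self, in_features, out_features, process_group=None):
            super().__init__()
            self.process_group = process_group
            world = 1 if process_group is None else process_group.size()
            assert in_features % world == 0
            self.linear = nn.Linear(in_features // world, out_features)

        def forward(self, x_shard):
            y = self.linear(x_shard)
            if self.process_group is not None:
                dist.all_reduce(y, group=self.process_group)
            return y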
27 Dec, 2022 · 1 commit

Implement Tensor Parallel for GPT model · b4018a50
Tri Dao authored Dec 25, 2022
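At the full-model level, tensor parallelism is mostly composition of the parallel pieces introduced in the commits below: parallel embeddings, a stack of parallel blocks, and a column-parallel LM head. In one common design (assumed here, not confirmed from this commit), the head's logits stay sharded over the vocabulary so no rank ever materializes the full logit tensor. A schematic composition sketch, with all module names invented:

    import torch.nn as nn

    class ParallelGPTSketch(nn.Module):
        # Composition only: `embed`, `blocks`, and `lm_head` are assumed to
        # be tensor-parallel modules built over the same process group.
        def __init__(self, embed, blocks, norm, lm_head):
            super().__init__()
            self.embed = embed            # e.g. a vocab-parallel embedding
            self.blocks = nn.ModuleList(blocks)
            self.norm = norm
            self.lm_head = lm_head        # column-parallel: logits stay sharded

        def forward(self, ids):
            x = self.embed(ids)
            for block in self.blocks:
                x = block(x)
            return self.lm_head(self.norm(x))  # (batch, seq, vocab // world_size)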
25 Dec, 2022 · 3 commits

Implement Tensor Parallel for GPT2Embeddings · 78225c53
Tri Dao authored Dec 25, 2022
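One common way to parallelize an embedding table, and plausibly what a commit like this does (the sketch below is an assumption, not the repo's code), is to shard the vocabulary across ranks: each rank embeds only the token ids in its slice, zeroes the rest, and an all-reduce sums the slices back together.

    import torch.nn as nn
    import torch.distributed as dist

    class VocabParallelEmbeddingSketch(nn.Module):
        # Each rank owns a contiguous slice of the vocabulary. Ids outside
        # the local slice embed to zero; the all-reduce combines the slices.
        def __init__(self, vocab_size, dim, process_group=None):
            super().__init__()
            self.process_group = process_group
            world = 1 if process_group is None else process_group.size()
            rank = 0 if process_group is None else process_group.rank()
            assert vocab_size % world == 0
            self.local_vocab = vocab_size // world
            self.start = rank * self.local_vocab
            self.embed = nn.Embedding(self.local_vocab, dim)

        def forward(self, ids):
            mask = (ids >= self.start) & (ids < self.start + self.local_vocab)
            local_ids = (ids - self.start).clamp(0, self.local_vocab - 1)
            out = self.embed(local_ids) * mask.unsqueeze(-1)  # zero out foreign ids
            if self.process_group is not None:
                dist.all_reduce(out, group=self.process_group)
            return out

GPT2Embeddings also carries learned position embeddings; those are small enough that they can simply stay replicated on every rank, or be sharded along the embedding dimension instead.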
Implement Tensor Parallel for transformer Block · a8cfe515
Tri Dao authored Dec 25, 2022
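The payoff of parallelizing the block as a whole is the communication count: if the attention mixer and the MLP each end in a row-parallel projection, one block needs exactly two all-reduces in the forward pass, and the residual stream stays full-size and replicated on every rank. A schematic sketch, with module names invented for illustration:

    import torch.nn as nn

    class ParallelBlockSketch(nn.Module):
        # Pre-norm block; `mixer` and `mlp` are assumed to be tensor-parallel
        # modules that each finish with a row-parallel linear + all-reduce,
        # so the residual stream is replicated on every rank.
        def __init__(self, dim, mixer, mlp):
            super().__init__()
            self.norm1 = nn.LayerNorm(dim)
            self.norm2 = nn.LayerNorm(dim)
            self.mixer = mixer
            self.mlp = mlp

        def forward(self, x):
            x = x + self.mixer(self.norm1(x))  # all-reduce no. 1, inside the mixer
            x = x + self.mlp(self.norm2(x))    # all-reduce no. 2, inside the MLP
            return x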
Implement TensorParallel for MHA · 1e712ea8
Tri Dao authored Dec 24, 2022
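For multi-head attention, the natural tensor-parallel split is by head: the QKV projection is column-parallel so each rank computes num_heads // world_size heads, attention runs entirely locally on those heads, and the output projection is row-parallel with a closing all-reduce. A self-contained PyTorch sketch of that scheme follows; it is illustrative only, not the repo's actual MHA module.

    import torch.nn as nn
    import torch.nn.functional as F
    import torch.distributed as dist

    class ParallelMHASketch(nn.Module):
        # Heads are sharded across ranks: column-parallel QKV, local
        # attention over this rank's heads, row-parallel output projection.
        def __init__(self, dim, num_heads, process_group=None):
            super().__init__()
            self.process_group = process_group
            world = 1 if process_group is None else process_group.size()
            assert num_heads % world == 0 and dim % num_heads == 0
            self.local_heads = num_heads // world
            self.head_dim = dim // num_heads
            local_dim = self.local_heads * self.head_dim
            self.Wqkv = nn.Linear(dim, 3 * local_dim)
            self.out_proj = nn.Linear(local_dim, dim)

        def forward(self, x):  # x: (batch, seqlen, dim)
            b, s, _ = x.shape
            qkv = self.Wqkv(x).view(b, s, 3, self.local_heads, self.head_dim)
            q, k, v = (t.transpose(1, 2) for t in qkv.unbind(dim=2))
            y = F.scaled_dot_product_attention(q, k, v)  # (b, h_local, s, d)
            y = y.transpose(1, 2).reshape(b, s, -1)
            y = self.out_proj(y)                         # partial sum per rank
            if self.process_group is not None:
                dist.all_reduce(y, group=self.process_group)
            return y

The real module would call this repo's flash-attention kernel rather than F.scaled_dot_product_attention; the sharding and communication pattern is the point of the sketch.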