Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
flash-attention
Commits
5018ac6ac531aabdb05c8af1ba3d98a2235bdbde
Switch branch/tag
flash-attention
flash_attn
modules
embedding.py
18 Aug, 2023
1 commit
Run isort and black on python files
· f1a73d07
Tri Dao
authored
Aug 18, 2023
f1a73d07
16 Jan, 2023
1 commit
Reorder LN in Block, support OPT
· ff34123b
Tri Dao
authored
Jan 15, 2023
ff34123b
07 Jan, 2023
1 commit
[TP] Implement TensorParallel without sequence parallel
· 93383bd5
Tri Dao
authored
Jan 07, 2023
93383bd5
02 Jan, 2023
1 commit
[TP] Put parallel embeddings in separate modules
· 4cab4de5
Tri Dao
authored
Jan 02, 2023
4cab4de5
25 Dec, 2022
1 commit
Implement Tensor Parallel for GPT2Embeddings
· 78225c53
Tri Dao
authored
Dec 25, 2022
78225c53
19 Dec, 2022
1 commit
Implement BERT
· 5fb6df0e
Tri Dao
authored
Dec 18, 2022
5fb6df0e
14 Nov, 2022
1 commit
Add MLP, MHA, Block, Embedding modules
· d4b320b3
Tri Dao
authored
Nov 13, 2022
d4b320b3