gaoqiong
flash-attention
Commits
ada4710d70c7d6a657c6725614726219cd1b4efc
flash-attention / flash_attn / utils / distributed.py
07 Jan, 2023
1 commit
[TP] Implement TensorParallel without sequence parallel · 93383bd5
Tri Dao authored Jan 07, 2023
27 Dec, 2022
2 commits
Tweak CrossEntropyLoss to take process_group in init · c6ecd40a
Tri Dao authored Dec 27, 2022
Implement Tensor Parallel for GPT model · b4018a50
Tri Dao authored Dec 25, 2022
24 Dec, 2022
1 commit
Implement TensorParallel for FusedDense and FusedDenseGeluDense · 226a1b72
Tri Dao authored Dec 23, 2022