Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
flash-attention
Commits
009a3e71ece11b2ec20629883f7d66212db52c7d
Switch branch/tag
flash-attention
training
src
29 Mar, 2023
1 commit
[Training] Fix lightning _PATH import
· 009a3e71
Tri Dao
authored
Mar 29, 2023
009a3e71
01 Jan, 2023
1 commit
[Loss] Use flash_attn.losses.cross_entropy.CrossEntropyLoss
· 71befc19
Tri Dao
authored
Dec 31, 2022
71befc19
27 Dec, 2022
1 commit
Implement Tensor Parallel for GPT model
· b4018a50
Tri Dao
authored
Dec 25, 2022
b4018a50
23 Dec, 2022
1 commit
Add smoothing for CrossEntropyParallel, rename to CrossEntropyLoss
· dff68c2b
Tri Dao
authored
Dec 23, 2022
dff68c2b
29 Nov, 2022
1 commit
Release training code
· 0bf5e500
Tri Dao
authored
Nov 28, 2022
0bf5e500