gaoqiong / flash-attention / Commits
71befc19e130ff65e9ad0f3113635c7a7ea9db60
flash-attention / training / src
01 Jan, 2023
1 commit
[Loss] Use flash_attn.losses.cross_entropy.CrossEntropyLoss · 71befc19 · Tri Dao authored Dec 31, 2022
27 Dec, 2022
1 commit
Implement Tensor Parallel for GPT model · b4018a50 · Tri Dao authored Dec 25, 2022
23 Dec, 2022
1 commit
Add smoothing for CrossEntropyParallel, rename to CrossEntropyLoss · dff68c2b · Tri Dao authored Dec 23, 2022
29 Nov, 2022
1 commit
Release training code · 0bf5e500 · Tri Dao authored Nov 28, 2022