Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
flash-attention
Commits
"vscode:/vscode.git/clone" did not exist on "4976650f743ece43bb159a55372557c777a4489b"
5400fdc4acbbd06f6de90f40994319a40bf55a39
Switch branch/tag
flash-attention
csrc
xentropy
16 Sep, 2023
1 commit
[CE] Implement CrossEntropyLoss in Triton
· 5400fdc4
Tri Dao
authored
Sep 15, 2023
5400fdc4
15 Mar, 2023
1 commit
Support H100 for other CUDA extensions
· dc08ea1c
Tri Dao
authored
Mar 15, 2023
dc08ea1c
23 Dec, 2022
1 commit
Add smoothing for CrossEntropyParallel, rename to CrossEntropyLoss
· dff68c2b
Tri Dao
authored
Dec 23, 2022
dff68c2b
15 Nov, 2022
1 commit
Mention that some CUDA extensions have only been tested on A100s
· 43ab0b52
Tri Dao
authored
Nov 15, 2022
43ab0b52
14 Nov, 2022
1 commit
Add fused_dense and dropout_add_layernorm CUDA extensions
· fa6d1ce4
Tri Dao
authored
Nov 13, 2022
fa6d1ce4
13 Nov, 2022
1 commit
Add fused cross entropy loss
· 7c995381
Tri Dao
authored
Nov 12, 2022
7c995381