gaoqiong / flash-attention · Commits
tests/losses/test_cross_entropy_parallel.py @ 08124c8f9cf88ba327e2c455ed2a302979f06c91
17 Dec, 2023 (1 commit)

08124c8f  [CrossEntropy] Implement logit_scale option
          Tri Dao, authored Dec 16, 2023
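A minimal usage sketch of this option, assuming it is exposed as a `logit_scale` keyword on `flash_attn.losses.cross_entropy.CrossEntropyLoss` and that it multiplies the logits by a constant before the softmax; the parameter name, semantics, and import path are assumptions, not confirmed from this page:

```python
# Hypothetical sketch: logit_scale is assumed to scale logits before softmax,
# i.e. loss = cross_entropy(logit_scale * logits, labels).
import torch
from flash_attn.losses.cross_entropy import CrossEntropyLoss  # assumed import path

logits = torch.randn(8, 32000, device="cuda", requires_grad=True)
labels = torch.randint(0, 32000, (8,), device="cuda")

loss = CrossEntropyLoss(logit_scale=0.5)(logits, labels)  # assumed keyword

# Reference check against plain PyTorch with pre-scaled logits.
ref = torch.nn.functional.cross_entropy(0.5 * logits.detach(), labels)
assert torch.allclose(loss, ref, atol=1e-4)
```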
20 Nov, 2023 (1 commit)

aaa14741  [CrossEntropy] Simplify the case of large vocab with Tensor Parallel
          Tri Dao, authored Nov 19, 2023
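For context, a reference sketch (plain PyTorch, not the repo's kernel) of the standard vocab-parallel cross-entropy computation this test exercises: each tensor-parallel rank holds a contiguous shard of the vocabulary dimension, and the softmax statistics are combined with all-reduces. Function and argument names here are illustrative only.

```python
import torch
import torch.distributed as dist

def vocab_parallel_cross_entropy(local_logits, labels, vocab_start, process_group=None):
    """Cross entropy where this rank holds logits for [vocab_start, vocab_start + V_local).

    Unfused reference formulation; a real kernel would fuse these steps.
    """
    # Global max over the vocab dimension for numerical stability.
    local_max = local_logits.max(dim=-1).values
    dist.all_reduce(local_max, op=dist.ReduceOp.MAX, group=process_group)
    shifted = local_logits - local_max.unsqueeze(-1)

    # Global log-sum-exp: sum the per-shard exp-sums, then take the log.
    sum_exp = shifted.exp().sum(dim=-1)
    dist.all_reduce(sum_exp, group=process_group)

    # Pick out the (shifted) target logit on the rank that owns it; zero elsewhere.
    vocab_local = local_logits.shape[-1]
    in_shard = (labels >= vocab_start) & (labels < vocab_start + vocab_local)
    local_idx = (labels - vocab_start).clamp(0, vocab_local - 1)
    target_logit = torch.where(
        in_shard,
        shifted.gather(-1, local_idx.unsqueeze(-1)).squeeze(-1),
        torch.zeros_like(sum_exp),
    )
    dist.all_reduce(target_logit, group=process_group)

    # loss = logsumexp - target_logit (the max term cancels out).
    return sum_exp.log() - target_logit
```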
16 Sep, 2023 (1 commit)

5400fdc4  [CE] Implement CrossEntropyLoss in Triton
          Tri Dao, authored Sep 15, 2023
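A minimal drop-in sketch, assuming the Triton implementation lives at `flash_attn.losses.cross_entropy.CrossEntropyLoss` (matching the repo layout implied by this test's path) and mirrors `torch.nn.CrossEntropyLoss` semantics on CUDA tensors:

```python
import torch
from flash_attn.losses.cross_entropy import CrossEntropyLoss  # assumed import path

logits = torch.randn(4, 50257, device="cuda", dtype=torch.float16, requires_grad=True)
labels = torch.randint(0, 50257, (4,), device="cuda")

loss = CrossEntropyLoss()(logits, labels)  # forward assumed to run the Triton kernel
loss.backward()
```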
19 Aug, 2023 (1 commit)

0e8c46ae  Run isort and black on test files
          Tri Dao, authored Aug 18, 2023
27 Dec, 2022 (2 commits)

c6ecd40a  Tweak CrossEntropyLoss to take process_group in init
          Tri Dao, authored Dec 27, 2022

b4018a50  Implement Tensor Parallel for GPT model
          Tri Dao, authored Dec 25, 2022
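Regarding the process_group change in c6ecd40a above, a sketch of what it implies: the tensor-parallel process group is bound once at construction rather than passed on every forward call (the exact signature is an assumption):

```python
import torch.distributed as dist
from flash_attn.losses.cross_entropy import CrossEntropyLoss  # assumed import path

dist.init_process_group(backend="nccl")
tp_group = dist.new_group(ranks=[0, 1])  # illustrative tensor-parallel group

# The group is supplied at init; forward then only takes (logits_shard, labels).
loss_fn = CrossEntropyLoss(process_group=tp_group)
```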
23 Dec, 2022 (1 commit)

dff68c2b  Add smoothing for CrossEntropyParallel, rename to CrossEntropyLoss
          Tri Dao, authored Dec 23, 2022
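With label smoothing ε, the target distribution mixes the one-hot label with uniform mass over the vocabulary. A quick plain-PyTorch check of the standard formula (the kernel here is assumed to follow the same semantics as torch's `label_smoothing`):

```python
import torch
import torch.nn.functional as F

eps, V = 0.1, 1000
logits = torch.randn(8, V)
labels = torch.randint(0, V, (8,))

# Smoothed loss: (1 - eps) * NLL + eps * mean over classes of (-log p).
log_probs = F.log_softmax(logits, dim=-1)
nll = -log_probs.gather(-1, labels.unsqueeze(-1)).squeeze(-1)
smooth = -log_probs.mean(dim=-1)
manual = ((1 - eps) * nll + eps * smooth).mean()

ref = F.cross_entropy(logits, labels, label_smoothing=eps)
assert torch.allclose(manual, ref, atol=1e-6)
```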
14 Nov, 2022 (1 commit)

343492ec  Make nccl operations async in CrossEntropyLossParallel
          Tri Dao, authored Nov 13, 2022
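The async change follows the standard `torch.distributed` pattern: launch the collective with `async_op=True`, overlap independent work while NCCL runs, and wait on the returned handle only when the result is needed. A generic illustration, not the actual code from this commit:

```python
import torch
import torch.distributed as dist

def reduce_with_overlap(sum_exp: torch.Tensor, shifted: torch.Tensor) -> torch.Tensor:
    # Start the all-reduce without blocking the host.
    handle = dist.all_reduce(sum_exp, async_op=True)
    # Independent local work proceeds while the collective is in flight.
    local_part = shifted.max(dim=-1).values
    handle.wait()  # synchronize only when sum_exp is actually consumed
    return sum_exp.log() + local_part
```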
13 Nov, 2022 (1 commit)

7c995381  Add fused cross entropy loss
          Tri Dao, authored Nov 12, 2022