Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
flash-attention
Commits
ec6d22143b5d375e253b2ebfc563b26a43f43684
Switch branch/tag
flash-attention
flash_attn
ops
triton
cross_entropy.py
26 Apr, 2024
1 commit
[CrossEntropy] Change ignored_index -> ignore_index
· ec6d2214
Tri Dao
authored
Apr 26, 2024
ec6d2214
21 Jan, 2024
1 commit
return z_loss (#768)
· d8aacc51
Curtis "Fjord" Hawthorne
authored
Jan 21, 2024
d8aacc51
17 Dec, 2023
1 commit
[CrossEntropy] Implement logit_scale option
· 08124c8f
Tri Dao
authored
Dec 16, 2023
08124c8f
20 Nov, 2023
2 commits
[CrossEntropy] Simplify the case of large vocab with Tensor Parallel
· aaa14741
Tri Dao
authored
Nov 19, 2023
aaa14741
fix flash ce mp large vocab (#673)
· abf04a56
Shijie
authored
Nov 20, 2023
abf04a56
24 Oct, 2023
1 commit
[CrossEntropy] Fix triton cross_entropy_loss IMA for >=2B elements
· c79de85f
Tri Dao
authored
Oct 24, 2023
c79de85f
16 Sep, 2023
1 commit
[CE] Implement CrossEntropyLoss in Triton
· 5400fdc4
Tri Dao
authored
Sep 15, 2023
5400fdc4