gaoqiong / flash-attention
Commits · 8a2ece89f7bd5d3124a6cae5fd95db5e85f07ee6
History for flash-attention / csrc / layer_norm
29 Nov, 2022 (1 commit)

Release training code · 0bf5e500
Tri Dao authored Nov 28, 2022
17 Nov, 2022 (1 commit)

[LayerNorm] Compile for both sm70 and sm80 · 39ed597b
Tri Dao authored Nov 17, 2022
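The commit above builds the LayerNorm extension for both the Volta (sm70) and Ampere (sm80) architectures. As a minimal sketch of that pattern (the build flags and kernel below are illustrative, not taken from this repository), one source file can be compiled into a fat binary for both targets and branch on __CUDA_ARCH__ where the architectures need different code paths:

// arch_probe.cu -- hypothetical example, not the extension's actual code.
// Assumed build line (one -gencode pair per target architecture):
//   nvcc -gencode arch=compute_70,code=sm_70 \
//        -gencode arch=compute_80,code=sm_80 arch_probe.cu
#include <cstdio>
#include <cuda_runtime.h>

__global__ void arch_probe() {
#if __CUDA_ARCH__ >= 800
    // Compiled into the sm80 cubin (Ampere).
    printf("running sm80+ code path\n");
#elif __CUDA_ARCH__ >= 700
    // Compiled into the sm70 cubin (Volta).
    printf("running sm70 code path\n");
#endif
}

int main() {
    arch_probe<<<1, 1>>>();
    cudaDeviceSynchronize();
    return 0;
}

At run time the driver picks whichever embedded cubin matches the GPU, so the same binary serves both generations.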
15 Nov, 2022 (2 commits)

Mention that some CUDA extensions have only been tested on A100s · 43ab0b52
Tri Dao authored Nov 15, 2022

[LayerNorm] Check cuda error after querying ctas_per_sm · e4d3013e
Tri Dao authored Nov 15, 2022
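The ctas_per_sm fix above guards an occupancy query. A minimal sketch of the pattern, assuming the query in question is cudaOccupancyMaxActiveBlocksPerMultiprocessor and using an illustrative kernel name (not the extension's actual code):

// occupancy_check.cu -- hypothetical example of checking the error
// status after an occupancy query.
#include <cstdio>
#include <cuda_runtime.h>

__global__ void dummy_kernel() {}

int main() {
    int ctas_per_sm = 0;
    cudaError_t status = cudaOccupancyMaxActiveBlocksPerMultiprocessor(
        &ctas_per_sm, dummy_kernel,
        /*blockSize=*/256, /*dynamicSMemSize=*/0);
    // Without this check, a failed query would silently leave
    // ctas_per_sm at a bogus value and poison later grid-size math.
    if (status != cudaSuccess) {
        fprintf(stderr, "occupancy query failed: %s\n",
                cudaGetErrorString(status));
        return 1;
    }
    printf("ctas_per_sm = %d\n", ctas_per_sm);
    return 0;
}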
14 Nov, 2022 (2 commits)

Add GPT and ViT models · 2e33fc8e
Tri Dao authored Nov 13, 2022

Add fused_dense and dropout_add_layernorm CUDA extensions · fa6d1ce4
Tri Dao authored Nov 13, 2022