gaoqiong / flash-attention · Commits · d2f4324f4c56e017fbf22dc421943793a8ca6c3b

Commit history for tests/ops/test_dropout_layer_norm.py
[LayerNorm] Make sure memory addresses are aligned to 16 bytes · d2f4324f
Tri Dao authored Jul 04, 2023
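The alignment requirement presumably comes from the kernel's vectorized 16-byte (128-bit) loads and stores. A minimal sketch of the constraint in plain PyTorch (`is_16b_aligned` is a hypothetical helper, not part of the repo): a freshly allocated tensor is aligned, but an offset view of it may not be.

```python
import torch

def is_16b_aligned(t: torch.Tensor) -> bool:
    # Kernels that issue vectorized 128-bit memory accesses need the
    # first element to sit at a 16-byte-aligned address.
    return t.data_ptr() % 16 == 0

x = torch.randn(8, 4096)         # fresh allocations are aligned
print(is_16b_aligned(x))         # True
print(is_16b_aligned(x[:, 1:]))  # False: this view starts 4 bytes in
```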
[LayerNorm] Implement LN with parallel residual, support dim 8k · 393882bc
Tri Dao authored Mar 29, 2023
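"Parallel residual" here refers to the GPT-J / GPT-NeoX block layout, where the attention and MLP branches read from the same residual stream through two separately parameterized norms. A hedged plain-PyTorch reading of what the fused kernel computes (the function name and exact dropout placement are my assumptions, not the repo's API):

```python
import torch
import torch.nn.functional as F

def parallel_residual_ln_ref(x0, x1, residual, w0, b0, w1, b1, p=0.0, eps=1e-5):
    # Both branch outputs are dropped out and summed into one residual
    # stream, which is then normalized twice with separate affine
    # parameters: one LayerNorm feeds attention, the other feeds the MLP.
    res = F.dropout(x0, p) + F.dropout(x1, p) + residual
    out0 = F.layer_norm(res, res.shape[-1:], w0, b0, eps)
    out1 = F.layer_norm(res, res.shape[-1:], w1, b1, eps)
    return out0, out1, res
```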
[LayerNorm] Implement RMS Norm · 6738d947
Tri Dao authored Jan 06, 2023
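RMSNorm (Zhang & Sennrich, 2019) drops LayerNorm's mean subtraction and normalizes by the root-mean-square alone. The standard definition, as a reference sketch rather than the CUDA implementation:

```python
import torch

def rms_norm_ref(x, weight, eps=1e-6):
    # Scale by the reciprocal root-mean-square over the last dimension;
    # unlike LayerNorm there is no mean subtraction and typically no bias.
    inv_rms = torch.rsqrt(x.pow(2).mean(dim=-1, keepdim=True) + eps)
    return x * inv_rms * weight
```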
[LayerNorm] Support taking subset of input or subset of output · 5db33051
Tri Dao authored Dec 12, 2022
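One plausible reading of the commit message, sketched in plain PyTorch: the op can consume or produce only a selected subset of rows (useful e.g. for dropping tokens), which unfused code would express as a gather before or after the norm. The row indices below are hypothetical:

```python
import torch
import torch.nn.functional as F

x = torch.randn(16, 1024)
rows = torch.tensor([0, 3, 7, 9])   # hypothetical subset of rows to keep
w = torch.ones(1024)
b = torch.zeros(1024)

# "subset of input": normalize only the selected rows
out_in = F.layer_norm(x[rows], (1024,), w, b)
# "subset of output": normalize everything, emit only the selected rows
out_out = F.layer_norm(x, (1024,), w, b)[rows]
```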
[LayerNorm] Fuse LayerScale · ae137ed1
Tri Dao authored Dec 10, 2022
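LayerScale (Touvron et al., CaiT) multiplies a branch output by a learned per-channel vector; fusing it into the dropout + add + LayerNorm kernel saves a separate elementwise pass over memory. A standalone sketch of the operation being fused (not the repo's code):

```python
import torch
import torch.nn as nn

class LayerScale(nn.Module):
    # Learned per-channel scale applied to a branch output before it
    # enters the dropout -> residual add -> LayerNorm chain.
    def __init__(self, dim, init_value=1e-4):
        super().__init__()
        self.gamma = nn.Parameter(init_value * torch.ones(dim))

    def forward(self, x):
        return x * self.gamma
```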
[LayerNorm] Support all dimensions up to 6k (if divisible by 8) · 8c6609ae
Tri Dao authored Dec 08, 2022
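The stated constraint is easy to express as a check; this tiny sketch (a hypothetical helper, not the repo's API) simply restates the commit message:

```python
def dim_supported(hidden_dim: int) -> bool:
    # Per the commit message: any hidden size up to 6144 works,
    # provided it is divisible by 8 (the vectorized access width).
    return hidden_dim % 8 == 0 and hidden_dim <= 6144

assert dim_supported(4096) and dim_supported(6144)
assert not dim_supported(4100)   # not divisible by 8
assert not dim_supported(6152)   # divisible by 8, but above 6144
```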
Add fused_dense and dropout_add_layernorm CUDA extensions · fa6d1ce4
Tri Dao authored Nov 13, 2022
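For orientation, a plain-PyTorch sketch of the semantics the two extensions fuse, as exercised by this test file (my reading of the commit message; the real implementations are CUDA kernels):

```python
import torch
import torch.nn.functional as F

def fused_dense_ref(x, weight, bias):
    # fused_dense: a GEMM with the bias add folded into the same kernel
    return F.linear(x, weight, bias)

def dropout_add_layernorm_ref(x0, residual, w, b, p, eps=1e-5):
    # dropout_add_layernorm: dropout -> residual add -> LayerNorm,
    # computed in a single fused pass instead of three separate ops
    res = F.dropout(x0, p) + residual
    return F.layer_norm(res, res.shape[-1:], w, b, eps)
```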