Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
flash-attention
Commits
"official/r1/transformer/dataset.py" did not exist on "faf4bbb36d576b0c328f179eb895eb6a48f284e1"
ef085cfcdac0efd7eed8e27a31ae6fbd27126ad4
Switch branch/tag
flash-attention
flash_attn
models
vit.py
16 Jan, 2023
1 commit
[ViT] Fix extra norm_0, use new LN order in Block
· ef085cfc
Tri Dao
authored
Jan 15, 2023
ef085cfc
23 Nov, 2022
1 commit
[ViT] Use dropout_add_ln for the 1st layer norm
· 1feb9426
Tri Dao
authored
Nov 23, 2022
1feb9426
14 Nov, 2022
1 commit
Add GPT and ViT models
· 2e33fc8e
Tri Dao
authored
Nov 13, 2022
2e33fc8e