gaoqiong
flash-attention
Commits
184b992dcb2a0890adaa19eb9b541c3e4f9d2a08
flash-attention/tests/models/test_llama.py
29 Jul, 2023
1 commit
[GPT] Implement parallel LLaMa · 184b992d
Tri Dao authored Jul 28, 2023
26 Jul, 2023
2 commits
[GPT] Add LLaMa-13B to test · 56ccaff1
Tri Dao authored Jul 26, 2023
[Rotary] Fix tests when loading state dict with rotary inv_freqs · 8e9820a5
Tri Dao authored Jul 26, 2023
02 Jul, 2023
1 commit
[Rotary] Make sure frequency calculation is in fp32 · 62e98144
Tri Dao authored Jul 02, 2023
05 May, 2023
1 commit
[LLaMa] Fix last norm layer to use RMSNorm instead of LayerNorm · a9a4b4e4
Tri Dao authored May 04, 2023
19 Apr, 2023
1 commit
Implement LLaMa · 96d10f65
Tri Dao authored Apr 18, 2023