gaoqiong / flash-attention
Commits · e02fd588aaaf24cb9af11df0ea7533f2a017c719
flash-attention / tests / models / test_gpt_generation.py
08 Jan, 2023
1 commit
[Gen] Implement top-k and top-p sampling · e02fd588
Tri Dao authored Jan 07, 2023
07 Jan, 2023
1 commit
[Gen] Test generation with rotary embedding · 11be742a
Tri Dao authored Jan 07, 2023
04 Jan, 2023
1 commit
[Gen] Add option to run generation with FT attention kernel · a668890f
Tri Dao authored Jan 03, 2023
28 Dec, 2022
1 commit
Implement generation for GPT · 63670fd8
Tri Dao authored Dec 27, 2022