gaoqiong / flash-attention (repository at commit ba2fe7f378c938263e8b5eeeac0fb2766c754551)

File: flash_attn/utils/generation.py (13.6 KB)

Commit ba2fe7f3: "[Gen] Move allocate_inference_cache to within the model"
Authored by Tri Dao, Apr 20, 2023