gaoqiong / flash-attention · Repository
tests/models/test_gpt_generation_parallel.py (6.38 KB)
Commit 78b7a1dc1869e03a39cf3d2e2d9e5dbb1f669810: [OPT] Load fp16 weights on CPU before moving to GPU
Tri Dao authored Jan 22, 2023
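
The commit message describes a common pattern for large fp16 checkpoints: deserialize the weights onto CPU first, then move the model to GPU, so peak GPU memory during loading stays at a single copy of the parameters. A minimal sketch of that pattern in plain PyTorch follows; the model and checkpoint path here are hypothetical placeholders for illustration, not the repository's actual loading code:

```python
import torch
import torch.nn as nn

# Hypothetical stand-in model; the commit itself concerns OPT weights.
model = nn.Linear(4096, 4096, dtype=torch.float16)

# Deserialize the fp16 checkpoint onto CPU (map_location="cpu") so no GPU
# memory is consumed while the state dict is being read from disk.
state_dict = torch.load("checkpoint_fp16.pt", map_location="cpu")  # hypothetical path
model.load_state_dict(state_dict)

# Only after the weights are materialized on CPU, move the model to GPU.
# Peak GPU usage is then one fp16 copy of the parameters.
model = model.to("cuda")
```

Loading with `map_location="cuda"` instead would stage tensors through GPU memory during deserialization, which can exhaust memory for multi-billion-parameter models.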