Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
kecinstone
2024pra-vllm
Repository
094f716bf2cdc213f2b812dbb489fbf6f4a4423c
Switch branch/tag
2024-pra-vllm
vllm
engine
arg_utils.py
Find file
Blame
History
Permalink
fix RAM OOM when load large models in tensor parallel mode. (#1395)
· 4bb6b671
boydfd
authored
Nov 21, 2023
Co-authored-by:
ran_lin
<
rlin@thoughtworks.com
>
4bb6b671
arg_utils.py
10.7 KB
Edit
Web IDE
Replace arg_utils.py
×
Attach a file by drag & drop or
click to upload
Commit message
Replace arg_utils.py
Replace file
Cancel
A new branch will be created in your fork and a new merge request will be started.