Fix RAM OOM when loading large models in tensor parallel mode. (#1395)
Co-authored-by: ran_lin <rlin@thoughtworks.com>