Fix RAM OOM when loading large models in tensor parallel mode (#1395)
Co-authored-by: ran_lin <rlin@thoughtworks.com>