fix deepspeed load best model at end when the model gets sharded (#25057)