"git@developer.sourcefind.cn:chenpangpang/transformers.git" did not exist on "caf1d116a62a324a2b0ccfd92ca6c095d5368dde"
[modeling_utils] use less cpu memory with sharded checkpoint loading (#16844)
* less cpu memory with sharded checkpoint loading * Trigger CI * Trigger CI
Showing
Please register or sign in to comment