[modeling_utils] use less cpu memory with sharded checkpoint loading (#16844)
* less cpu memory with sharded checkpoint loading * Trigger CI * Trigger CI
Showing
Please register or sign in to comment
* less cpu memory with sharded checkpoint loading * Trigger CI * Trigger CI