[zero] sharded model supports the reuse of fp16 shard (#495)
* sharded model supports reuse fp16 shard
* rename variable
* polish code
* polish code
* polish code