[shardformer] Fix serialization error with Tensor Parallel state saving (#5018)
* Fix serialization error with Tensor Parallel state saving * Refactor state_dict CPU transfer using tree_map
Showing
Please register or sign in to comment
* Fix serialization error with Tensor Parallel state saving * Refactor state_dict CPU transfer using tree_map