[shardformer] Add dropout layer in shard model and refactor policy api (#3949)
* add dist dropout in model * update docstring and bert policy with dropout * refactor basepolicy and sharded, update bert * update format * update gpt2 policy * update bert policy * remove unused code * update readme for new policy usage
Showing
Please register or sign in to comment