[feat] ShardedDataParallel with autoreduce (#157)
* Rewrite using autograd and the Variable execution queue to make the reduce automatic
* Share buckets with OSS to remove duplication
* Some speed is likely still on the table, since the measured speed vs. bucketing does not match expectations; this could be a follow-up
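
The automatic reduce described above can be pictured with a small, framework-free sketch. It is not the code from this commit: `GradBucket`, `on_grad_ready`, and `fake_reduce` are hypothetical names, gradients are plain floats, and the reduce is simulated by a local callback rather than a real collective. The only idea it illustrates is that a per-parameter "gradient ready" callback fills a bucket, and the reduce fires by itself once the bucket is complete.

```python
from typing import Callable, Dict, List


class GradBucket:
    """Hypothetical bucket: collects per-parameter gradients and triggers
    a single reduce call once every expected parameter has reported."""

    def __init__(
        self,
        param_names: List[str],
        reduce_fn: Callable[[Dict[str, float]], None],
    ) -> None:
        self.expected = set(param_names)
        self.pending: Dict[str, float] = {}
        self.reduce_fn = reduce_fn

    def on_grad_ready(self, name: str, grad: float) -> None:
        # Stands in for a per-parameter backward hook (an assumption here).
        self.pending[name] = grad
        if set(self.pending) == self.expected:
            # Bucket is full: reduce once, then reset for the next step.
            self.reduce_fn(dict(self.pending))
            self.pending = {}


# Record each simulated reduce so the behavior can be inspected.
calls: List[Dict[str, float]] = []


def fake_reduce(grads: Dict[str, float]) -> None:
    calls.append(grads)


bucket = GradBucket(["w1", "w2"], fake_reduce)

# Backward typically produces gradients in reverse order of use.
bucket.on_grad_ready("w2", 0.5)   # bucket incomplete, no reduce yet
bucket.on_grad_ready("w1", 0.25)  # bucket complete, reduce fires
```

The caller never invokes the reduce explicitly; it is a side effect of the last gradient arriving. Grouping several parameters per bucket trades fewer, larger reduce calls against waiting longer before the first one can start.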