[perf] ShardedDDP: better handling of the callback queue, try to consume it as we go. (#254)
* Better handling of the callback queue, try to consume it as we go.
* Dumping buckets for the reduce part, always the same unused params issue.
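
For illustration only, a minimal sketch of what "consume it as we go" could look like. It is not the code from this change: `FakeWork`, `CallbackQueue`, `try_consume` and `flush` are hypothetical names, and `FakeWork` only mimics an async reduce handle exposing `is_completed()` and `wait()`.

```python
from collections import deque
from typing import Callable, Deque, Tuple


class FakeWork:
    """Hypothetical stand-in for an async reduce handle (assumption, not a real API)."""

    def __init__(self, done: bool) -> None:
        self.done = done

    def is_completed(self) -> bool:
        return self.done

    def wait(self) -> None:
        # A real handle would block here until the reduce finishes.
        self.done = True


class CallbackQueue:
    """FIFO of (work handle, callback) pairs, drained opportunistically."""

    def __init__(self) -> None:
        self._queue: Deque[Tuple[FakeWork, Callable[[], None]]] = deque()

    def __len__(self) -> int:
        return len(self._queue)

    def push(self, work: FakeWork, callback: Callable[[], None]) -> None:
        self._queue.append((work, callback))

    def try_consume(self) -> None:
        """Run callbacks for finished work at the head of the queue, without blocking."""
        while self._queue and self._queue[0][0].is_completed():
            _, callback = self._queue.popleft()
            callback()

    def flush(self) -> None:
        """Block on whatever is left, e.g. at the end of the backward pass."""
        while self._queue:
            work, callback = self._queue.popleft()
            work.wait()
            callback()
```

In this sketch, `try_consume()` would be called each time a new reduce is queued, so that `flush()` at the end only has a short tail left to wait on.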