-
mshoeybi authored
Allocate tensor in `communicate()` method directly on GPU (instead of allocating on CPU and then moving to GPU)
9ff6f473
Allocate tensor in `communicate()` method directly on GPU (instead of allocating on CPU and then moving to GPU)