[DIST] Message size to retrieve SHUFFLE_GLOBAL_NIDs is resulting in very large...
[DIST] Message size to retrieve SHUFFLE_GLOBAL_NIDs is resulting in very large messages and resulting in killed process (#4790) * Send out the message to the distributed lookup service in batches. * Update function signature for allgather_sizes function call. * Removed the unnecessary if statement . * Removed logging.info message, which is not needed.
Showing
Please register or sign in to comment