    [Distributed] use alltoall fix to bypass gloo - alltoallv bug in distributed partitioning (#4311) · c1e01b1d
    kylasa authored
    * Alltoall fix to bypass the gloo alltoallv bug, which is preventing further testing
    
    1. Replaced the gloo alltoallv wrapper call with an alltoall exchange.
    2. All messages are padded to the same length.
    3. The receiving side unpads the messages and continues processing.
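
    The three steps above can be sketched roughly as follows. This is an in-process simulation in plain Python, not the code in gloo_wrapper.py: the names pad_to, simulated_alltoall and alltoallv_via_padding, the PAD filler value, and the use of a single global maximum length are all assumptions made for illustration. A real run would use a collective from the communication backend in place of simulated_alltoall.

```python
from typing import List

PAD = 0  # hypothetical filler value; any value works because it is sliced off


def pad_to(msg: List[int], length: int) -> List[int]:
    """Return msg extended with PAD so that it has exactly `length` items."""
    return msg + [PAD] * (length - len(msg))


def simulated_alltoall(send: List[List[list]]) -> List[List[list]]:
    """Stand-in for a fixed-size alltoall: send[src][dst] ends up at recv[dst][src]."""
    n = len(send)
    return [[send[src][dst] for src in range(n)] for dst in range(n)]


def alltoallv_via_padding(send: List[List[List[int]]]) -> List[List[List[int]]]:
    """Emulate alltoallv (variable-length messages) with equal-length exchanges."""
    n = len(send)
    # Exchange the true lengths first; each length is a fixed-size message.
    lengths = [[[len(m)] for m in row] for row in send]
    recv_lengths = simulated_alltoall(lengths)
    # Assumption: every rank agrees on one common padded length (the global max).
    max_len = max(len(m) for row in send for m in row)
    # Pad every outgoing message to the common length.
    padded = [[pad_to(m, max_len) for m in row] for row in send]
    # Exchange the now equal-length messages.
    recv_padded = simulated_alltoall(padded)
    # The receiving side unpads by slicing back to the true length.
    return [
        [recv_padded[dst][src][: recv_lengths[dst][src][0]] for src in range(n)]
        for dst in range(n)
    ]


if __name__ == "__main__":
    # Two simulated ranks; send[src][dst] is the message from src to dst.
    send = [[[1], [2, 3]], [[4, 5, 6], []]]
    print(alltoallv_via_padding(send))
```

    The cost of this approach is extra traffic: every message is as long as the longest one, so it suits cases where message sizes are not wildly uneven.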
    
    * Code changes to address CI comments
    
    1. Removed unused functions from gloo_wrapper.py
    2. Changed the function signature of alltoallv_cpu_data as suggested.
    3. Added docstring to include more description of the functionality inside alltoallv_cpu_data. Included more asserts to validate the assumptions.
    
    * Changed the function name appropriately
    
    Changed the function name from "alltoallv_cpu_data" to "alltoallv_cpu", which I believe is appropriate because the underlying functionality provides alltoallv, which is basically alltoall_cpu + padding.
    
    * Added code and text to address the review comments.
    
    1. Changed the function name to indicate the local use of this function.
    2. Changed docstring to indicate the assumptions made by alltoallv_cpu function.
    
    * Removed unused function from import statement
    
    Removed the unused (and since deleted) function from the import statement.
gloo_wrapper.py 6.26 KB