• Rhett Ying's avatar
    [Feature][Dist] change TP::Receiver/TP::Sender for multiple connections (#3574) · 37467e25
    Rhett Ying authored
    
    
    * [Feature] enable TP::Receiver wait for any numbers of senders
    
    * fix random unit test failure
    
    * avoid endless future wait
    
    * fix unit test failure
    
    * fix seg fault when finalize wait in receiver
    
    * [Feature] refactor sender connect logic and remove unnecessary sleeps in unit tests
    
    * fix lint
    
    * release RPCContext resources before process exits
    
    * [Debug] TPReceiver wait start log
    
    * [Debug] add log in get port
    
    * [Debug] add log
    
    * [ReDebug] revert time sleep in unit tests
    
    * [Debug] remove sleep for test_distri,test_mp
    
    * [debug] add more log
    
    * [debug] add listen_booted_ flag
    
    * [debug] restore commented code for queue
    
    * [debug] sleep more in rpc_client
    
    * restore change in tests
    
    * Revert "restore change in tests"
    
    This reverts commit 41a18926d181ec2517069389bfc41de2cc949280.
    
    * Revert "[debug] sleep more in rpc_client"
    
    This reverts commit a908e758eabca0a6ce62eb2e59baea02a840ac67.
    
    * Revert "[debug] restore commented code for queue"
    
    This reverts commit d3f993b3746e6bb6e2cc2f90204dd7e9461c6301.
    
    * Revert "[debug] add listen_booted_ flag"
    
    This reverts commit 244b2167d94942ff2a0acec8823b974975e52580.
    
    * Revert "[debug] add more log"
    
    This reverts commit 4b78447b0a575a824821dc7e25cca2246e6e30e2.
    
    * Revert "[Debug] remove sleep for test_distri,test_mp"
    
    This reverts commit e1df1aadcc8b1c2a0013ed77322ac391a8807612.
    
    * remove debug code
    
    * revert unnecessary change
    
    * revert unnecessary changes
    
    * always reset RPCContext when get started and reset all data
    
    * remove time.sleep in dist tests
    
    * fix lint
    
    * reset envs before each dist test
    
    * reset env properly
    
    * add time sleep when start each server
    
    * sleep for a while when boot server
    
    * replace wait_thread with callback
    
    * fix lint
    
    * add dglconnect handshake check
    Co-authored-by: default avatarJinjing Zhou <VoVAllen@users.noreply.github.com>
    37467e25
utils.py 906 Bytes