1. 17 Oct, 2022 1 commit
    • Rhett Ying's avatar
      [Dist] Reduce peak memory in DistDGL (#4687) · b1309217
      Rhett Ying authored
      * [Dist] Reduce peak memory in DistDGL: avoid validation, release memory once loaded
      
      * remove orig_id from ndata/edata for partition_graph()
      
      * delete orig_id from ndata/edata in dist part pipeline
      
      * reduce dtype size and format before saving graphs
      
      * fix lint
      
      * ETYPE requires to be int32/64 for CSRSortByTag
      
      * fix test failure
      
      * refine
      b1309217
  2. 12 Oct, 2022 1 commit
    • Rhett Ying's avatar
      Multiple dist dl sampler (#4704) · 408eba24
      Rhett Ying authored
      * [Dist] enable iterate multiple dist dataloaders simultaneously
      
      * format file
      
      * add support for any number of dataloaders
      
      * fix lint
      
      * refine code
      408eba24
  3. 10 Oct, 2022 1 commit
  4. 30 Sep, 2022 1 commit
  5. 16 Aug, 2022 1 commit
  6. 03 Aug, 2022 1 commit
  7. 01 Aug, 2022 1 commit
  8. 28 Jul, 2022 1 commit
  9. 11 Jul, 2022 2 commits
  10. 20 Jun, 2022 1 commit
  11. 16 Jun, 2022 1 commit
  12. 09 Jun, 2022 1 commit
  13. 08 Jun, 2022 1 commit
  14. 18 May, 2022 1 commit
  15. 11 May, 2022 1 commit
  16. 27 Apr, 2022 1 commit
    • Rhett Ying's avatar
      [Feature] enable socket net_type for rpc (#3951) · 37be02a4
      Rhett Ying authored
      * [Feature] enable socket net_type for rpc
      
      * fix lint
      
      * fix lint
      
      * fix build issue on windows
      
      * fix test failure on windows
      
      * fix test failure
      
      * fix cpp unit test failure
      
      * net_type blocking max_try_times
      
      * fix other comments
      
      * fix lint
      
      * fix comment
      
      * fix lint
      
      * fix cpp
      37be02a4
  17. 24 Mar, 2022 1 commit
  18. 14 Mar, 2022 1 commit
  19. 02 Mar, 2022 1 commit
  20. 30 Jan, 2022 2 commits
    • Rhett Ying's avatar
      [Fix] sleep for a while when launching clients which will connect to … (#3704) · 9c8c162a
      Rhett Ying authored
      * [Fix] sleep for a while when launching clients which will connect to multiple servers
      
      * pre-allocate more ports
      
      * no multiple partitions on single machine
      9c8c162a
    • Quan (Andy) Gan's avatar
      [Sampling] New sampling pipeline plus asynchronous prefetching (#3665) · 701b4fcc
      Quan (Andy) Gan authored
      * initial update
      
      * more
      
      * more
      
      * multi-gpu example
      
      * cluster gcn, finalize homogeneous
      
      * more explanation
      
      * fix
      
      * bunch of fixes
      
      * fix
      
      * RGAT example and more fixes
      
      * shadow-gnn sampler and some changes in unit test
      
      * fix
      
      * wth
      
      * more fixes
      
      * remove shadow+node/edge dataloader tests for possible ux changes
      
      * lints
      
      * add legacy dataloading import just in case
      
      * fix
      
      * update pylint for f-strings
      
      * fix
      
      * lint
      
      * lint
      
      * lint again
      
      * cherry-picking commit fa9f494
      
      * oops
      
      * fix
      
      * add sample_neighbors in dist_graph
      
      * fix
      
      * lint
      
      * fix
      
      * fix
      
      * fix
      
      * fix tutorial
      
      * fix
      
      * fix
      
      * fix
      
      * fix warning
      
      * remove debug
      
      * add get_foo_storage apis
      
      * lint
      701b4fcc
  21. 28 Jan, 2022 2 commits
  22. 26 Jan, 2022 1 commit
  23. 19 Jan, 2022 2 commits
  24. 11 Jan, 2022 1 commit
    • Rhett Ying's avatar
      [Feature][Dist] change TP::Receiver/TP::Sender for multiple connections (#3574) · 37467e25
      Rhett Ying authored
      
      
      * [Feature] enable TP::Receiver wait for any numbers of senders
      
      * fix random unit test failure
      
      * avoid endless future wait
      
      * fix unit test failure
      
      * fix seg fault when finalize wait in receiver
      
      * [Feature] refactor sender connect logic and remove unnecessary sleeps in unit tests
      
      * fix lint
      
      * release RPCContext resources before process exits
      
      * [Debug] TPReceiver wait start log
      
      * [Debug] add log in get port
      
      * [Debug] add log
      
      * [ReDebug] revert time sleep in unit tests
      
      * [Debug] remove sleep for test_distri,test_mp
      
      * [debug] add more log
      
      * [debug] add listen_booted_ flag
      
      * [debug] restore commented code for queue
      
      * [debug] sleep more in rpc_client
      
      * restore change in tests
      
      * Revert "restore change in tests"
      
      This reverts commit 41a18926d181ec2517069389bfc41de2cc949280.
      
      * Revert "[debug] sleep more in rpc_client"
      
      This reverts commit a908e758eabca0a6ce62eb2e59baea02a840ac67.
      
      * Revert "[debug] restore commented code for queue"
      
      This reverts commit d3f993b3746e6bb6e2cc2f90204dd7e9461c6301.
      
      * Revert "[debug] add listen_booted_ flag"
      
      This reverts commit 244b2167d94942ff2a0acec8823b974975e52580.
      
      * Revert "[debug] add more log"
      
      This reverts commit 4b78447b0a575a824821dc7e25cca2246e6e30e2.
      
      * Revert "[Debug] remove sleep for test_distri,test_mp"
      
      This reverts commit e1df1aadcc8b1c2a0013ed77322ac391a8807612.
      
      * remove debug code
      
      * revert unnecessary change
      
      * revert unnecessary changes
      
      * always reset RPCContext when get started and reset all data
      
      * remove time.sleep in dist tests
      
      * fix lint
      
      * reset envs before each dist test
      
      * reset env properly
      
      * add time sleep when start each server
      
      * sleep for a while when boot server
      
      * replace wait_thread with callback
      
      * fix lint
      
      * add dglconnect handshake check
      Co-authored-by: default avatarJinjing Zhou <VoVAllen@users.noreply.github.com>
      37467e25
  25. 06 Dec, 2021 2 commits
    • Jinjing Zhou's avatar
      [RPC] Use tensorpipe for rpc communication (#3335) · a3ce780d
      Jinjing Zhou authored
      * doesn't know whether works
      
      * add change
      
      * fix
      
      * fix
      
      * fix
      
      * remove
      
      * revert
      
      * lint
      
      * lint
      
      * fix
      
      * revert
      
      * lint
      
      * fix
      
      * only build rpc on linux
      
      * lint
      
      * lint
      
      * fix build on windows
      
      * fix windows
      
      * remove old test
      
      * fix cmake
      
      * Revert "remove old test"
      
      This reverts commit f1ea75c777c34cdc1f08c0589676ba6aee1feb29.
      
      * fix windows
      
      * fix
      
      * fix
      
      * fix indent
      
      * fix indent
      
      * address comment
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * lint
      
      * fix indent
      
      * fix lint
      
      * add introduction
      
      * fix
      
      * lint
      
      * lint
      
      * add more logs
      
      * fix
      
      * update xbyak for C++14 with gcc5
      
      * Remove channels
      
      * fix
      
      * add test script
      
      * fix
      
      * remove unused file
      
      * fix lint
      
      * add timeout
      a3ce780d
    • Quan (Andy) Gan's avatar
      [Distributed] Edge-type-specific fanouts for heterogeneous graphs (#3558) · eb08ef38
      Quan (Andy) Gan authored
      * first commit
      
      * second commit
      
      * spaghetti unit tests
      
      * rewrite test
      eb08ef38
  26. 12 Oct, 2021 1 commit
  27. 01 Sep, 2021 1 commit
  28. 29 Aug, 2021 1 commit
  29. 06 Aug, 2021 1 commit
  30. 28 Jul, 2021 2 commits
  31. 17 Jul, 2021 1 commit
  32. 13 Jul, 2021 1 commit
  33. 05 Jul, 2021 1 commit
  34. 02 Jul, 2021 1 commit