  1. 07 Nov, 2022 1 commit
  2. 06 Nov, 2022 1 commit
  3. 04 Nov, 2022 3 commits
  4. 21 Sep, 2022 1 commit
  5. 19 Sep, 2022 1 commit
    • [Feature] Bump DLPack to v0.7 and decouple DLPack from the core library (#4454) · cded5b80
      Xin Yao authored
      * rename `DLContext` to `DGLContext`
      
      * rename `kDLGPU` to `kDLCUDA`
      
      * replace DLTensor with DGLArray
      
      * fix linting
      
      * Unify DGLType and DLDataType to DGLDataType
      
      * Fix FFI
      
      * rename DLDeviceType to DGLDeviceType
      
      * decouple dlpack from the core library
      
      * fix bug
      
      * fix lint
      
      * fix merge
      
      * fix build
      
      * address comments
      
      * rename dl_converter to dlpack_convert
      
      * remove redundant comments
  6. 27 Jul, 2022 1 commit
  7. 20 Jun, 2022 1 commit
  8. 08 Jun, 2022 1 commit
  9. 11 May, 2022 1 commit
  10. 27 Apr, 2022 1 commit
    • [Feature] enable socket net_type for rpc (#3951) · 37be02a4
      Rhett Ying authored
      * [Feature] enable socket net_type for rpc
      
      * fix lint
      
      * fix lint
      
      * fix build issue on windows
      
      * fix test failure on windows
      
      * fix test failure
      
      * fix cpp unit test failure
      
      * net_type blocking max_try_times
      
      * fix other comments
      
      * fix lint
      
      * fix comment
      
      * fix lint
      
      * fix cpp
  11. 24 Mar, 2022 1 commit
  12. 26 Jan, 2022 1 commit
  13. 19 Jan, 2022 1 commit
  14. 11 Jan, 2022 1 commit
    • [Feature][Dist] change TP::Receiver/TP::Sender for multiple connections (#3574) · 37467e25
      Rhett Ying authored
      
      
      * [Feature] enable TP::Receiver wait for any numbers of senders
      
      * fix random unit test failure
      
      * avoid endless future wait
      
      * fix unit test failure
      
      * fix seg fault when finalize wait in receiver
      
      * [Feature] refactor sender connect logic and remove unnecessary sleeps in unit tests
      
      * fix lint
      
      * release RPCContext resources before process exits
      
      * [Debug] TPReceiver wait start log
      
      * [Debug] add log in get port
      
      * [Debug] add log
      
      * [ReDebug] revert time sleep in unit tests
      
      * [Debug] remove sleep for test_distri,test_mp
      
      * [debug] add more log
      
      * [debug] add listen_booted_ flag
      
      * [debug] restore commented code for queue
      
      * [debug] sleep more in rpc_client
      
      * restore change in tests
      
      * Revert "restore change in tests"
      
      This reverts commit 41a18926d181ec2517069389bfc41de2cc949280.
      
      * Revert "[debug] sleep more in rpc_client"
      
      This reverts commit a908e758eabca0a6ce62eb2e59baea02a840ac67.
      
      * Revert "[debug] restore commented code for queue"
      
      This reverts commit d3f993b3746e6bb6e2cc2f90204dd7e9461c6301.
      
      * Revert "[debug] add listen_booted_ flag"
      
      This reverts commit 244b2167d94942ff2a0acec8823b974975e52580.
      
      * Revert "[debug] add more log"
      
      This reverts commit 4b78447b0a575a824821dc7e25cca2246e6e30e2.
      
      * Revert "[Debug] remove sleep for test_distri,test_mp"
      
      This reverts commit e1df1aadcc8b1c2a0013ed77322ac391a8807612.
      
      * remove debug code
      
      * revert unnecessary change
      
      * revert unnecessary changes
      
      * always reset RPCContext when get started and reset all data
      
      * remove time.sleep in dist tests
      
      * fix lint
      
      * reset envs before each dist test
      
      * reset env properly
      
      * add time sleep when start each server
      
      * sleep for a while when boot server
      
      * replace wait_thread with callback
      
      * fix lint
      
      * add dglconnect handshake check
      Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>
  15. 06 Dec, 2021 1 commit
    • [RPC] Use tensorpipe for rpc communication (#3335) · a3ce780d
      Jinjing Zhou authored
      * don't know whether it works
      
      * add change
      
      * fix
      
      * fix
      
      * fix
      
      * remove
      
      * revert
      
      * lint
      
      * lint
      
      * fix
      
      * revert
      
      * lint
      
      * fix
      
      * only build rpc on linux
      
      * lint
      
      * lint
      
      * fix build on windows
      
      * fix windows
      
      * remove old test
      
      * fix cmake
      
      * Revert "remove old test"
      
      This reverts commit f1ea75c777c34cdc1f08c0589676ba6aee1feb29.
      
      * fix windows
      
      * fix
      
      * fix
      
      * fix indent
      
      * fix indent
      
      * address comment
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * lint
      
      * fix indent
      
      * fix lint
      
      * add introduction
      
      * fix
      
      * lint
      
      * lint
      
      * add more logs
      
      * fix
      
      * update xbyak for C++14 with gcc5
      
      * Remove channels
      
      * fix
      
      * add test script
      
      * fix
      
      * remove unused file
      
      * fix lint
      
      * add timeout
  16. 28 Sep, 2021 1 commit
  17. 02 Sep, 2021 1 commit
  18. 25 Jul, 2021 1 commit
  19. 22 Mar, 2021 1 commit
  20. 26 Dec, 2020 1 commit
  21. 15 Dec, 2020 1 commit
  22. 09 Dec, 2020 1 commit
  23. 25 Sep, 2020 1 commit
  24. 19 Aug, 2020 1 commit
  25. 14 Aug, 2020 1 commit
    • Fix hang when sending empty ndarray in rpc request (#2022) · 1f5f31ce
      Jinjing Zhou authored
      * aaa
      
      * revert
      
      * 111
      
      * fix
      
      * fff
      
      * 111
      
      * try
      
      * debug
      
      * fff
      
      * 111
      
      * 1111
      
      * fix
      
      * try
      
      * 111
      
      * 111
      
      * print
      
      * f
      
      * 111
      
      * revert
      
      * fix
      
      * revert
      
      * add timeout
      
      * Revert "add timeout"
      
      This reverts commit fb34f67f11ad6556824daa6efa29a6ec2c7c1d3e.
  26. 13 Aug, 2020 1 commit
  27. 05 Aug, 2020 1 commit
    • [Distributed] DistDataloader (#1901) · 4f499c7f
      Jinjing Zhou authored
      
      
      * 111
      
      * 111
      
      * fix
      
      * 111
      
      * fix
      
      * 11
      
      * fix
      
      * lint
      
      * Update __init__.py
      
      * lint
      
      * fix
      
      * lint
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * try fix
      
      * try fix
      
      * fix
      
      * Revert "fix"
      
      This reverts commit a0b954fd4e99b7df92b53db8334dcb583d6e1551.
      
      * fixes.
      
      * fix.
      
      * fix test.
      
      * fix exit.
      
      * fix.
      
      * fix
      
      * fix
      
      * lint
      
      * lint
      
      * lint
      
      * fix
      
      * Update .gitignore
      
      * 111
      
      * fix
      
      * 111
      
      * 111
      
      * fff
      
      * 1111
      
      * 111
      
      * 1325315
      
      * ffff
      
      * f???
      
      * fff
      
      * 1111
      
      * 111
      
      * fix
      
      * 111
      
      * asda
      
      * 1111
      
      * 11
      
      * 123
      
      * aaaaaaaaaaaaaaaa
      
      * spawn
      
      * 1231231
      
      * up
      
      * 111
      
      * fix
      
      * fix
      
      * Revert "fix"
      
      This reverts commit 7373f95312fdcaa36d2fc330bf242339e89c045d.
      
      * fix
      
      * fix
      
      * 1111
      
      * fix
      
      * fix tests
      
      * start kvclient as early as possible.
      
      * lint
      
      * fix test
      
      * lint
      
      * 1111
      
      * fix
      
      * fix
      
      * 111
      
      * fix
      
      * fix
      
      * 1
      
      * fix
      
      * fix
      
      * lint
      
      * fix
      
      * lint
      
      * lint
      
      * remove quit
      
      * fix
      
      * lint
      
      * fix
      
      * fix several
      
      * lint
      
      * fix minor
      
      * fix
      
      * lint
      Co-authored-by: Da Zheng <zhengda1936@gmail.com>
      Co-authored-by: xiang song(charlie.song) <classicxsong@gmail.com>
  28. 30 Jul, 2020 1 commit
    • [KVStore] Support group barrier (#1880) · 5b515cf6
      Chao Ma authored
      
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * fix lint
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      Co-authored-by: Da Zheng <zhengda1936@gmail.com>
  29. 23 Jul, 2020 1 commit
  30. 22 Jul, 2020 1 commit
  31. 21 Jul, 2020 1 commit
  32. 09 Jul, 2020 1 commit
    • Handle EINTR in TCP layer (#1766) · 83f71466
      mszarma authored
      
      
      Added EINTR handling: a TCP call is retried when its
      return value and errno indicate that it was interrupted
      by a signal.
      Added information about the EAGAIN timeout failure in the
      accept stage.
      Added printing of errno in the logs.
      Co-authored-by: Chao Ma <mctt90@gmail.com>
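The retry-on-EINTR pattern described in commit 83f71466 can be sketched roughly as follows. This is only an illustration in Python, not the code from the commit, which changes the TCP layer itself. The names `retry_on_eintr` and `max_retries` are hypothetical and do not come from the repository. Since Python 3.5 (PEP 475) the standard library already retries most interrupted system calls automatically, so the wrapper is shown only to make the logic explicit.

```python
import errno


def retry_on_eintr(call, *args, max_retries=5):
    """Run call(*args), retrying while it fails with EINTR.

    `retry_on_eintr` and `max_retries` are hypothetical names used
    only for this sketch.
    """
    attempts = 0
    while True:
        try:
            return call(*args)
        except OSError as exc:
            # Retry only when the call was interrupted by a signal.
            # Any other errno (for example EAGAIN from a timed-out
            # accept) is re-raised so the caller can log it.
            if exc.errno != errno.EINTR or attempts >= max_retries:
                raise
            attempts += 1
```

A caller would wrap a blocking socket operation, for example `retry_on_eintr(sock.recv, 4096)`, and treat any exception that escapes as a real failure whose `errno` is worth logging. The retry cap is an assumption of this sketch, added so that a persistently interrupted call cannot loop forever.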
  33. 07 Jul, 2020 1 commit
  34. 03 Jul, 2020 1 commit
  35. 29 Jun, 2020 1 commit
  36. 28 Jun, 2020 1 commit
    • [CUDA][Kernel] More CUDA kernels; Standardize the behavior for sorted COO/CSR (#1704) · 870da747
      Minjie Wang authored
      * add cub; array cumsum
      
      * CSRSliceRows
      
      * fix warning
      
      * operator << for ndarray; CSRSliceRows
      
      * add CSRIsSorted
      
      * add csr_sort
      
      * inplace coosort and outplace csrsort
      
      * WIP: coo is sorted
      
      * mv cuda_utils
      
      * add AllTrue utility
      
      * csr sort
      
      * coo sort
      
      * coo2csr for sorted coo arrays
      
      * CSRToCOO from sorted
      
      * pass tests for the new kernel changes
      
      * cannot use inplace sort
      
      * lint
      
      * try fix msvc error
      
      * Fix g.copy_to and g.asnumbits; ToBlock no longer uses CSC
      
      * stash
      
      * revert some hack
      
      * revert some changes
      
      * address comments
      
      * fix
      
      * fix to_block unittest
      
      * add todo note
  37. 17 Jun, 2020 2 commits