1. 08 Nov, 2022 1 commit
  2. 07 Nov, 2022 4 commits
  3. 06 Nov, 2022 2 commits
  4. 04 Nov, 2022 4 commits
  5. 03 Nov, 2022 3 commits
  6. 02 Nov, 2022 1 commit
  7. 29 Oct, 2022 1 commit
    • Quan (Andy) Gan's avatar
      [Sampling] Enable sampling with edge masks in sample_etype_neighbors (#4749) · 2bca4759
      Quan (Andy) Gan authored
      * sample neighbors with masks
      
      * oops
      
      * refactor again
      
      * remove
      
      * remove debug code
      
      * rename macro
      
      * address comments
      
      * more stuff
      
      * remove
      
      * fix
      
      * try fix unit test
      
      * oops
      
      * fix test
      
      * oops
      
      * change name
      
      * rename a lot of stuff
      
      * oops
      
      * ugh
      
      * misc fixes
      
      * lint
      
      * address a lot of comments
      
      * lint
      
      * lint
      
      * fix
      
      * that was silly
      
      * fix
      
      * fix
      
      * fix
      
      * oops
      2bca4759
  8. 28 Oct, 2022 1 commit
  9. 19 Oct, 2022 1 commit
  10. 13 Oct, 2022 2 commits
  11. 11 Oct, 2022 1 commit
  12. 21 Sep, 2022 1 commit
  13. 19 Sep, 2022 2 commits
  14. 15 Sep, 2022 1 commit
    • Xin Yao's avatar
      [Feature] Import PyTorch's CUDA stream management (#4503) · 9a00cf19
      Xin Yao authored
      * add set_stream
      
      * add .record_stream for NDArray and HeteroGraph
      
      * refactor dgl stream Python APIs
      
      * test record_stream
      
      * add unit test for record stream
      
      * use pytorch's stream
      
      * fix lint
      
      * fix cpu build
      
      * address comments
      
      * address comments
      
      * add record stream tests for dgl.graph
      
      * record frames and update dataloder
      
      * add docstring
      
      * update frame
      
      * add backend check for record_stream
      
      * remove CUDAThreadEntry::stream
      
      * record stream for newly created formats
      
      * fix bug
      
      * fix cpp test
      
      * fix None c_void_p to c_handle
      9a00cf19
  15. 06 Sep, 2022 1 commit
    • Chang Liu's avatar
      [Feature] Unify the cuda stream used in core library (#4480) · 1c9d2a03
      Chang Liu authored
      
      
      * Use an internal cuda stream for CopyDataFromTo
      
      * small fix white space
      
      * Fix to compile
      
      * Make stream optional in copydata for compile
      
      * fix lint issue
      
      * Update cub functions to use internal stream
      
      * Lint check
      
      * Update CopyTo/CopyFrom/CopyFromTo to use internal stream
      
      * Address comments
      
      * Fix backward CUDA stream
      
      * Avoid overloading CopyFromTo()
      
      * Minor comment update
      
      * Overload copydatafromto in cuda device api
      Co-authored-by: default avatarxiny <xiny@nvidia.com>
      1c9d2a03
  16. 05 Sep, 2022 2 commits
  17. 31 Aug, 2022 1 commit
    • Xin Yao's avatar
      [Feature] Make TensorAdapter Stream Aware (#4472) · 2b766740
      Xin Yao authored
      * Allocate tensors in DGL's current stream
      
      * make tensoradaptor stream-aware
      
      * replace TAemtpy with cpu allocator
      
      * fix typo
      
      * try fix cpu allocation
      
      * clean header
      
      * redirect AllocDataSpace as well
      
      * resolve comments
      2b766740
  18. 23 Aug, 2022 1 commit
  19. 18 Aug, 2022 1 commit
  20. 15 Aug, 2022 1 commit
  21. 12 Aug, 2022 1 commit
  22. 09 Aug, 2022 1 commit
  23. 01 Aug, 2022 1 commit
  24. 29 Jul, 2022 1 commit
    • Xin Yao's avatar
      [Feature] Add CUDA Weighted Neighborhood Sampling (#4064) · 86c81b4e
      Xin Yao authored
      
      
      * add weighted sampling without replacement (A-Chao)
      
      * improve Algorithm A-Chao with block-wise prefix sum
      
      * correctly fill out_idxs
      
      * implement weighted sampling with replacement
      
      * small fix
      
      * merge host-side code of weighted/uniform sampling
      
      * enable unit tests for cuda weighted sampling
      
      * move thrust/cub wrapper to the cmake file
      
      * update docs accordingly
      
      * fix linting
      
      * fix linting
      
      * fix unit test
      
      * Bump external CUB/Thrust versions
      
      * Fix code style and update description of algorithm design
      
      * [Feature] GPU support weighted graph neighbor sampling
      commit by pengqirong(OPPO)
      
      * merge pengqirong's implementation
      
      * revert the change to cub and thrust
      
      * fix linting
      
      * use DeviceSegmentedSort for better performance
      
      * add more comments
      
      * add necessary notes
      
      * add necessary notes
      
      * resolve some comments
      
      * define THRUST_CUB_WRAPPED_NAMESPACE
      
      * fix doc
      Co-authored-by: default avatar彭齐荣 <657017034@qq.com>
      86c81b4e
  25. 27 Jul, 2022 1 commit
  26. 26 Jul, 2022 1 commit
  27. 15 Jul, 2022 1 commit
  28. 09 Jul, 2022 1 commit