1. 06 Dec, 2022 1 commit
    • Chang Liu's avatar
      Add support for next cusparse release (#4974) · fb223d47
      Chang Liu authored
      * Add support for next cusparse release
      
      * Fix lint
      
      * Add switch and tune the performance
      
      * Fix lint issue
      
      * Fine tune the heuristics
      
      * Fix lint issue
      
      * Address comments
      
      * Minor fix
      
      * Address comments
      fb223d47
  2. 01 Dec, 2022 1 commit
  3. 24 Nov, 2022 1 commit
  4. 22 Nov, 2022 2 commits
    • Ping Gong's avatar
      [Performance] Leverage hashmap to accelerate CSRSliceMatrix<kDGLCUDA, IdType> (#4924) · aa419895
      Ping Gong authored
      
      
      * Leverage hashmap to accelerate CSRSliceMatrix
      
      * fix lint check
      
      * use `min` in cuda_runtime.ch
      
      * fix hash func
      
      * add some comments and adjust the <grid,block> of the _SegmentMaskColKernel kernel
      
      * set device and stream for thrust::for_each
      
      * use thrust::cuda::par_nosync
      Co-authored-by: default avatarXin Yao <xiny@nvidia.com>
      aa419895
    • Muhammed Fatih BALIN's avatar
      [Feature] (La)yer-Neigh(bor) sampling implementation (#4668) · bf264d00
      Muhammed Fatih BALIN authored
      
      
      * adding LABOR sampling
      
      * add ladies and pladies samplers
      
      * fix compile error after rebase
      
      * add reference for ladies sampler
      
      * Improve ladies implementation.
      
      * weighted labor sampling initial implementation draft
      fix indentation and small bug in ladies script
      
      * importance_sampling currently doesn't work with weights
      
      * fix weighted importance sampling
      
      * move labor example into its own folder
      
      * lint fixes
      
      * Improve documentation
      
      * remove examples from the main PR
      
      * fix linting by not using c++17 features
      
      * fix documentation of labor_sampler.py
      
      * update documentation for labor.py
      
      * reformat the labor.py file with black
      
      * fix linting errors
      
      * replace exception use with if
      
      * fix typo in error comment
      
      * fixing win64 build for ci
      
      * fixing weighted implementation, works now.
      
      * fix bug in the weighted case and importance_sampling==0
      
      * address part of the reviews
      
      * remove unused code paths from cuda
      
      * remove unused code path from cpu side
      
      * remove extra features of labor making use of random seed.
      
      * fix exclude_edges bug
      
      * remove pcg and seed logic from cpu implementation, seed logic should still work for cuda.
      
      * minor style change
      
      * refactor CPU implementation, take out the importance_sampling probability computation into a function.
      
      * improve CUDAWorkspaceAllocator
      
      * refactor importance_sampling part out to a function
      
      * minor optimization
      
      * fix linting issue
      
      * Revert "remove pcg and seed logic from cpu implementation, seed logic should still work for cuda."
      
      This reverts commit c250e07ac6d7e13f57e79e8a2c2f098d777378c2.
      
      * Revert "remove extra features of labor making use of random seed."
      
      This reverts commit 7f99034353080308f4783f27d9a08bea343fb796.
      
      * fix the documentation
      
      * disable NIDs
      
      * improve the documentation in the code
      
      * use the stream argument in pcg32 instead of skipping ahead t times, can discard the use of hashmap now since it is faster this way.
      
      * fix linting issue
      
      * address another round of reviews
      
      * further optimize CPU LABOR sampling implementation
      
      * fix linting error
      
      * update the comment
      
      * reformat
      
      * rename and rephrase comment
      
      * fix formatting according to new linting specs
      
      * fix compile error due to renaming, fix linting.
      
      * lint
      
      * rename DGLHeteroGraph to DGLGraph to match master
      
      * replace other occurrences of DGLHeteroGraph to DGLGraph
      Co-authored-by: default avatarMuhammed Fatih BALIN <m.f.balin@gmail.com>
      Co-authored-by: default avatarKaan Sancak <kaansnck@gmail.com>
      Co-authored-by: default avatarQuan Gan <coin2028@hotmail.com>
      bf264d00
  5. 15 Nov, 2022 4 commits
  6. 10 Nov, 2022 1 commit
  7. 08 Nov, 2022 2 commits
  8. 07 Nov, 2022 4 commits
  9. 06 Nov, 2022 2 commits
  10. 04 Nov, 2022 4 commits
  11. 03 Nov, 2022 3 commits
  12. 02 Nov, 2022 1 commit
  13. 29 Oct, 2022 1 commit
    • Quan (Andy) Gan's avatar
      [Sampling] Enable sampling with edge masks in sample_etype_neighbors (#4749) · 2bca4759
      Quan (Andy) Gan authored
      * sample neighbors with masks
      
      * oops
      
      * refactor again
      
      * remove
      
      * remove debug code
      
      * rename macro
      
      * address comments
      
      * more stuff
      
      * remove
      
      * fix
      
      * try fix unit test
      
      * oops
      
      * fix test
      
      * oops
      
      * change name
      
      * rename a lot of stuff
      
      * oops
      
      * ugh
      
      * misc fixes
      
      * lint
      
      * address a lot of comments
      
      * lint
      
      * lint
      
      * fix
      
      * that was silly
      
      * fix
      
      * fix
      
      * fix
      
      * oops
      2bca4759
  14. 28 Oct, 2022 1 commit
  15. 19 Oct, 2022 1 commit
  16. 13 Oct, 2022 2 commits
  17. 11 Oct, 2022 1 commit
  18. 21 Sep, 2022 1 commit
  19. 19 Sep, 2022 2 commits
  20. 15 Sep, 2022 1 commit
    • Xin Yao's avatar
      [Feature] Import PyTorch's CUDA stream management (#4503) · 9a00cf19
      Xin Yao authored
      * add set_stream
      
      * add .record_stream for NDArray and HeteroGraph
      
      * refactor dgl stream Python APIs
      
      * test record_stream
      
      * add unit test for record stream
      
      * use pytorch's stream
      
      * fix lint
      
      * fix cpu build
      
      * address comments
      
      * address comments
      
      * add record stream tests for dgl.graph
      
      * record frames and update dataloder
      
      * add docstring
      
      * update frame
      
      * add backend check for record_stream
      
      * remove CUDAThreadEntry::stream
      
      * record stream for newly created formats
      
      * fix bug
      
      * fix cpp test
      
      * fix None c_void_p to c_handle
      9a00cf19
  21. 06 Sep, 2022 1 commit
    • Chang Liu's avatar
      [Feature] Unify the cuda stream used in core library (#4480) · 1c9d2a03
      Chang Liu authored
      
      
      * Use an internal cuda stream for CopyDataFromTo
      
      * small fix white space
      
      * Fix to compile
      
      * Make stream optional in copydata for compile
      
      * fix lint issue
      
      * Update cub functions to use internal stream
      
      * Lint check
      
      * Update CopyTo/CopyFrom/CopyFromTo to use internal stream
      
      * Address comments
      
      * Fix backward CUDA stream
      
      * Avoid overloading CopyFromTo()
      
      * Minor comment update
      
      * Overload copydatafromto in cuda device api
      Co-authored-by: default avatarxiny <xiny@nvidia.com>
      1c9d2a03
  22. 05 Sep, 2022 2 commits
  23. 31 Aug, 2022 1 commit
    • Xin Yao's avatar
      [Feature] Make TensorAdapter Stream Aware (#4472) · 2b766740
      Xin Yao authored
      * Allocate tensors in DGL's current stream
      
      * make tensoradaptor stream-aware
      
      * replace TAemtpy with cpu allocator
      
      * fix typo
      
      * try fix cpu allocation
      
      * clean header
      
      * redirect AllocDataSpace as well
      
      * resolve comments
      2b766740