  1. 13 Feb, 2023 1 commit
    • enable sparse on windows and mac (#5277) · f62669b0
      Quan (Andy) Gan authored
      
      
      * enable sparse on windows and mac
      
      * that was stupid
      
      * let's see what's going on..
      
      * [Sparse] Fix the import error on macOS.
      
      When using template functions that are defined in source files from DGL,
      the macOS loader somehow cannot find their definitions. This fix simply
      avoids depending on template functions from DGL headers.
      
      With this fix, the sparse tests all pass in the macOS environment.
      
      * ok this is the problem
      
      * make errors clearer
      
      * uh
      
      * test
      
      * Update __init__.py
      
      * disabling ddp on windows
      
      ---------
      Co-authored-by: czkkkkkk <zekucai@gmail.com>
  2. 09 Feb, 2023 1 commit
  3. 12 Jan, 2023 1 commit
  4. 06 Jan, 2023 1 commit
  5. 15 Dec, 2022 1 commit
  6. 12 Dec, 2022 2 commits
  7. 09 Dec, 2022 1 commit
  8. 06 Dec, 2022 1 commit
    • Add support for next cusparse release (#4974) · fb223d47
      Chang Liu authored
      * Add support for next cusparse release
      
      * Fix lint
      
      * Add switch and tune the performance
      
      * Fix lint issue
      
      * Fine tune the heuristics
      
      * Fix lint issue
      
      * Address comments
      
      * Minor fix
      
      * Address comments
  9. 01 Dec, 2022 1 commit
  10. 24 Nov, 2022 1 commit
  11. 22 Nov, 2022 2 commits
    • [Performance] Leverage hashmap to accelerate CSRSliceMatrix<kDGLCUDA, IdType> (#4924) · aa419895
      Ping Gong authored
      
      
      * Leverage hashmap to accelerate CSRSliceMatrix
      
      * fix lint check
      
      * use `min` in cuda_runtime.ch
      
      * fix hash func
      
      * add some comments and adjust the <grid,block> of the _SegmentMaskColKernel kernel
      
      * set device and stream for thrust::for_each
      
      * use thrust::cuda::par_nosync
      Co-authored-by: Xin Yao <xiny@nvidia.com>
    • [Feature] (La)yer-Neigh(bor) sampling implementation (#4668) · bf264d00
      Muhammed Fatih BALIN authored
      
      
      * adding LABOR sampling
      
      * add ladies and pladies samplers
      
      * fix compile error after rebase
      
      * add reference for ladies sampler
      
      * Improve ladies implementation.
      
      * weighted labor sampling initial implementation draft
      fix indentation and small bug in ladies script
      
      * importance_sampling currently doesn't work with weights
      
      * fix weighted importance sampling
      
      * move labor example into its own folder
      
      * lint fixes
      
      * Improve documentation
      
      * remove examples from the main PR
      
      * fix linting by not using c++17 features
      
      * fix documentation of labor_sampler.py
      
      * update documentation for labor.py
      
      * reformat the labor.py file with black
      
      * fix linting errors
      
      * replace exception use with if
      
      * fix typo in error comment
      
      * fixing win64 build for ci
      
      * fixing weighted implementation, works now.
      
      * fix bug in the weighted case and importance_sampling==0
      
      * address part of the reviews
      
      * remove unused code paths from cuda
      
      * remove unused code path from cpu side
      
      * remove extra features of labor making use of random seed.
      
      * fix exclude_edges bug
      
      * remove pcg and seed logic from cpu implementation, seed logic should still work for cuda.
      
      * minor style change
      
      * refactor CPU implementation, take out the importance_sampling probability computation into a function.
      
      * improve CUDAWorkspaceAllocator
      
      * refactor importance_sampling part out to a function
      
      * minor optimization
      
      * fix linting issue
      
      * Revert "remove pcg and seed logic from cpu implementation, seed logic should still work for cuda."
      
      This reverts commit c250e07ac6d7e13f57e79e8a2c2f098d777378c2.
      
      * Revert "remove extra features of labor making use of random seed."
      
      This reverts commit 7f99034353080308f4783f27d9a08bea343fb796.
      
      * fix the documentation
      
      * disable NIDs
      
      * improve the documentation in the code
      
      * use the stream argument in pcg32 instead of skipping ahead t times, can discard the use of hashmap now since it is faster this way.
      
      * fix linting issue
      
      * address another round of reviews
      
      * further optimize CPU LABOR sampling implementation
      
      * fix linting error
      
      * update the comment
      
      * reformat
      
      * rename and rephrase comment
      
      * fix formatting according to new linting specs
      
      * fix compile error due to renaming, fix linting.
      
      * lint
      
      * rename DGLHeteroGraph to DGLGraph to match master
      
      * replace other occurrences of DGLHeteroGraph to DGLGraph
      Co-authored-by: Muhammed Fatih BALIN <m.f.balin@gmail.com>
      Co-authored-by: Kaan Sancak <kaansnck@gmail.com>
      Co-authored-by: Quan Gan <coin2028@hotmail.com>
  12. 15 Nov, 2022 4 commits
  13. 10 Nov, 2022 1 commit
  14. 08 Nov, 2022 2 commits
  15. 07 Nov, 2022 4 commits
  16. 06 Nov, 2022 2 commits
  17. 03 Nov, 2022 2 commits
  18. 02 Nov, 2022 1 commit
  19. 29 Oct, 2022 1 commit
    • [Sampling] Enable sampling with edge masks in sample_etype_neighbors (#4749) · 2bca4759
      Quan (Andy) Gan authored
      * sample neighbors with masks
      
      * oops
      
      * refactor again
      
      * remove
      
      * remove debug code
      
      * rename macro
      
      * address comments
      
      * more stuff
      
      * remove
      
      * fix
      
      * try fix unit test
      
      * oops
      
      * fix test
      
      * oops
      
      * change name
      
      * rename a lot of stuff
      
      * oops
      
      * ugh
      
      * misc fixes
      
      * lint
      
      * address a lot of comments
      
      * lint
      
      * lint
      
      * fix
      
      * that was silly
      
      * fix
      
      * fix
      
      * fix
      
      * oops
  20. 28 Oct, 2022 1 commit
  21. 13 Oct, 2022 2 commits
  22. 11 Oct, 2022 1 commit
  23. 21 Sep, 2022 1 commit
  24. 19 Sep, 2022 1 commit
    • [Feature] Bump DLPack to v0.7 and decouple DLPack from the core library (#4454) · cded5b80
      Xin Yao authored
      * rename `DLContext` to `DGLContext`
      
      * rename `kDLGPU` to `kDLCUDA`
      
      * replace DLTensor with DGLArray
      
      * fix linting
      
      * Unify DGLType and DLDataType to DGLDataType
      
      * Fix FFI
      
      * rename DLDeviceType to DGLDeviceType
      
      * decouple dlpack from the core library
      
      * fix bug
      
      * fix lint
      
      * fix merge
      
      * fix build
      
      * address comments
      
      * rename dl_converter to dlpack_convert
      
      * remove redundant comments
  25. 15 Sep, 2022 1 commit
    • [Feature] Import PyTorch's CUDA stream management (#4503) · 9a00cf19
      Xin Yao authored
      * add set_stream
      
      * add .record_stream for NDArray and HeteroGraph
      
      * refactor dgl stream Python APIs
      
      * test record_stream
      
      * add unit test for record stream
      
      * use pytorch's stream
      
      * fix lint
      
      * fix cpu build
      
      * address comments
      
      * address comments
      
      * add record stream tests for dgl.graph
      
      * record frames and update dataloader
      
      * add docstring
      
      * update frame
      
      * add backend check for record_stream
      
      * remove CUDAThreadEntry::stream
      
      * record stream for newly created formats
      
      * fix bug
      
      * fix cpp test
      
      * fix None c_void_p to c_handle
  26. 06 Sep, 2022 1 commit
    • [Feature] Unify the cuda stream used in core library (#4480) · 1c9d2a03
      Chang Liu authored
      
      
      * Use an internal cuda stream for CopyDataFromTo
      
      * small fix white space
      
      * Fix to compile
      
      * Make stream optional in copydata for compile
      
      * fix lint issue
      
      * Update cub functions to use internal stream
      
      * Lint check
      
      * Update CopyTo/CopyFrom/CopyFromTo to use internal stream
      
      * Address comments
      
      * Fix backward CUDA stream
      
      * Avoid overloading CopyFromTo()
      
      * Minor comment update
      
      * Overload copydatafromto in cuda device api
      Co-authored-by: xiny <xiny@nvidia.com>
  27. 05 Sep, 2022 1 commit
  28. 12 Aug, 2022 1 commit