1. 23 Sep, 2024 1 commit
  2. 05 Dec, 2023 1 commit
  3. 17 Aug, 2023 1 commit
  4. 11 Aug, 2023 1 commit
  5. 10 May, 2023 1 commit
  6. 06 Apr, 2023 1 commit
  7. 15 Mar, 2023 1 commit
  8. 21 Feb, 2023 1 commit
  9. 09 Feb, 2023 1 commit
  10. 07 Nov, 2022 2 commits
  11. 06 Nov, 2022 1 commit
  12. 04 Nov, 2022 1 commit
  13. 29 Oct, 2022 1 commit
    • Quan (Andy) Gan's avatar
      [Sampling] Enable sampling with edge masks in sample_etype_neighbors (#4749) · 2bca4759
      Quan (Andy) Gan authored
      * sample neighbors with masks
      
      * oops
      
      * refactor again
      
      * remove
      
      * remove debug code
      
      * rename macro
      
      * address comments
      
      * more stuff
      
      * remove
      
      * fix
      
      * try fix unit test
      
      * oops
      
      * fix test
      
      * oops
      
      * change name
      
      * rename a lot of stuff
      
      * oops
      
      * ugh
      
      * misc fixes
      
      * lint
      
      * address a lot of comments
      
      * lint
      
      * lint
      
      * fix
      
      * that was silly
      
      * fix
      
      * fix
      
      * fix
      
      * oops
      2bca4759
  14. 19 Sep, 2022 1 commit
    • Xin Yao's avatar
      [Feature] Bump DLPack to v0.7 and decouple DLPack from the core library (#4454) · cded5b80
      Xin Yao authored
      * rename `DLContext` to `DGLContext`
      
      * rename `kDLGPU` to `kDLCUDA`
      
      * replace DLTensor with DGLArray
      
      * fix linting
      
      * Unify DGLType and DLDataType to DGLDataType
      
      * Fix FFI
      
      * rename DLDeviceType to DGLDeviceType
      
      * decouple dlpack from the core library
      
      * fix bug
      
      * fix lint
      
      * fix merge
      
      * fix build
      
      * address comments
      
      * rename dl_converter to dlpack_convert
      
      * remove redundant comments
      cded5b80
  15. 15 Sep, 2022 1 commit
    • Xin Yao's avatar
      [Feature] Import PyTorch's CUDA stream management (#4503) · 9a00cf19
      Xin Yao authored
      * add set_stream
      
      * add .record_stream for NDArray and HeteroGraph
      
      * refactor dgl stream Python APIs
      
      * test record_stream
      
      * add unit test for record stream
      
      * use pytorch's stream
      
      * fix lint
      
      * fix cpu build
      
      * address comments
      
      * address comments
      
      * add record stream tests for dgl.graph
      
      * record frames and update dataloder
      
      * add docstring
      
      * update frame
      
      * add backend check for record_stream
      
      * remove CUDAThreadEntry::stream
      
      * record stream for newly created formats
      
      * fix bug
      
      * fix cpp test
      
      * fix None c_void_p to c_handle
      9a00cf19
  16. 06 Sep, 2022 1 commit
    • Chang Liu's avatar
      [Feature] Unify the cuda stream used in core library (#4480) · 1c9d2a03
      Chang Liu authored
      
      
      * Use an internal cuda stream for CopyDataFromTo
      
      * small fix white space
      
      * Fix to compile
      
      * Make stream optional in copydata for compile
      
      * fix lint issue
      
      * Update cub functions to use internal stream
      
      * Lint check
      
      * Update CopyTo/CopyFrom/CopyFromTo to use internal stream
      
      * Address comments
      
      * Fix backward CUDA stream
      
      * Avoid overloading CopyFromTo()
      
      * Minor comment update
      
      * Overload copydatafromto in cuda device api
      Co-authored-by: default avatarxiny <xiny@nvidia.com>
      1c9d2a03
  17. 27 Jul, 2022 1 commit
  18. 08 Jun, 2022 1 commit
  19. 11 May, 2022 1 commit
  20. 27 Apr, 2022 1 commit
    • Rhett Ying's avatar
      [Feature] enable socket net_type for rpc (#3951) · 37be02a4
      Rhett Ying authored
      * [Feature] enable socket net_type for rpc
      
      * fix lint
      
      * fix lint
      
      * fix build issue on windows
      
      * fix test failure on windows
      
      * fix test failure
      
      * fix cpp unit test failure
      
      * net_type blocking max_try_times
      
      * fix other comments
      
      * fix lint
      
      * fix comment
      
      * fix lint
      
      * fix cpp
      37be02a4
  21. 06 Dec, 2021 1 commit
  22. 10 Nov, 2021 1 commit
  23. 06 Nov, 2021 1 commit
  24. 04 Nov, 2021 1 commit
  25. 28 Sep, 2021 1 commit
  26. 17 Sep, 2021 1 commit
  27. 14 Sep, 2021 1 commit
  28. 01 Sep, 2021 2 commits
  29. 20 Aug, 2021 1 commit
    • nv-dlasalle's avatar
      [Feature][DistDGL] Add NCCL support for range based partitions (#3213) · 7f927939
      nv-dlasalle authored
      * Implement range based NDArrayPartition
      
      * Finish implement range based partition support
      
      * Add unit test
      
      * Fix whitepace
      
      * Add Kernel suffix
      
      * Fix argument passing
      
      * Add doxygen docs and improve variable naming
      
      * Add unit test
      
      * Add function for converting a partition book
      
      * Add example to partition_op docs
      
      * Fix dtype conversion for mxnet and tensorflow
      7f927939
  30. 28 Jul, 2021 1 commit
    • xiang song(charlie.song)'s avatar
      [New Feature] Per edge type sampler for to_homogeneous graphs. (#3131) · ba7e7cf9
      xiang song(charlie.song) authored
      
      
      * fix.
      
      * fix.
      
      * fix.
      
      * fix.
      
      * Fix test
      
      * Deprecate old DistEmbedding impl, use synchronized embedding impl
      
      * Basic imple of heterogeneous on homogenenous sampling
      
      * make pass
      
      * Pass C++ test
      
      * Add python test code
      
      * lint
      
      * lint
      
      * Add MultiLayerEtypeNeighborSampler
      
      * Add unitest for single machine dataloader
      
      * Add dist dataloader test for edge type sampler
      
      * Fix lint
      
      * fix
      
      * support for per etype sample
      
      * Fix some bug and enable distributed training with per edge sample
      
      * fix
      
      * Now distributed training works
      
      * turn off some mxnet
      
      * turn off mxnet for some dist test
      
      * fix
      
      * upd
      
      * upd according to the comments
      
      * Fix
      
      * Fix test and now distributed works.
      
      * upd
      
      * upd
      
      * Fix
      
      * Fix bug
      
      * remove dead code.
      
      * upd
      
      * Fix
      
      * upd
      
      * Fix
      Co-authored-by: default avatarUbuntu <ubuntu@ip-172-31-71-112.ec2.internal>
      Co-authored-by: default avatarUbuntu <ubuntu@ip-172-31-2-66.ec2.internal>
      Co-authored-by: default avatarDa Zheng <zhengda1936@gmail.com>
      ba7e7cf9
  31. 23 Jun, 2021 1 commit
    • Qidong Su's avatar
      [Feature] Biased Neighbor Sampling (#2987) · e56bbafd
      Qidong Su authored
      
      
      * update
      
      * update
      
      * update
      
      * update
      
      * lint
      
      * lint
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * lint
      
      * update
      
      * clone
      
      * update
      
      * update
      
      * update
      
      * update
      
      * replace idarray with ndarray
      
      * refactor cpp part
      
      * refactor python part
      
      * debug
      
      * refactor interface
      
      * test and doc
      
      * lint and test
      
      * lint
      
      * fix
      
      * fix
      
      * fix
      
      * const
      
      * doc
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix & doc
      
      * fix
      
      * fix
      
      * update
      
      * update
      
      * update
      
      * merge
      
      * doc
      
      * doc
      
      * lint
      
      * fix
      
      * more tests
      
      * doc
      
      * fix
      
      * fix
      
      * update
      
      * update
      
      * update
      
      * fix
      
      * fix
      Co-authored-by: default avatarMinjie Wang <wmjlyjemaine@gmail.com>
      e56bbafd
  32. 11 Jun, 2021 1 commit
    • nv-dlasalle's avatar
      [Feature] Allow using NCCL for communication in dgl.NodeEmbedding and dgl.SparseOptimizer (#2824) · 17d604b5
      nv-dlasalle authored
      
      
      * Split from NCCL PR
      
      * Fix type in comment
      
      * Expand documentation for sparse_all_to_all_push
      
      * Restore previous behavior in example
      
      * Re-work optimizer to use NCCL based on gradient location
      
      * Allow for running with embedding on CPU but using NCCL for gradient exchange
      
      * Optimize single partition case
      
      * Fix pylint errors
      
      * Add missing include
      
      * fix gradient indexing
      
      * Fix line continuation
      
      * Migrate 'first_step'
      
      * Skip tests without enough GPUs to run NCCL
      
      * Improve empty tensor handling for pytorch 1.5
      
      * Fix indentation
      
      * Allow multiple NCCL communicator to coexist
      
      * Improve handling of empty message
      
      * Update python/dgl/nn/pytorch/sparse_emb.py
      Co-authored-by: default avatarxiang song(charlie.song) <classicxsong@gmail.com>
      
      * Update python/dgl/nn/pytorch/sparse_emb.py
      Co-authored-by: default avatarxiang song(charlie.song) <classicxsong@gmail.com>
      
      * Keepy empty tensor dimensionaless
      
      * th.empty -> th.tensor
      
      * Preserve shape for empty non-zero dimension tensors
      
      * Use shared state, when embedding is shared
      
      * Add support for gathering an embedding
      
      * Fix typo
      
      * Fix more typos
      
      * Fix backend call
      
      * Use NodeDataLoader to take advantage of ddp
      
      * Update training script to share memory
      
      * Only squeeze last dimension
      
      * Better handle empty message
      
      * Keep embedding on the target device GPU if dgl_sparse if false in RGCN example
      
      * Fix typo in comment
      
      * Add asserts
      
      * Improve documentation in example
      Co-authored-by: default avatarxiang song(charlie.song) <classicxsong@gmail.com>
      17d604b5
  33. 10 Jun, 2021 1 commit
  34. 20 May, 2021 1 commit
    • nv-dlasalle's avatar
      [Feature][Performance] Implement NCCL wrapper for communicating NodeEmbeddings... · ae8dbe6d
      nv-dlasalle authored
      
      [Feature][Performance] Implement NCCL wrapper for communicating NodeEmbeddings and sparse gradients. (#2825)
      
      * Split NCCL wrapper from sparse optimizer and sparse embedding
      
      * Add more unit tests for single node nccl
      
      * Fix unit test for tf
      
      * Switch to device histogram
      
      * Fix histgram issues
      
      * Finish migration to histogram
      
      * Handle cases with zero send/recieve data
      
      * Start on partition object
      
      * Get compiling
      
      * Updates
      
      * Add unit tests
      
      * Switch to partition object
      
      * Fix linting issues
      
      * Rename partition file
      
      * Add python doc
      
      * Fix python assert and finish doxygen comments
      
      * Remove stubs for range based partition to satisfy pylint
      
      * Wrap unit test in GPU only
      
      * Wrap explicit cuda call in ifdef
      
      * Merge with partition.py
      
      * update docstrings
      
      * Cleanup partition_op
      
      * Add Workspace object
      
      * Switch to using workspace object
      
      * Move last remainder based function out of nccl_api
      
      * Add error messages
      
      * Update docs with examples
      
      * Fix linting erros
      Co-authored-by: default avatarxiang song(charlie.song) <classicxsong@gmail.com>
      ae8dbe6d
  35. 27 Apr, 2021 1 commit
  36. 22 Apr, 2021 1 commit
  37. 24 Mar, 2021 1 commit
    • Quan (Andy) Gan's avatar
      [Feature] Sparse-sparse matrix multiplication, addition, and masking (#2753) · 929d8634
      Quan (Andy) Gan authored
      * test
      
      * more stuff
      
      * add test
      
      * fixes
      
      * optimize algo
      
      * replace unordered_map with arrays
      
      * lint
      
      * lint x2
      
      * oops
      
      * disable gpu csrmm tests
      
      * remove gpu invocation
      
      * optimize with openmp
      
      * remove python functions
      
      * add back with docstrings
      
      * lint
      
      * lint
      
      * update python interface
      
      * functionize
      
      * functionize
      
      * lint
      
      * lint
      929d8634
  38. 27 Jan, 2021 1 commit
    • nv-dlasalle's avatar
      [Performance] Improve COO to CSR, and sort columns of CSR only when necessary. (#2391) · 2576647c
      nv-dlasalle authored
      * Remove double-checking sorted
      
      * Remove sorting of CSR by default
      
      * Update unit test to use unsorted matix
      
      * delete whitespace
      
      * Expand unit tests
      
      * Replace cusparse sort
      
      * Fix row column sorting
      
      * Explicitly don't sort columns
      
      * Fix linting errors
      
      * Fix bit-width calculation
      
      * Fix sorting assertion and unit test
      
      * Fix linting
      
      * Improve CPU COO2CSR
      
      * Remove references
      
      * Rename and add documentation to edge encoding/decoding funcionts
      
      * Fix sorting keys as 64 bit
      
      * Revert cosmetic changes to unit tests
      
      * Update documentation
      
      * Update complexity documentation for coo to csr conversion
      
      * Remove COOIsSorted check in CPU implementation too
      2576647c