1. 24 Sep, 2024 2 commits
  2. 13 Sep, 2024 1 commit
  3. 29 Feb, 2024 1 commit
  4. 14 Jul, 2023 1 commit
  5. 08 Mar, 2023 1 commit
    • [Refactor] Replace third_party/nccl with PyTorch's NCCL backend (#4989) · 8d5d8962
      Xin Yao authored
      * expose GeneratePermutation
      
      * add sparse_all_to_all_push
      
      * add sparse_all_to_all_pull
      
      * add unit test
      
      * handle world_size=1
      
      * remove python nccl wrapper
      
      * remove the nccl dependency
      
      * use pinned memory to speed up D2H copy
      
      * fix lint
      
      * resolve comments
      
      * fix lint
      
      * fix ut
      
      * resolve comments
  6. 16 Feb, 2023 1 commit
  7. 07 Nov, 2022 1 commit
  8. 06 Nov, 2022 1 commit
  9. 04 Nov, 2022 1 commit
  10. 19 Sep, 2022 1 commit
    • [Feature] Bump DLPack to v0.7 and decouple DLPack from the core library (#4454) · cded5b80
      Xin Yao authored
      * rename `DLContext` to `DGLContext`
      
      * rename `kDLGPU` to `kDLCUDA`
      
      * replace DLTensor with DGLArray
      
      * fix linting
      
      * Unify DGLType and DLDataType to DGLDataType
      
      * Fix FFI
      
      * rename DLDeviceType to DGLDeviceType
      
      * decouple dlpack from the core library
      
      * fix bug
      
      * fix lint
      
      * fix merge
      
      * fix build
      
      * address comments
      
      * rename dl_converter to dlpack_convert
      
      * remove redundant comments
  11. 15 Sep, 2022 1 commit
    • [Feature] Import PyTorch's CUDA stream management (#4503) · 9a00cf19
      Xin Yao authored
      * add set_stream
      
      * add .record_stream for NDArray and HeteroGraph
      
      * refactor dgl stream Python APIs
      
      * test record_stream
      
      * add unit test for record stream
      
      * use pytorch's stream
      
      * fix lint
      
      * fix cpu build
      
      * address comments
      
      * address comments
      
      * add record stream tests for dgl.graph
      
      * record frames and update dataloader
      
      * add docstring
      
      * update frame
      
      * add backend check for record_stream
      
      * remove CUDAThreadEntry::stream
      
      * record stream for newly created formats
      
      * fix bug
      
      * fix cpp test
      
      * fix None c_void_p to c_handle
  12. 23 Jun, 2022 1 commit
    • [Fix] Fix compiler warnings - part 1 (#4051) · 1ad65879
      Triston authored
      
      
      * Fix a cub compile error for CUDA 11.5
      
      * Fix comparison of integer expressions of different signedness in coo_sort.cu file
      
      * Fix comparison of integer expressions of different signedness in cuda_compact_graph.cu file
      
      * Remove never referenced variable in spmm.cu
      
      * Fix comparison of integer expressions of different signedness in rowwise_pick.h file
      
      * Fix comparison of integer expressions of different signedness in choice.cc file
      
      * Remove never referenced variable col_data in spat_op_impl_coo.cc
      
      * Remove never referenced variable allowed in global_uniform.cc
      
      * Fix comparison of integer expressions of different signedness in graph.cc
      
      * Fix comparison of integer expressions of different signedness in graph_apis.cc
      
      * Fix the unused ctx variable in ndarray_partition.cc file for CPU-only build
      
      * Fix comparison of integer expressions of different signedness in libra_partition.cc
      
      * Fix comparison of integer expressions of different signedness in graph_op.cc
      Co-authored-by: Triston Cao <tristonc@nvidia.com>
      Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>
  13. 20 Aug, 2021 1 commit
    • [Feature][DistDGL] Add NCCL support for range based partitions (#3213) · 7f927939
      nv-dlasalle authored
      * Implement range based NDArrayPartition
      
      * Finish implementing range based partition support
      
      * Add unit test
      
      * Fix whitespace
      
      * Add Kernel suffix
      
      * Fix argument passing
      
      * Add doxygen docs and improve variable naming
      
      * Add unit test
      
      * Add function for converting a partition book
      
      * Add example to partition_op docs
      
      * Fix dtype conversion for mxnet and tensorflow
  14. 11 Jun, 2021 1 commit
    • [Feature] Allow using NCCL for communication in dgl.NodeEmbedding and dgl.SparseOptimizer (#2824) · 17d604b5
      nv-dlasalle authored
      
      
      * Split from NCCL PR
      
      * Fix type in comment
      
      * Expand documentation for sparse_all_to_all_push
      
      * Restore previous behavior in example
      
      * Re-work optimizer to use NCCL based on gradient location
      
      * Allow for running with embedding on CPU but using NCCL for gradient exchange
      
      * Optimize single partition case
      
      * Fix pylint errors
      
      * Add missing include
      
      * fix gradient indexing
      
      * Fix line continuation
      
      * Migrate 'first_step'
      
      * Skip tests without enough GPUs to run NCCL
      
      * Improve empty tensor handling for pytorch 1.5
      
      * Fix indentation
      
      * Allow multiple NCCL communicators to coexist
      
      * Improve handling of empty message
      
      * Update python/dgl/nn/pytorch/sparse_emb.py
      Co-authored-by: xiang song(charlie.song) <classicxsong@gmail.com>
      
      * Update python/dgl/nn/pytorch/sparse_emb.py
      Co-authored-by: xiang song(charlie.song) <classicxsong@gmail.com>
      
      * Keep empty tensor dimensionless
      
      * th.empty -> th.tensor
      
      * Preserve shape for empty non-zero dimension tensors
      
      * Use shared state, when embedding is shared
      
      * Add support for gathering an embedding
      
      * Fix typo
      
      * Fix more typos
      
      * Fix backend call
      
      * Use NodeDataLoader to take advantage of ddp
      
      * Update training script to share memory
      
      * Only squeeze last dimension
      
      * Better handle empty message
      
      * Keep embedding on the target device GPU if dgl_sparse is false in RGCN example
      
      * Fix typo in comment
      
      * Add asserts
      
      * Improve documentation in example
      Co-authored-by: xiang song(charlie.song) <classicxsong@gmail.com>
  15. 20 May, 2021 1 commit
    • [Feature][Performance] Implement NCCL wrapper for communicating NodeEmbeddings and sparse gradients. (#2825) · ae8dbe6d
      nv-dlasalle authored
      
      * Split NCCL wrapper from sparse optimizer and sparse embedding
      
      * Add more unit tests for single node nccl
      
      * Fix unit test for tf
      
      * Switch to device histogram
      
      * Fix histogram issues
      
      * Finish migration to histogram
      
      * Handle cases with zero send/receive data
      
      * Start on partition object
      
      * Get compiling
      
      * Updates
      
      * Add unit tests
      
      * Switch to partition object
      
      * Fix linting issues
      
      * Rename partition file
      
      * Add python doc
      
      * Fix python assert and finish doxygen comments
      
      * Remove stubs for range based partition to satisfy pylint
      
      * Wrap unit test in GPU only
      
      * Wrap explicit cuda call in ifdef
      
      * Merge with partition.py
      
      * update docstrings
      
      * Cleanup partition_op
      
      * Add Workspace object
      
      * Switch to using workspace object
      
      * Move last remainder based function out of nccl_api
      
      * Add error messages
      
      * Update docs with examples
      
      * Fix linting errors
      Co-authored-by: xiang song(charlie.song) <classicxsong@gmail.com>