1. 21 Jun, 2021 2 commits
  2. 14 Jun, 2021 2 commits
  3. 10 Jun, 2021 1 commit
  4. 03 Jun, 2021 1 commit
  5. 01 Jun, 2021 1 commit
    • Qidong Su's avatar
      [Feature][Sampler] Sort CSR by tag (#1664) · b8fe2b48
      Qidong Su authored
      
      
      * update
      
      * update
      
      * update
      
      * update
      
      * lint
      
      * lint
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * lint
      
      * update
      
      * clone
      
      * update
      
      * update
      
      * update
      
      * update
      
      * replace idarray with ndarray
      
      * refactor cpp part
      
      * refactor python part
      
      * debug
      
      * refactor interface
      
      * test and doc
      
      * lint and test
      
      * lint
      
      * fix
      
      * fix
      
      * fix
      
      * const
      
      * doc
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix & doc
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * update
      Co-authored-by: default avatarMinjie Wang <wmjlyjemaine@gmail.com>
      b8fe2b48
  6. 31 May, 2021 1 commit
  7. 20 May, 2021 1 commit
    • nv-dlasalle's avatar
      [Feature][Performance] Implement NCCL wrapper for communicating NodeEmbeddings... · ae8dbe6d
      nv-dlasalle authored
      
      [Feature][Performance] Implement NCCL wrapper for communicating NodeEmbeddings and sparse gradients. (#2825)
      
      * Split NCCL wrapper from sparse optimizer and sparse embedding
      
      * Add more unit tests for single node nccl
      
      * Fix unit test for tf
      
      * Switch to device histogram
      
      * Fix histgram issues
      
      * Finish migration to histogram
      
      * Handle cases with zero send/recieve data
      
      * Start on partition object
      
      * Get compiling
      
      * Updates
      
      * Add unit tests
      
      * Switch to partition object
      
      * Fix linting issues
      
      * Rename partition file
      
      * Add python doc
      
      * Fix python assert and finish doxygen comments
      
      * Remove stubs for range based partition to satisfy pylint
      
      * Wrap unit test in GPU only
      
      * Wrap explicit cuda call in ifdef
      
      * Merge with partition.py
      
      * update docstrings
      
      * Cleanup partition_op
      
      * Add Workspace object
      
      * Switch to using workspace object
      
      * Move last remainder based function out of nccl_api
      
      * Add error messages
      
      * Update docs with examples
      
      * Fix linting erros
      Co-authored-by: default avatarxiang song(charlie.song) <classicxsong@gmail.com>
      ae8dbe6d
  8. 18 May, 2021 1 commit
  9. 17 May, 2021 1 commit
  10. 11 May, 2021 1 commit
  11. 27 Apr, 2021 1 commit
  12. 22 Apr, 2021 1 commit
  13. 15 Apr, 2021 1 commit
    • nv-dlasalle's avatar
      [Performance][GPU] Enable GPU uniform edge sampling (#2716) · e70138bb
      nv-dlasalle authored
      
      
      * Start on uniform GPU sampling
      
      * Save more work
      
      * Get cu file compiling
      
      * Update sampling
      
      * More changes
      
      * Get GPU sampling for uniform probabilities solved
      
      * Fix batch tensor migration
      
      * Fix
      
      * update kernels
      
      * expand blocking
      
      * Undo testing change
      
      * Cut down on sampling overhead
      
      * Fix replacement
      
      * Update unit tests
      
      * Add option to gpu sample in graphsage
      
      * Copy only csc to gpu
      
      * Add ogbn support
      
      * Fix linting
      
      * Remove nvtx from sample
      
      * Improve documentation and error checking
      
      * Expand documentation
      
      * Update assert checking
      
      * delete extra space
      
      * Use standard dataloader when dataset is a dictionary
      
      * ogb -> ogbn
      
      * Fix edge selection determinism
      
      * Fix typos
      
      * Remove nvtx
      
      * Add comment for self.fanout_arrays and assert
      
      * Fix linting
      
      * Migrate to scalarbatcher
      
      * Fix indentation
      
      * Fix batcher
      
      * Fix indexing
      
      * Only use databatcher for GPU
      
      * Convert to DGL NDArray to PyTorch Tensor
      
      * Add optimization for PyTorch's F.tensor() for list of GPU tensors
      Co-authored-by: default avatarDa Zheng <zhengda1936@gmail.com>
      e70138bb
  14. 21 Feb, 2021 1 commit
  15. 08 Feb, 2021 1 commit
    • nv-dlasalle's avatar
      [Sampling] Implement `dgl.to_block()` for the GPU (#2339) · bc3a532f
      nv-dlasalle authored
      
      
      * Add start of to_block gpu implementation
      
      * Pull in more changes from 0.4.2 cuda_to_block
      
      * Move more code to IdArray
      
      * Refactor DeviceNodeMapMaker
      
      * Updates
      
      * get compiling
      
      * Integrate to_block
      
      * Fix ID allocation
      
      * Minor fixes
      
      * Cleanup cuda calls to use cuda_common
      
      * Reduce kernel calls
      
      * Lint cleanup
      
      * Expand documentation
      
      * Remove unused function
      
      * Rename variables for consistency
      
      * Add doxygen comments
      
      * Fix file extension
      
      * Remove raw asynccopy for deviceapi
      
      * Remove unused function
      
      * Fix block/tile configuration
      
      * Add cuda_device_common.cuh
      
      * Add basic hashtable
      
      * Migrate part of hashtable
      
      * Refactor to use external hashtable
      
      * Make functions members
      
      * Format hash table functions
      
      * Migrate duplicate filling
      
      * Move last function over
      
      * Refactor with cu file
      
      * lint c++ code
      
      * Move context check to C++ code
      
      * Use macro switch
      
      * Add missing files
      
      * Update docstring
      
      * update docs
      
      * Move atomic functions
      
      * Refactor hashtable
      
      * Fix linting
      
      * Expand docs
      
      * Fix mismatched argument names
      
      * Switch doxygen comments from using @param to \param
      Co-authored-by: default avatarJinjing Zhou <VoVAllen@users.noreply.github.com>
      Co-authored-by: default avatarMinjie Wang <wmjlyjemaine@gmail.com>
      bc3a532f
  16. 05 Feb, 2021 1 commit
  17. 03 Feb, 2021 1 commit
  18. 25 Jan, 2021 1 commit
  19. 24 Jan, 2021 1 commit
  20. 17 Jan, 2021 1 commit
  21. 14 Jan, 2021 1 commit
  22. 06 Jan, 2021 1 commit
  23. 05 Jan, 2021 1 commit
  24. 04 Jan, 2021 2 commits
  25. 26 Dec, 2020 1 commit
  26. 22 Nov, 2020 1 commit
  27. 07 Nov, 2020 1 commit
  28. 05 Nov, 2020 1 commit
    • Xiangkun Hu's avatar
      [Bugfix] GINDataset check whether labels are all valid (#2319) · 5f198763
      Xiangkun Hu authored
      * PPIDataset
      
      * Revert "PPIDataset"
      
      This reverts commit 264bd0c960cfa698a7bb946dad132bf52c2d0c8a.
      
      * check whether the original labels are valid
      
      * add unit test for gin dataset
      
      * bug of asscalar in mxnet<=1.6
      
      * mxnet<=1.6 asscalar requires ndarray to be shaped (1,)
      
      * skip gpu while testing datasets
      
      * Update test_data.py
      
      * test of gin dataset takes too much time
      5f198763
  29. 30 Oct, 2020 1 commit
    • nv-dlasalle's avatar
      [Dataloading] Add class for copying tensors to/from the GPU on a non-default stream (#2284) · f673fc25
      nv-dlasalle authored
      * Add async transferer class
      
      * Add async ndarray copy interface
      
      * Add python bindings
      
      * Fix comment
      
      * Add python class
      
      * Fix linting issues
      
      * Add python unit test
      
      * Update python interface
      
      * move async_transferer to cuda only directory
      
      * Fix linting issue
      
      * Move out of contrib
      
      * Add doc strings
      
      * Move test compute from backend
      
      * Update comment
      
      * Fix test naming
      
      * Fix argument usage
      
      * Wrap/unwrap backend parameters
      
      * Move to dataloading
      
      * Move to 'dataloading'
      
      * Make GPU/CPU compatible
      
      * Fix unit tests
      
      * Add docs
      
      * Use only backend interface for datamovement in unit test
      f673fc25
  30. 17 Oct, 2020 1 commit
  31. 29 Sep, 2020 1 commit
  32. 24 Sep, 2020 1 commit
  33. 11 Sep, 2020 2 commits
  34. 10 Sep, 2020 2 commits
  35. 08 Sep, 2020 1 commit