  1. 16 Dec, 2021 1 commit
  2. 03 Dec, 2021 1 commit
    • [Feature] Add Min/max reducer in heterogeneous API for unary message functions (#3514) · cb0e1103
      Israt Nisa authored

      * min/max support for forward CPU heterograph
      
      * Added etype with each argU values
      
      * scatter_add needs fix
      
      * added scatter_add_hetero. Grads don't match for max reducer
      
      * storing ntype in argX
      
      * fixing scatter_add_hetero
      
      * hetero matches with torch's scatter add
      
      * works copy_e forward+cpu
      
      * added backward for copy_rhs
      
      * Computes gradient for all node types in one kernel
      
      * bug fix
      
      * unittest for max/min on CPU
      
      * renamed scatter_add_hetero to update_grad_minmax_hetero
      
      * lint check and comment out cuda call for max. Code is for CPU only
      
      * lint check
      
      * replace inf with zero
      
      * minor
      
      * lint check
      
      * removed LIBXSMM code from hetero code
      
      * fixing backward operator of UpdateGradMinMaxHetero
      
      * removed backward from update_grad_minmax_hetero
      
      * docstring
      
      * improved docstring and coding style
      
      * Added pass by pointer for output
      
      * typos and pass by references
      
      * Support for copy_rhs
      
      * Added header <string>
      
      * fix bug in copy_u_max
      
      * Added comments and dimension check of all etypes
      
      * skip mxnet check
      
      * pass by pointer output arrays
      
      * updated docstring
      Co-authored-by: Israt Nisa <nisisrat@amazon.com>
      Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>
  3. 30 Nov, 2021 1 commit
  4. 17 Nov, 2021 1 commit
  5. 06 Nov, 2021 1 commit
  6. 04 Nov, 2021 1 commit
  7. 03 Nov, 2021 1 commit
  8. 15 Oct, 2021 1 commit
  9. 07 Sep, 2021 1 commit
  10. 06 Sep, 2021 1 commit
  11. 24 Aug, 2021 1 commit
  12. 19 Aug, 2021 1 commit
  13. 18 Aug, 2021 1 commit
  14. 17 Aug, 2021 1 commit
  15. 02 Aug, 2021 1 commit
  16. 16 Jul, 2021 1 commit
  17. 08 Jul, 2021 2 commits
  18. 06 Jul, 2021 1 commit
    • [Feature] Add Heterograph support on Python for builtin unary msg functions (copy_u, copy_e) (#2989) · 188152b8
      Israt Nisa authored
      
      * heterograph for binary func
      
      * Added SDDMM support
      
      * Added unittest
      
      * added binary test cases
      
      * unary mfuncs works
      
      * Fixed lint err
      
      * lint check and others
      
      * lint check
      
      * fixed import *_hetero issue
      
      * lint check
      
      * replace torch with dgl backend
      
      * lint check
      
      * removed torch from test
      
      * skip mxnet unittest
      
      * skip gpu test
      
      * Remove unused/duplicated code
      
      * minor
      
      * changed data structure of ndata and edata
      
      * lint check
      
      * reorganized
      
      * minor lint
      
      * minor lint
      
      * raise error for udf func
      
      * lint check
      
      * fix for CUDA 10.1
      
      * add a note for future cross-type max/min reducing
      
      * Add support CUDA < 11
      
      * lint check
      
      * tidied C code
      
      * remove dummy GSDDMM_hetero backward implementation
      Co-authored-by: Israt Nisa <nisisrat@amazon.com>
      Co-authored-by: Minjie Wang <wmjlyjemaine@gmail.com>
      Co-authored-by: Quan Gan <coin2028@hotmail.com>
  19. 22 Jun, 2021 1 commit
    • [Kernel] Add heterograph support in CUDA kernels (SpMM, SDDMM) (#2925) · 1113f674
      Israt Nisa authored

      * Added heterograph support SpMM, SDDMM
      
      * bug fix cuda stream
      
      * add cudaStrm destroy and fix whitespace
      
      * Added heterograph support SpMM, SDDMM
      
      * bug fix cuda stream
      
      * add cudaStrm destroy and fix whitespace
      
      * changed max stream = 1
      
      * Fixed ctx
      
      * using default stream
      
      * Added heterograph support SpMM, SDDMM
      
      * bug fix cuda stream
      
      * add cudaStrm destroy and fix whitespace
      
      * changed max stream = 1
      
      * Fixed ctx
      
      * using default stream
      
      * fix bug in copy_rhs
      
      * changed by mistake
      
      * minor datatype change
      
      * added datatype check
      Co-authored-by: Israt Nisa <nisisrat@amazon.com>
  20. 20 May, 2021 1 commit
    • [Feature][Performance] Implement NCCL wrapper for communicating NodeEmbeddings and sparse gradients. (#2825) · ae8dbe6d
      nv-dlasalle authored
      
      * Split NCCL wrapper from sparse optimizer and sparse embedding
      
      * Add more unit tests for single node nccl
      
      * Fix unit test for tf
      
      * Switch to device histogram
      
      * Fix histogram issues
      
      * Finish migration to histogram
      
      * Handle cases with zero send/receive data
      
      * Start on partition object
      
      * Get compiling
      
      * Updates
      
      * Add unit tests
      
      * Switch to partition object
      
      * Fix linting issues
      
      * Rename partition file
      
      * Add python doc
      
      * Fix python assert and finish doxygen comments
      
      * Remove stubs for range based partition to satisfy pylint
      
      * Wrap unit test in GPU only
      
      * Wrap explicit cuda call in ifdef
      
      * Merge with partition.py
      
      * update docstrings
      
      * Cleanup partition_op
      
      * Add Workspace object
      
      * Switch to using workspace object
      
      * Move last remainder based function out of nccl_api
      
      * Add error messages
      
      * Update docs with examples
      
      * Fix linting errors
      Co-authored-by: xiang song(charlie.song) <classicxsong@gmail.com>
  21. 17 May, 2021 1 commit
  22. 28 Apr, 2021 1 commit
  23. 27 Apr, 2021 1 commit
  24. 15 Apr, 2021 1 commit
    • [Performance][GPU] Enable GPU uniform edge sampling (#2716) · e70138bb
      nv-dlasalle authored

      * Start on uniform GPU sampling
      
      * Save more work
      
      * Get cu file compiling
      
      * Update sampling
      
      * More changes
      
      * Get GPU sampling for uniform probabilities solved
      
      * Fix batch tensor migration
      
      * Fix
      
      * update kernels
      
      * expand blocking
      
      * Undo testing change
      
      * Cut down on sampling overhead
      
      * Fix replacement
      
      * Update unit tests
      
      * Add option to gpu sample in graphsage
      
      * Copy only csc to gpu
      
      * Add ogbn support
      
      * Fix linting
      
      * Remove nvtx from sample
      
      * Improve documentation and error checking
      
      * Expand documentation
      
      * Update assert checking
      
      * delete extra space
      
      * Use standard dataloader when dataset is a dictionary
      
      * ogb -> ogbn
      
      * Fix edge selection determinism
      
      * Fix typos
      
      * Remove nvtx
      
      * Add comment for self.fanout_arrays and assert
      
      * Fix linting
      
      * Migrate to scalarbatcher
      
      * Fix indentation
      
      * Fix batcher
      
      * Fix indexing
      
      * Only use databatcher for GPU
      
      * Convert DGL NDArray to PyTorch Tensor
      
      * Add optimization for PyTorch's F.tensor() for list of GPU tensors
      Co-authored-by: Da Zheng <zhengda1936@gmail.com>
  25. 25 Mar, 2021 1 commit
  26. 22 Mar, 2021 1 commit
  27. 05 Mar, 2021 1 commit
  28. 21 Feb, 2021 1 commit
  29. 28 Jan, 2021 1 commit
  30. 27 Jan, 2021 1 commit
    • [Performance] Improve COO to CSR, and sort columns of CSR only when necessary. (#2391) · 2576647c
      nv-dlasalle authored

      * Remove double-checking sorted
      
      * Remove sorting of CSR by default
      
      * Update unit test to use unsorted matrix
      
      * delete whitespace
      
      * Expand unit tests
      
      * Replace cusparse sort
      
      * Fix row column sorting
      
      * Explicitly don't sort columns
      
      * Fix linting errors
      
      * Fix bit-width calculation
      
      * Fix sorting assertion and unit test
      
      * Fix linting
      
      * Improve CPU COO2CSR
      
      * Remove references
      
      * Rename and add documentation to edge encoding/decoding functions
      
      * Fix sorting keys as 64 bit
      
      * Revert cosmetic changes to unit tests
      
      * Update documentation
      
      * Update complexity documentation for coo to csr conversion
      
      * Remove COOIsSorted check in CPU implementation too
  31. 25 Jan, 2021 1 commit
  32. 21 Jan, 2021 1 commit
  33. 27 Nov, 2020 1 commit
  34. 26 Nov, 2020 1 commit
  35. 22 Nov, 2020 1 commit
  36. 17 Nov, 2020 1 commit
  37. 12 Nov, 2020 2 commits
  38. 06 Nov, 2020 1 commit