Commits · 0437b16497041b83d454a08e2db7e56fb52560d5 · OpenDAS / dgl

"src/git@developer.sourcefind.cn:renzhc/diffusers_dcu.git" did not exist on "c3675d4c9bb9c02521cd2c1aec198460c1657256"

20 May, 2021 1 commit

[Feature][Performance] Implement NCCL wrapper for communicating NodeEmbeddings... · ae8dbe6d

nv-dlasalle authored May 20, 2021


[Feature][Performance] Implement NCCL wrapper for communicating NodeEmbeddings and sparse gradients. (#2825)

* Split NCCL wrapper from sparse optimizer and sparse embedding

* Add more unit tests for single node nccl

* Fix unit test for tf

* Switch to device histogram

* Fix histgram issues

* Finish migration to histogram

* Handle cases with zero send/recieve data

* Start on partition object

* Get compiling

* Updates

* Add unit tests

* Switch to partition object

* Fix linting issues

* Rename partition file

* Add python doc

* Fix python assert and finish doxygen comments

* Remove stubs for range based partition to satisfy pylint

* Wrap unit test in GPU only

* Wrap explicit cuda call in ifdef

* Merge with partition.py

* update docstrings

* Cleanup partition_op

* Add Workspace object

* Switch to using workspace object

* Move last remainder based function out of nccl_api

* Add error messages

* Update docs with examples

* Fix linting erros
Co-authored-by: xiang song(charlie.song) <classicxsong@gmail.com>

ae8dbe6d

17 May, 2021 1 commit

[Feature] Python interface for adjacency matrix summation and multiplication (#2893) · 657c220d

Quan (Andy) Gan authored May 17, 2021

* test commit

* fixes

* oops

* add docs

* lint

* why does it say I have a trailing whitespace

* oh ok

* fixes

* why there's an invalid argument error

* address comments

* fix

* address comments

657c220d

28 Apr, 2021 1 commit

Fix cu11 compile (#2879) · 703d4b93

xiang song(charlie.song) authored Apr 28, 2021


Co-authored-by: Ubuntu <ubuntu@ip-172-31-1-191.ec2.internal>
Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>

703d4b93

27 Apr, 2021 1 commit

[Feature] Add cuda support for Sparse Matrix multiplication, summation and masking (#2782) · ab2bd1f1

Israt Nisa authored Apr 27, 2021



* init cuda support

* cuSPARSE err

* passed unittest for csr_mm/SpGEMM. int64 not supported

* Debugging cuSPARSE error 3

* csrgeam only supports int32?

* disabling int64 for cuda

* refactor and add CSRMask

* lint

* oops

* remove todo

* rewrite CSRMask with CSRGetData

* lint

* fix test

* address comments

* lint

* fix

* addresses comments and rename BUG_ON
Co-authored-by: Israt Nisa <nisisrat@amazon.com>
Co-authored-by: Ubuntu <ubuntu@ip-172-31-30-71.ec2.internal>
Co-authored-by: Quan Gan <coin2028@hotmail.com>
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>
Co-authored-by: Minjie Wang <wmjlyjemaine@gmail.com>

ab2bd1f1

16 Apr, 2021 1 commit

[Performance] Track sorted status of COO from creation (#2645) · bbebde46

nv-dlasalle authored Apr 15, 2021



* Add row/col sorted flags

* improve sorting paths

* Remove print statement

* Keep track of sorted matrices

* Remove sort check in to_block

* Improve CPU sorted COO->CSR

* Handle the zero edge case

* Remove omp default clause to work with MSVC

* Update comments on sorted COO->CSR cpu implementatoin

* Expose sorted to python interface

* Make check_sorted default to false for dgl.graph()

* remove check sorted; add utests

* remove check_sorted flag
Co-authored-by: Minjie Wang <wmjlyjemaine@gmail.com>

bbebde46

15 Apr, 2021 1 commit

[Performance][GPU] Enable GPU uniform edge sampling (#2716) · e70138bb

nv-dlasalle authored Apr 14, 2021



* Start on uniform GPU sampling

* Save more work

* Get cu file compiling

* Update sampling

* More changes

* Get GPU sampling for uniform probabilities solved

* Fix batch tensor migration

* Fix

* update kernels

* expand blocking

* Undo testing change

* Cut down on sampling overhead

* Fix replacement

* Update unit tests

* Add option to gpu sample in graphsage

* Copy only csc to gpu

* Add ogbn support

* Fix linting

* Remove nvtx from sample

* Improve documentation and error checking

* Expand documentation

* Update assert checking

* delete extra space

* Use standard dataloader when dataset is a dictionary

* ogb -> ogbn

* Fix edge selection determinism

* Fix typos

* Remove nvtx

* Add comment for self.fanout_arrays and assert

* Fix linting

* Migrate to scalarbatcher

* Fix indentation

* Fix batcher

* Fix indexing

* Only use databatcher for GPU

* Convert to DGL NDArray to PyTorch Tensor

* Add optimization for PyTorch's F.tensor() for list of GPU tensors
Co-authored-by: Da Zheng <zhengda1936@gmail.com>

e70138bb

25 Mar, 2021 1 commit
- [Bug] Disable cpu fp16 (#2783) · 0b57ce18
  Quan (Andy) Gan authored Mar 25, 2021
```
* disable cpu fp16

* spell mistakes
```
  0b57ce18
24 Mar, 2021 1 commit

[Feature] Sparse-sparse matrix multiplication, addition, and masking (#2753) · 929d8634

Quan (Andy) Gan authored Mar 24, 2021

* test

* more stuff

* add test

* fixes

* optimize algo

* replace unordered_map with arrays

* lint

* lint x2

* oops

* disable gpu csrmm tests

* remove gpu invocation

* optimize with openmp

* remove python functions

* add back with docstrings

* lint

* lint

* update python interface

* functionize

* functionize

* lint

* lint

929d8634

22 Mar, 2021 1 commit

[Bugfix] Wrap cub with CUB_NS_PREFIX and remove dependency on Thrust to... · 0ff7127a

nv-dlasalle authored Mar 22, 2021


[Bugfix] Wrap cub with CUB_NS_PREFIX and remove dependency on Thrust to linking issues with Torch 1.8 (#2758)

* Wrap cub with prefixes and remove thrust

* Using counting iterator
Co-authored-by: Zihao Ye <expye@outlook.com>
Co-authored-by: Minjie Wang <wmjlyjemaine@gmail.com>

0ff7127a

09 Mar, 2021 1 commit

[Feature] Add edge coarsening for homogeneous undirected graphs (#2691) · c88fca50

Tianqi Zhang (张天启) authored Mar 09, 2021



* finish graph matching gpu version

* use C++ shuffle

* finish graph matching

* fix bug

* fix bug

* change name and use swap

* upt

* fix format problem

* fix format problem

* stronger test

* upt

* upt

* change python api

* upt

* upt

* format check

* upt

* upt

* fix bug
Co-authored-by: Tong He <hetong007@gmail.com>

c88fca50

05 Mar, 2021 1 commit
- fix doc typo (#2721) · 62dd1c86
  maqy1995 authored Mar 05, 2021
  
  62dd1c86
21 Feb, 2021 1 commit

[Feature] Support aggregate multiple edge features in to_simple. (#2623) · e6bf54cd

Zihao Ye authored Feb 21, 2021

* upd

* fix

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* fix

* refactor

* upd test

* large feat_len or n in segment reduce

* lint

e6bf54cd

29 Jan, 2021 1 commit
- fix build problems (#2594) · 460bb42d
  Quan (Andy) Gan authored Jan 29, 2021
  
  460bb42d
28 Jan, 2021 1 commit

[feature] Supporting half precision floating data type (fp16). (#2552) · 7bab1365

Zihao Ye authored Jan 28, 2021



* add tvm as submodule

* compilation is ok but calling fails

* can call now

* pack multiple modules, change names

* upd

* upd

* upd

* fix cmake

* upd

* upd

* upd

* upd

* fix

* relative path

* upd

* upd

* upd

* singleton

* upd

* trigger

* fix

* upd

* count reducible

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* only keep related files

* upd

* upd

* upd

* upd

* lint

* lint

* lint

* lint

* pylint

* upd

* upd

* compilation

* fix

* upd

* upd

* upd

* upd

* upd

* upd

* upd doc

* refactor

* fix

* upd number
Co-authored-by: Zhi Lin <linzhilynn@gmail.com>
Co-authored-by: Ubuntu <ubuntu@ip-172-31-42-78.us-east-2.compute.internal>
Co-authored-by: Ubuntu <ubuntu@ip-172-31-21-156.us-east-2.compute.internal>
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>

7bab1365

27 Jan, 2021 1 commit

[Performance] Improve COO to CSR, and sort columns of CSR only when necessary. (#2391) · 2576647c

nv-dlasalle authored Jan 26, 2021

* Remove double-checking sorted

* Remove sorting of CSR by default

* Update unit test to use unsorted matix

* delete whitespace

* Expand unit tests

* Replace cusparse sort

* Fix row column sorting

* Explicitly don't sort columns

* Fix linting errors

* Fix bit-width calculation

* Fix sorting assertion and unit test

* Fix linting

* Improve CPU COO2CSR

* Remove references

* Rename and add documentation to edge encoding/decoding funcionts

* Fix sorting keys as 64 bit

* Revert cosmetic changes to unit tests

* Update documentation

* Update complexity documentation for coo to csr conversion

* Remove COOIsSorted check in CPU implementation too

2576647c

25 Jan, 2021 1 commit
- [feature] Implement missing CUDA operators for COO format (part 1). (#2565) · 0f9056ed
  Zihao Ye authored Jan 25, 2021
```
* upd

* upd

* upd

* upd

* fix

* upd

* upd
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>
```
  0f9056ed
21 Jan, 2021 1 commit
- Use ALG2 for SpMM in cuSparse (#2550) · 9d90faf0
  nv-dlasalle authored Jan 20, 2021
  
  9d90faf0
31 Dec, 2020 1 commit
- [Feature] Tvm integration (#2367) · 4208ce2b
  Zhi Lin authored Dec 31, 2020
```
Co-authored-by: Zihao Ye <expye@outlook.com>
```
  4208ce2b
25 Dec, 2020 1 commit

[Performance] Use allocator from PyTorch if possible (#2328) · 9a7235fa

Quan (Andy) Gan authored Dec 25, 2020

* first commit

* some thoughts

* move around

* more commit

* more fixes

* now it uses torch allocator

* fix symbol export error

* fix

* fixes

* test fix

* add script

* building separate library per version

* fix for vs2019

* more fixes

* fix on windows build

* update jenkinsfile

* auto copy built dlls for windows

* lint and installation guide update

* fix

* specify conda environment

* set environment for ci

* fix

* fix

* fix

* fix again

* revert

* fix cmake

* fix

* switch to using python interpreter path

* remove scripts

* debug

* oops sorry

* Update index.rst

* Update index.rst

* copies automatically, no need for this

* do not print message if library not found

* tiny fixes

* debug on nightly

* replace add_compile_definitions to make CMake 3.5 happy

* fix linking to wrong lib for multiple pytorch envs

* changed building strategy

* fix nightly

* fix windows

* fix windows again

* setup bugfix

* address comments

* change README

9a7235fa

17 Dec, 2020 1 commit
- [hotfix] Make USE_AVX a flag in cmake to avoid compilation error for arm user (#2428) · e379e525
  Zihao Ye authored Dec 17, 2020
```
* upd cmake

* upd

* format
```
  e379e525
10 Dec, 2020 1 commit

[Performance][Hotfix] Disable openmp in arithmetic operation (#2412) · 9dff5419

Quan (Andy) Gan authored Dec 10, 2020



* disable openmp in arithmetic operation

* lint

* Update array_op_impl.cc
Co-authored-by: Minjie Wang <wmjlyjemaine@gmail.com>

9dff5419

27 Nov, 2020 1 commit
- [doc] Add docstring for segment reduce. (#2375) · 6b02babb
  Zihao Ye authored Nov 27, 2020
  
  6b02babb
26 Nov, 2020 2 commits
- disable gespmm (#2371) · 5ac74f86
  Zihao Ye authored Nov 26, 2020
  
  5ac74f86
- [Performance]: Remove indptr correction for CooToCSR (#2356) · 6897f55a
  IzabelaMazur authored Nov 26, 2020
  
  6897f55a
22 Nov, 2020 1 commit
- [Performance] Use segment operators for graph readout. (#2361) · 3adbfa18
  Zihao Ye authored Nov 23, 2020
```
* upd

* upd

* update

* upd

* upd

* upd

* fix

* lint

* lint

* pylint

* doc
```
  3adbfa18
17 Nov, 2020 2 commits

upd (#2352) · 061c2a36
Zihao Ye authored Nov 17, 2020

061c2a36

[Performance] Dynamic cpu kernel V3 for SpMMSumCsr all Ops (#2309) · f8ebcd7f

pawelpiotrowicz authored Nov 17, 2020



* support AVX512

* env DGL_CPU_INTEL_KERNEL_ENABLED=1

* env DGL_CPU_INTEL_KERNEL_LOG=1

* Add unittest test_spmm.cc
Co-authored-by: Izabela Mazur <izabela.mazur@intel.com>
Co-authored-by: Michal Szarmach <michal.szarmach@intel.com>

Review patch

f8ebcd7f

13 Nov, 2020 1 commit

[Bug] Multiple fixes for CUDA 11 support (#2333) · 501b2b75

Quan (Andy) Gan authored Nov 13, 2020



* multiple fixes

* fix CI

* fiddle

* revert stubs

* remove stubs

* poke

* remove linking of driver library

* minor
Co-authored-by: Minjie Wang <wmjlyjemaine@gmail.com>

501b2b75

12 Nov, 2020 2 commits
- upd (#2336) · d89f825d
  Zihao Ye authored Nov 12, 2020
  
  d89f825d
- [Kernel] Use tree reduction for SDDMM-dot (#2335) · 92a3d07d
  Zihao Ye authored Nov 12, 2020
```
* multiple fixes

* fix CI

* fiddle

* revert stubs

* upd

* upd

* unmerge

* unmerge
Co-authored-by: Quan Gan <coin2028@hotmail.com>
```
  92a3d07d
06 Nov, 2020 1 commit

[kernel] Select GE-SpMM when feature size is large. (#2306) · 272cb9e2

Zihao Ye authored Nov 06, 2020

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* fix

* upd

* upd

* upd

* upd

272cb9e2

15 Sep, 2020 1 commit
- Loop reorder (#2201) · b25bbe64
  Zihao Ye authored Sep 15, 2020
  
  b25bbe64
11 Sep, 2020 1 commit
- [Bug] fix cumsum on an empty array with prepend_zero returning an empty array (#2179) · 7d8522a2
  Quan (Andy) Gan authored Sep 11, 2020
```
* fix cumsum

* udp
Co-authored-by: Zihao <expye@outlook.com>
```
  7d8522a2
10 Sep, 2020 2 commits

[performance] Batch DGLGraph in C++ end. (#2155) · cbd55eb1

Zihao Ye authored Sep 11, 2020



* upd

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* fix

* upd

* upd

* upd

* upd

* fix

* upd
Co-authored-by: VoVAllen <jz1749@nyu.edu>

cbd55eb1

[hotfix] Skip CUDA kernel launch when number of blocks/threads is zero. (#2144) · 2c04ecb5
Zihao Ye authored Sep 10, 2020
```
* upd

* upd

* upd

* upd

* lint

* upd

* upd

* fmt
Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>
```
2c04ecb5

27 Aug, 2020 1 commit
- [Feature] Use new cusparse API to support CUDA 11. (#1979) · 5cff2f1c
  Zihao Ye authored Aug 27, 2020
```
* upd

* upd

* upd

* upd

* upd

* upd

* upd

* upd
```
  5cff2f1c
14 Aug, 2020 1 commit

[WIP][Kernel] Set the built-in reduce result of zero-degree nodes to 0 in C (#2017) · 63e2ba23

Quan (Andy) Gan authored Aug 14, 2020



* test idea

* cuda kernels

* lint and fixes

* lint

* change to another strategy

* use infinity

* fix
Co-authored-by: Zihao Ye <expye@outlook.com>

63e2ba23

13 Aug, 2020 1 commit
- [hotfix] Fix cuda illegal memory access error in SpMMCoo. (#2015) · e6f6ce27
  Zihao Ye authored Aug 13, 2020
```
* up

* pylint

* upd
```
  e6f6ce27
01 Aug, 2020 1 commit

[bugfix] Fix the memory leak issue of Cluster GAT under 0.5 kernel and... · 34a067ea

Zihao Ye authored Aug 02, 2020

[bugfix] Fix the memory leak issue of Cluster GAT under 0.5 kernel and simplify the bipartite GAT. (#1908)

* uipd

* upd

* upd

* upd

* upd

34a067ea

30 Jul, 2020 1 commit

[CUDA][Kernel] A bunch of int64 kernels for COO and CSR (#1883) · f4608c22

Minjie Wang authored Jul 30, 2020

* COO sort

* COOToCSR

* CSR2COO

* CSRSort; CSRTranspose

* pass all CSR tests

* lint

* remove int32 conversion

* fix tensorflow nn tests

* turn on CI

* fix

* addreess comments

f4608c22