Commits · ae8dbe6d3ca38dbac1089b51bbfb0e328a19a4be · OpenDAS / dgl

20 May, 2021 1 commit

[Feature][Performance] Implement NCCL wrapper for communicating NodeEmbeddings and sparse gradients. (#2825) · ae8dbe6d

nv-dlasalle authored May 20, 2021


* Split NCCL wrapper from sparse optimizer and sparse embedding

* Add more unit tests for single node nccl

* Fix unit test for tf

* Switch to device histogram

* Fix histogram issues

* Finish migration to histogram

* Handle cases with zero send/receive data

* Start on partition object

* Get compiling

* Updates

* Add unit tests

* Switch to partition object

* Fix linting issues

* Rename partition file

* Add python doc

* Fix python assert and finish doxygen comments

* Remove stubs for range based partition to satisfy pylint

* Wrap unit test in GPU only

* Wrap explicit cuda call in ifdef

* Merge with partition.py

* update docstrings

* Cleanup partition_op

* Add Workspace object

* Switch to using workspace object

* Move last remainder based function out of nccl_api

* Add error messages

* Update docs with examples

* Fix linting errors
Co-authored-by: xiang song(charlie.song) <classicxsong@gmail.com>

ae8dbe6d

19 May, 2021 1 commit

[Feature] Add bruteforce implementation for KNN with O(Nk) space complexity (#2892) · 5d7e80f4

Tianqi Zhang (张天启) authored May 19, 2021



* add bruteforce impl

* add support for bruteforce-sharemem

* modify python API

* add tests

* change file path

* change python API

* fix lint

* fix test

* also check worst_dist in the last few dim

* use heap and early-stop on CPU

* fix lint

* fix lint

* add device check

* use cuda function to determine max shared mem

* use cuda to determine block info

* add memory free for tmp var

* update doc-string and add dist option

* fix lint

* add more tests
Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>
Co-authored-by: Minjie Wang <wmjlyjemaine@gmail.com>

5d7e80f4

18 May, 2021 1 commit

[Distributed] add distributed in-degree and out-degree. (#2918) · 6e7f19f2

Da Zheng authored May 18, 2021



* add distributed in-degree and out-degree.

* update comments.

* fix a bug.

* add tests.

* add tests.

* fix a bug.

* fix docstring.

* update doc.

* fix

* fix.
Co-authored-by: Zheng <dzzhen@3c22fba32af5.ant.amazon.com>
Co-authored-by: xiang song(charlie.song) <classicxsong@gmail.com>

6e7f19f2

17 May, 2021 2 commits

[Example] add latent dirichlet allocation (#2883) · c0184365

yifeim authored May 17, 2021



* add lda model

* tweak latent dirichlet allocation

* Update README.md

* Update README.md

* update example index

* update header

* minor tweak

* add example test

* update doc

* Update README.md

* Update README.md

* add partial_fit for free

* Update examples/pytorch/lda/lda_model.py
Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>

* Update examples/pytorch/lda/example_20newsgroups.py
Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>

* Update lda_model.py

* bugfix torch Gamma uses rate parameter
Co-authored-by: Yifei Ma <yifeim@amazon.com>
Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>

c0184365

[Feature] Python interface for adjacency matrix summation and multiplication (#2893) · 657c220d

Quan (Andy) Gan authored May 17, 2021

* test commit

* fixes

* oops

* add docs

* lint

* why does it say I have a trailing whitespace

* oh ok

* fixes

* why there's an invalid argument error

* address comments

* fix

* address comments

657c220d

11 May, 2021 1 commit

Remove __len__ method from DGLGraph (#2902) · 103444c5

Quan (Andy) Gan authored May 11, 2021

* Update heterograph.py

* remove unit tests

* replace tutorial

103444c5

07 May, 2021 1 commit

[Dataloading] Make loader iters iterator (#2886) · bfef789e

Justus Schock authored May 07, 2021



* Make loader items iterator

* Update test_dataloader.py

* Update __init__.py

* Update test_dataloader.py
Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>

bfef789e

03 May, 2021 1 commit

[Distributed] Distributed node embedding and sparse optimizer (#2733) · 975eb8fc

xiang song(charlie.song) authored May 03, 2021



* Draft for sparse emb

* add some notes

* Fix

* Add sparse optim for dist pytorch

* Update test

* Fix

* upd

* upd

* Fix

* Fix

* Fix bug

* add transductive example

* Fix example

* Some fix

* Upd

* Fix lint

* lint

* lint

* lint

* upd

* Fix lint

* lint

* upd

* remove dead import

* update

* lint

* update unit test

* update example

* Add adam optimizer

* Add unit test and update data

* upd

* upd

* upd

* Fix docstring and fix some bug in example code

* Update rgcn readme
Co-authored-by: Ubuntu <ubuntu@ip-172-31-57-25.ec2.internal>
Co-authored-by: Ubuntu <ubuntu@ip-172-31-24-210.ec2.internal>
Co-authored-by: Ubuntu <ubuntu@ip-172-31-2-66.ec2.internal>

975eb8fc

27 Apr, 2021 2 commits

[Feature] Add cuda support for Sparse Matrix multiplication, summation and masking (#2782) · ab2bd1f1

Israt Nisa authored Apr 27, 2021



* init cuda support

* cuSPARSE err

* passed unittest for csr_mm/SpGEMM. int64 not supported

* Debugging cuSPARSE error 3

* csrgeam only supports int32?

* disabling int64 for cuda

* refactor and add CSRMask

* lint

* oops

* remove todo

* rewrite CSRMask with CSRGetData

* lint

* fix test

* address comments

* lint

* fix

* addresses comments and rename BUG_ON
Co-authored-by: Israt Nisa <nisisrat@amazon.com>
Co-authored-by: Ubuntu <ubuntu@ip-172-31-30-71.ec2.internal>
Co-authored-by: Quan Gan <coin2028@hotmail.com>
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>
Co-authored-by: Minjie Wang <wmjlyjemaine@gmail.com>

ab2bd1f1

[NN] Fix GATConv for Broadcasting with Residual Connections (#2867) · e18c2ab4

Mufei Li authored Apr 27, 2021

* Update

* update
Co-authored-by: Ubuntu <ubuntu@ip-172-31-59-108.us-west-2.compute.internal>

e18c2ab4

26 Apr, 2021 1 commit

[Distributed] Fix a bug in graph partition. (#2869) · e7046f1e

Da Zheng authored Apr 26, 2021



* update distributed training doc.

* explain data split.

* fix message passing.

* id mapping.

* fix.

* test data reshuffling.

* fix a bug.

* fix test.

* Revert "fix."

This reverts commit 2d025e9e1a5c05c3da9b803a035a788ced59bd77.

* Revert "id mapping."

This reverts commit 2a6a93ceb81fbdff86e6e9e5a58e1ace1e9d9882.

* Revert "fix message passing."

This reverts commit ed8a86bf2b015e5e4f64ba160e81b207ad2a1d65.

* Revert "explain data split."

This reverts commit 4338ddf8a336014cf92d4cb9a1db02b9badc0e55.

* Revert "update distributed training doc."

This reverts commit dda1c35c44536934c19715534f01f832afda6ad2.

* add more tests.

* fix.

* fix.

* fix.
Co-authored-by: Zheng <dzzhen@3c22fba32af5.ant.amazon.com>
Co-authored-by: xiang song(charlie.song) <classicxsong@gmail.com>

e7046f1e

25 Apr, 2021 1 commit

[Bug Fix] Fix sparse opt bug (#2859) · c37e0364

xiang song(charlie.song) authored Apr 25, 2021



* Fix #2856

* upd

* Fix unit test

* upd

* upd

* upd

* Fix
Co-authored-by: Ubuntu <ubuntu@ip-172-31-57-25.ec2.internal>

c37e0364

22 Apr, 2021 2 commits

[Distributed] Return the ID mapping in graph partitioning. (#2857) · d76af4d4

Da Zheng authored Apr 22, 2021



* return mapping.

* support heterogeneous graph.

* more test.

* fix lint.

* fix for diff backends.

* fix.

* fix.
Co-authored-by: Zheng <dzzhen@3c22fba32af5.ant.amazon.com>
Co-authored-by: xiang song(charlie.song) <classicxsong@gmail.com>

d76af4d4

[Sampler] BiasedChoice sampler (#1665) · 6b022d2f

Qidong Su authored Apr 22, 2021



* update

* update

* update

* update

* update

* update

* update

* fix

* fix

* update

* doc

* doc

* fix

* fix
Co-authored-by: Minjie Wang <wmjlyjemaine@gmail.com>

6b022d2f

16 Apr, 2021 1 commit

[Performance] Track sorted status of COO from creation (#2645) · bbebde46

nv-dlasalle authored Apr 15, 2021



* Add row/col sorted flags

* improve sorting paths

* Remove print statement

* Keep track of sorted matrices

* Remove sort check in to_block

* Improve CPU sorted COO->CSR

* Handle the zero edge case

* Remove omp default clause to work with MSVC

* Update comments on sorted COO->CSR cpu implementation

* Expose sorted to python interface

* Make check_sorted default to false for dgl.graph()

* remove check sorted; add utests

* remove check_sorted flag
Co-authored-by: Minjie Wang <wmjlyjemaine@gmail.com>

bbebde46

15 Apr, 2021 1 commit

[Performance][GPU] Enable GPU uniform edge sampling (#2716) · e70138bb

nv-dlasalle authored Apr 14, 2021



* Start on uniform GPU sampling

* Save more work

* Get cu file compiling

* Update sampling

* More changes

* Get GPU sampling for uniform probabilities solved

* Fix batch tensor migration

* Fix

* update kernels

* expand blocking

* Undo testing change

* Cut down on sampling overhead

* Fix replacement

* Update unit tests

* Add option to gpu sample in graphsage

* Copy only csc to gpu

* Add ogbn support

* Fix linting

* Remove nvtx from sample

* Improve documentation and error checking

* Expand documentation

* Update assert checking

* delete extra space

* Use standard dataloader when dataset is a dictionary

* ogb -> ogbn

* Fix edge selection determinism

* Fix typos

* Remove nvtx

* Add comment for self.fanout_arrays and assert

* Fix linting

* Migrate to scalarbatcher

* Fix indentation

* Fix batcher

* Fix indexing

* Only use databatcher for GPU

* Convert DGL NDArray to PyTorch Tensor

* Add optimization for PyTorch's F.tensor() for list of GPU tensors
Co-authored-by: Da Zheng <zhengda1936@gmail.com>

e70138bb

13 Apr, 2021 1 commit

[Distributed] Fix a bug for graphs without node/edge data. (#2838) · de5e8e23

Da Zheng authored Apr 12, 2021

* fix.

* test distributed graph without node/edge data.

* remove some tests.

* fix lint

de5e8e23

09 Apr, 2021 1 commit

[Feature] Add kd-tree implementation (CPU) for kNN (#2767) · e83d0a80

Tianqi Zhang (张天启) authored Apr 09, 2021



* add submodule nanoflann

* finish python API for knn

* finish ndarray adaptor

* finish cpu-kdtree version of knn

* use openmp

* add endline

* upt

* upt

* fix format and code style

* upt

* add warning for gpu-cpu copy

* avoid contiguous copy
Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>
Co-authored-by: Tong He <hetong007@gmail.com>

e83d0a80

01 Apr, 2021 1 commit

[CI] Disable distributed KVStore UT (#2805) · 54c74803

Minjie Wang authored Apr 01, 2021

54c74803

30 Mar, 2021 1 commit

[Distributed] Simplify distributed API (#2775) · e36c5db6

Da Zheng authored Mar 29, 2021



* remove num_workers.

* remove num_workers.

* remove num_workers.

* remove num-servers.

* update error message.

* update docstring.

* fix docs.

* fix tests.

* fix test.

* fix.

* print messages in test.

* fix.

* fix test.

* fix.
Co-authored-by: Ubuntu <ubuntu@ip-172-31-9-132.us-west-1.compute.internal>

e36c5db6

25 Mar, 2021 1 commit

[NN] tf nn for edgeConv (#2741) · 03482f0a

kyawlinoo authored Mar 25, 2021



* tf nn for edgeConv

* Auto stash before merge of "tf_working" and "origin/tf_working"

* clean up

* added test for edge_conv

* fix

* fix

* fix
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>
Co-authored-by: Quan Gan <coin2028@hotmail.com>

03482f0a

24 Mar, 2021 1 commit

[Feature] Sparse-sparse matrix multiplication, addition, and masking (#2753) · 929d8634

Quan (Andy) Gan authored Mar 24, 2021

* test

* more stuff

* add test

* fixes

* optimize algo

* replace unordered_map with arrays

* lint

* lint x2

* oops

* disable gpu csrmm tests

* remove gpu invocation

* optimize with openmp

* remove python functions

* add back with docstrings

* lint

* lint

* update python interface

* functionize

* functionize

* lint

* lint

929d8634

18 Mar, 2021 2 commits

[performance] Optimize the association order of AXW in GraphSAGE. (#2747) · edf64463

Zihao Ye authored Mar 19, 2021



* upd

* lint

* upd

* upd

* compatibility

* upd

* upd
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>

edf64463

[Pickle] Fix HeteroGraphConv pickle problem (#2761) · 366cc7eb

Jinjing Zhou authored Mar 18, 2021

* fix pickle problem

* lint

* add pickle tests

* fix

* fix

* fix

* fix

* fix for windows

366cc7eb

09 Mar, 2021 1 commit

[Feature] Add edge coarsening for homogeneous undirected graphs (#2691) · c88fca50

Tianqi Zhang (张天启) authored Mar 09, 2021



* finish graph matching gpu version

* use C++ shuffle

* finish graph matching

* fix bug

* fix bug

* change name and use swap

* upt

* fix format problem

* fix format problem

* stronger test

* upt

* upt

* change python api

* upt

* upt

* format check

* upt

* upt

* fix bug
Co-authored-by: Tong He <hetong007@gmail.com>

c88fca50

03 Mar, 2021 1 commit

[NN] Support Unidirectional Bipartite Graphs in CFConv (#2674) · 7380d61e

Mufei Li authored Mar 03, 2021

* Update

* update

* Update
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>

7380d61e

21 Feb, 2021 1 commit

[Feature] Support aggregate multiple edge features in to_simple. (#2623) · e6bf54cd

Zihao Ye authored Feb 21, 2021

* upd

* fix

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* fix

* refactor

* upd test

* large feat_len or n in segment reduce

* lint

e6bf54cd

20 Feb, 2021 1 commit

[Doc] Tutorials re-organization (#2683) · 8a07ab77

Minjie Wang authored Feb 20, 2021

* reorg

* change titles

* rm some stale API doc; minor fix

* fix docs

* add warning

* rm new-tutorial run in ci

* lint

8a07ab77

19 Feb, 2021 1 commit

revert (#2672) · d93a9759

Quan (Andy) Gan authored Feb 19, 2021

d93a9759

12 Feb, 2021 1 commit

fix and lots of tests (#2650) · 9e630101

Quan (Andy) Gan authored Feb 12, 2021

9e630101

08 Feb, 2021 2 commits

[Doc] Fix inconsistencies and GPU code (#2642) · 117dd252

Quan (Andy) Gan authored Feb 08, 2021

* fix inconsistencies and GPU

* bug fixes

* fix

* trigger new tutorials

117dd252

[Sampling] Implement `dgl.to_block()` for the GPU (#2339) · bc3a532f

nv-dlasalle authored Feb 07, 2021



* Add start of to_block gpu implementation

* Pull in more changes from 0.4.2 cuda_to_block

* Move more code to IdArray

* Refactor DeviceNodeMapMaker

* Updates

* get compiling

* Integrate to_block

* Fix ID allocation

* Minor fixes

* Cleanup cuda calls to use cuda_common

* Reduce kernel calls

* Lint cleanup

* Expand documentation

* Remove unused function

* Rename variables for consistency

* Add doxygen comments

* Fix file extension

* Remove raw asynccopy for deviceapi

* Remove unused function

* Fix block/tile configuration

* Add cuda_device_common.cuh

* Add basic hashtable

* Migrate part of hashtable

* Refactor to use external hashtable

* Make functions members

* Format hash table functions

* Migrate duplicate filling

* Move last function over

* Refactor with cu file

* lint c++ code

* Move context check to C++ code

* Use macro switch

* Add missing files

* Update docstring

* update docs

* Move atomic functions

* Refactor hashtable

* Fix linting

* Expand docs

* Fix mismatched argument names

* Switch doxygen comments from using @param to \param
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>
Co-authored-by: Minjie Wang <wmjlyjemaine@gmail.com>

bc3a532f

05 Feb, 2021 1 commit

[Test] Add test for Cython and force rebuild when run setup.py (#2635) · 0346b0aa

Jinjing Zhou authored Feb 05, 2021

0346b0aa

03 Feb, 2021 1 commit

[bugfix] Solve the boundary issue in backward function of segment sum (#2610) · fb4a0508

Zihao Ye authored Feb 03, 2021

* upd

* trigger

* upd

fb4a0508

29 Jan, 2021 1 commit

[Bug] Heterogeneous graph convolution bugfix (#2578) · 1f6eba9e

Quan (Andy) Gan authored Jan 29, 2021



* fix heterograph conv

* remove test cases

* fix test

* fix test

* fix test
Co-authored-by: Minjie Wang <wmjlyjemaine@gmail.com>

1f6eba9e

28 Jan, 2021 1 commit

[NN] add multihead in DotGatConv (#2549) · e4ddafe9

Chen Sirui authored Jan 28, 2021



* add multihead in DotGatConv

* Fix spacing issue

* Add Unit test for dotgat

* Modified Unit test for dotgat

* Add transformer like divisor

* Update dotgatconv.py
Co-authored-by: Chen <chesirui@3c22fbe5458c.ant.amazon.com>
Co-authored-by: Zihao Ye <expye@outlook.com>

e4ddafe9

27 Jan, 2021 2 commits

[Feature] Add support for sparse embedding (#2451) · a7e941c3

xiang song(charlie.song) authored Jan 28, 2021



* Add sparse embedding for dgl and update rgcn example

* upd

* Fix

* Revert "Fix"

This reverts commit 4da87cdfb8b8c3506b7fc7376cd2385ba8045c2a.

* Fix

* upd

* upd

* Fix

* Add unit test and update impl

* fix

* Clean up rgcn example code

* upd

* upd

* update

* Fix

* update score

* sparse for sage

* remove model sparse

* upd

* upd

* remove global norm

* revert delete model_sparse.py

* update according to comments

* Fix doc

* upd

* Fix test

* upd

* lint

* lint

* lint

* upd

* upd

* clean up
Co-authored-by: Ubuntu <ubuntu@ip-172-31-56-220.ec2.internal>

a7e941c3

[Performance] Improve COO to CSR, and sort columns of CSR only when necessary. (#2391) · 2576647c

nv-dlasalle authored Jan 26, 2021

* Remove double-checking sorted

* Remove sorting of CSR by default

* Update unit test to use unsorted matrix

* delete whitespace

* Expand unit tests

* Replace cusparse sort

* Fix row column sorting

* Explicitly don't sort columns

* Fix linting errors

* Fix bit-width calculation

* Fix sorting assertion and unit test

* Fix linting

* Improve CPU COO2CSR

* Remove references

* Rename and add documentation to edge encoding/decoding functions

* Fix sorting keys as 64 bit

* Revert cosmetic changes to unit tests

* Update documentation

* Update complexity documentation for coo to csr conversion

* Remove COOIsSorted check in CPU implementation too

2576647c

26 Jan, 2021 1 commit

[NN] Support scalar edge weight for GraphConv, SAGEConv and GINConv (#2557) · 0855d255

Tong He authored Jan 26, 2021

* add edge weight in forward

* fix lint

* fix

* fix

* address comments

* add utils

* add util to normalize in gcn way

* fix lint

* add unittest

* fix lint

* fix docstring

* fix docstring

* address comments

* improve notation consistency

* use preferred fn

0855d255

25 Jan, 2021 1 commit

[feature] Implement missing CUDA operators for COO format (part 1). (#2565) · 0f9056ed

Zihao Ye authored Jan 25, 2021

* upd

* upd

* upd

* upd

* fix

* upd

* upd
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>

0f9056ed