Commits · 5854ef5ec7bba7262cf80d544fa2246c57670746 · OpenDAS / dgl

07 Mar, 2023 3 commits
- [Enhancement]Speed up ToBlockCPU with concurrent id hash map (#5297) · 5854ef5e
  peizhou001 authored Mar 07, 2023
  
  5854ef5e
- [Misc] Update stale bot policy (#5419) · cce31e9a
  Minjie Wang authored Mar 07, 2023
```
Co-authored-by: Hongzhi (Steve), Chen <chenhongzhi.nkcs@gmail.com>
```
  cce31e9a
- [Dev] enable to specify torch version when create conda env (#5430) · 6093cc5b
  Rhett Ying authored Mar 07, 2023
```
* [Dev] enable to specify torch version when create conda env

* Update script/create_dev_conda_env.sh

* Update script/create_dev_conda_env.sh
```
  6093cc5b
06 Mar, 2023 4 commits

[DistDGL][UserEx]Sync parmetis_wrapper with changes in metadata.json (#5385) · 7b766393

kylasa authored Mar 06, 2023

* Sync parmetis_wrapper with changes in metadata.json

1. In preprocess.py, make sure that num_partitions is defined as an input argument. Also, align 'input_dir' with the input dataset: schema_file and graph_stats.txt are both assumed to be located inside input_dir.

2. Use the DGL_HOME environment variable so that the parmetis_wrapper command can be run from anywhere.

* Fix CI test failure cases.

* Addressing CI review comments.

* Addressing CI test failures.

* Applying lintrunner patch

7b766393
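
Item 2 of the commit above makes the wrapper location-independent through an environment variable. A minimal sketch of that pattern follows; the relative script path and the `/opt/dgl` value are hypothetical, since the commit message does not state the layout under DGL_HOME.

```python
import os
from pathlib import Path


def resolve_from_dgl_home(relative_path, environ=None):
    """Resolve a path relative to the directory named by DGL_HOME."""
    environ = os.environ if environ is None else environ
    dgl_home = environ.get("DGL_HOME")
    if not dgl_home:
        raise RuntimeError("DGL_HOME is not set")
    return Path(dgl_home) / relative_path


# "tools/some_script.py" and "/opt/dgl" are hypothetical, for illustration only.
script = resolve_from_dgl_home("tools/some_script.py", {"DGL_HOME": "/opt/dgl"})
```

Resolving against the variable rather than the current working directory is what lets the same command work from any directory.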

Support for no. of chunks smaller than no. of partitions. (#5390) · 894ad1e3

kylasa authored Mar 06, 2023

* Support for no. of chunks smaller than no. of partitions and Adding appropriate test cases.

The following changes are made in this PR:
1. Code changes to handle a no. of chunks smaller than the no. of partitions.
2. Re-adding test cases, which were previously deleted, for a no. of chunks smaller than the no. of partitions.
3. Adding test cases where multiple partitions are handled by a single process.

* Committing the missing files in this commit.

* lintrunner patch.

* lintrunner check

* lintrunner patch here.

* CI review comments.

894ad1e3

[Bugfix] Fix duplicate worker_init_fn argument when provided in DataLoader (#5420) · 851d66fa
Quan (Andy) Gan authored Mar 06, 2023
```
* fix duplicate worker_init_fn

* lint

* lint again

* uugh
```
851d66fa
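
The class of bug named in the commit title above can be sketched without PyTorch. This is not the actual DGL code: `make_loader`, `default_init`, and both wrappers are hypothetical stand-ins that only show how forwarding a keyword twice fails, and one way to pass it exactly once.

```python
def make_loader(dataset, worker_init_fn=None, **kwargs):
    """Stand-in for a DataLoader constructor; returns the arguments it got."""
    return {"dataset": dataset, "worker_init_fn": worker_init_fn, **kwargs}


def default_init(worker_id):
    """Hypothetical library-provided per-worker setup (does nothing here)."""


def buggy_wrapper(dataset, **kwargs):
    # Passes its own worker_init_fn AND forwards the user's kwargs.
    # If the user also supplied worker_init_fn, Python raises TypeError.
    return make_loader(dataset, worker_init_fn=default_init, **kwargs)


def fixed_wrapper(dataset, **kwargs):
    # Take the user's function out of kwargs so the keyword is passed once,
    # then chain it after the default setup.
    user_init = kwargs.pop("worker_init_fn", None)

    def combined(worker_id):
        default_init(worker_id)
        if user_init is not None:
            user_init(worker_id)

    return make_loader(dataset, worker_init_fn=combined, **kwargs)
```

Calling `buggy_wrapper(data, worker_init_fn=f)` raises `TypeError` because the keyword reaches `make_loader` twice, while `fixed_wrapper` runs both the default and the user function.
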
[BugFix] fix torch cuda version (#5426) · 26b245a0
Rhett Ying authored Mar 06, 2023

26b245a0

04 Mar, 2023 1 commit

[Model] Add `dgl.nn.CuGraphGATConv` model (#5168) · bfd411d0

Tingyu Wang authored Mar 04, 2023



* add CuGraphGATConv model

* lintrunner

* update model to reflect changes in make_mfg_csr(), move max_in_degree to forward()

* simplify pytest markers

* fall back to FG option for large fanout

* update error msg

* add feat_drop and activation options

* add residual option

* Update python/dgl/nn/pytorch/conv/cugraph_gatconv.py
Co-authored-by: Mufei Li <mufeili1996@gmail.com>

* Update python/dgl/nn/pytorch/conv/cugraph_gatconv.py
Co-authored-by: Mufei Li <mufeili1996@gmail.com>

* reset res_fc

---------
Co-authored-by: Mufei Li <mufeili1996@gmail.com>

bfd411d0

03 Mar, 2023 2 commits

[Utils] Edge and LINKX homophily measure (#5382) · f00cd6ef

Mufei Li authored Mar 03, 2023



* Update

* lint

* lint

* r prefix

* CI

* lint

* skip TF

* Update

* edge homophily

* linkx homophily

* format

* skip TF

* fix test

* update

* lint

* lint

* review

* lint

* update

* lint

* update

* CI

---------
Co-authored-by: Ubuntu <ubuntu@ip-172-31-36-188.ap-northeast-1.compute.internal>

f00cd6ef
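
For context on the measure named in the commit above, edge homophily is commonly defined as the fraction of edges whose endpoints share a label. The sketch below uses plain NumPy arrays rather than DGL graphs; the function name and signature are illustrative and are not the API added by this commit. LINKX homophily is not covered here.

```python
import numpy as np


def edge_homophily(src, dst, labels):
    """Fraction of edges whose two endpoints carry the same label."""
    src = np.asarray(src, dtype=np.int64)
    dst = np.asarray(dst, dtype=np.int64)
    labels = np.asarray(labels)
    if src.size == 0:
        raise ValueError("edge homophily is undefined for a graph with no edges")
    return float(np.mean(labels[src] == labels[dst]))


# Edges 0->1, 1->2, 2->3, 3->0 with labels [0, 0, 1, 1]:
# 0->1 and 2->3 join same-label nodes, so the ratio is 2 / 4.
h = edge_homophily([0, 1, 2, 3], [1, 2, 3, 0], [0, 0, 1, 1])
```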

[FIX] Ubuntu 22 build fix (#5272) · 85526e34
bgawrych authored Mar 03, 2023
```
* Fix ubuntu 22 build

* Add one more flag
```
85526e34

02 Mar, 2023 1 commit
- [Doc] Fix typos in minibatch-edge.rst (#5308) · 34d64754
  czkkkkkk authored Mar 02, 2023
  
  34d64754
01 Mar, 2023 3 commits
- Revert "Set USE_LIBXSMM default to OFF. (#5287)" (#5392) · 325e795a
  Hongzhi (Steve), Chen authored Mar 01, 2023
```
This reverts commit a5e31391.
```
  325e795a
- [Dataset] Add CLUSTER dataset (#5389) · a53ecd22
  Zhiteng Li authored Mar 01, 2023
```
* add CLUSTER dataset

* refine according to dongyu's comments

---------
Co-authored-by: rudongyu <ru_dongyu@outlook.com>
```
  a53ecd22
- removed pragma omp for (#5334) · 308bd6f5
  Kacper Pietkun authored Mar 01, 2023
  
  308bd6f5
28 Feb, 2023 2 commits

Distributed Lookup Service Robustness (#5387) · cf752077

kylasa authored Feb 28, 2023

Handle a corner case in the distributed lookup service: the get_partition_ids function being invoked with an empty request. This is needed because get_partition_ids uses an alltoall function.

cf752077
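
The reason an empty request needs handling is that an alltoall exchange is collective: every rank has to take part, even one with nothing to ask. The following is a single-process simulation under stated assumptions, not the DGL implementation. The `alltoall` helper, the `g % n` routing rule, and the `owner_of` table are all hypothetical.

```python
def alltoall(send):
    """Simulated alltoall: send[i][j] is what rank i sends to rank j.

    Returns recv, where recv[j][i] is what rank j received from rank i.
    """
    n = len(send)
    return [[send[i][j] for i in range(n)] for j in range(n)]


def lookup_round(requests_per_rank, owner_of):
    """One lookup round; every rank joins both exchanges, even with no ids."""
    n = len(requests_per_rank)
    # Each rank buckets its ids by the rank assumed to serve them (g % n).
    send = [[[g for g in req if g % n == j] for j in range(n)]
            for req in requests_per_rank]
    received = alltoall(send)
    # Each serving rank answers what it received; empty lists stay empty.
    replies = [[[owner_of[g] for g in ids] for ids in row] for row in received]
    answers = alltoall(replies)
    result = []
    for i in range(n):
        mapping = {}
        for j in range(n):
            mapping.update(zip(send[i][j], answers[i][j]))
        result.append(mapping)
    return result


owners = {0: 1, 1: 0, 2: 1, 3: 0}
# Rank 1 has an empty request but still participates in both exchanges.
per_rank = lookup_round([[0, 1, 3], []], owners)
```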

[Sparse] Support conversion to/from torch sparse tensor. (#5388) · 999c6245
czkkkkkk authored Feb 28, 2023
```
* [Sparse] Support conversion to/from torch sparse tensor.

* Update
```
999c6245

27 Feb, 2023 3 commits
- [Refactor] Extract common code in gpu and cpu ToBlock (#5305) · 11d12f3c
  peizhou001 authored Feb 27, 2023
  
  11d12f3c
- [CI] enable more options for conda env creation (#5386) · 2238386a
  Rhett Ying authored Feb 27, 2023
```
* [CI] enable more options for conda env creation

* update
```
  2238386a
- [CI] add always_yes mode for conda env creation (#5384) · c396942d
  Rhett Ying authored Feb 27, 2023
```
* add always_yes mode for conda env creation

* Update create_dev_conda_env.sh
```
  c396942d
25 Feb, 2023 1 commit

[DistDGL][Feature_Request]Changes in the metadata.json file for input graph dataset. (#5310) · a14f69c9

kylasa authored Feb 24, 2023

* Implemented the following changes.

* Remove NUM_NODES_PER_CHUNK
* Remove NUM_EDGES_PER_CHUNK
* Remove the dependency between no. of edge files per edge type and no. of partitions
* Remove the dependency between no. of edge feature files per edge type and no. of partitions
* Remove the dependency between no. of edge feature files and no. of edge files per edge type.
* Remove the dependency between no. of node feature files and no. of partitions
* Add “node_type_counts”. This will be a list of integers. Each integer will represent the total count of a node-type. The index in this list and the index in the “node_type” list will be the same for a given node-type.
* Add “edge_type_counts”. This will be a list of integers. Each integer will represent the total count of an edge-type. The index in this list and the index in the “edge_type” list will be the same for a given edge-type.

* Applying lintrunner patch.

* Adding missing keys to the metadata in the unit test framework.

* lintrunner patch.

* Resolving CI test failures due to merge conflicts.

* Applying lintrunner patch

* applying lintrunner patch

* Replacing tabspace with spaces - to satisfy lintrunner

* Fixing the CI Test Failure cases.

* Applying lintrunner patch

* lintrunner complaining about a blank line.

* Resolving issues with print statement for NoneType

* Removed the arbitrary-chunks tests, since this functionality is no longer supported.

* Addressing CI review comments.

* addressing CI review comments

* lintrunner patch

* lintrunner patch.

* Addressing CI review comments.

* lintrunner patch.

a14f69c9
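
The index alignment described for “node_type_counts” and “edge_type_counts” in the commit above can be illustrated with a small fragment. The four key names come from the commit message; the type names and counts are made up, and `counts_by_type` is a hypothetical helper, not part of DGL.

```python
# Hypothetical metadata fragment; the type names and counts are made up.
metadata = {
    "node_type": ["paper", "author"],
    "node_type_counts": [1000, 250],
    "edge_type": ["author:writes:paper", "paper:cites:paper"],
    "edge_type_counts": [3000, 5000],
}


def counts_by_type(meta, type_key, count_key):
    """Pair each type name with the count stored at the same list index."""
    names, counts = meta[type_key], meta[count_key]
    if len(names) != len(counts):
        raise ValueError(f"{type_key} and {count_key} differ in length")
    return dict(zip(names, counts))


node_counts = counts_by_type(metadata, "node_type", "node_type_counts")
edge_counts = counts_by_type(metadata, "edge_type", "edge_type_counts")
```

The length check matters because the pairing is purely positional: a missing entry in either list would silently shift every count after it.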

24 Feb, 2023 2 commits

[Utils] Node homophily measure (#5376) · fcf5ad5f

Mufei Li authored Feb 24, 2023



* Update

* lint

* lint

* r prefix

* CI

* lint

* skip TF

* Update

---------
Co-authored-by: Ubuntu <ubuntu@ip-172-31-36-188.ap-northeast-1.compute.internal>

fcf5ad5f
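
Node homophily is commonly defined as the average, over nodes, of the fraction of a node's neighbors that share its label. The NumPy sketch below is illustrative and is not the function added by the commit above. It assumes directed edges, treats the sources of incoming edges as neighbors, and lets nodes without incoming edges contribute 0.

```python
import numpy as np


def node_homophily(src, dst, labels):
    """Mean over nodes of the share of in-neighbors with the node's label."""
    src = np.asarray(src, dtype=np.int64)
    dst = np.asarray(dst, dtype=np.int64)
    labels = np.asarray(labels)
    n = labels.shape[0]
    same = (labels[src] == labels[dst]).astype(np.float64)
    # Per destination node: number of same-label in-edges and total in-edges.
    same_count = np.bincount(dst, weights=same, minlength=n)
    in_degree = np.bincount(dst, minlength=n)
    # Nodes with no incoming edges contribute 0 (an assumption of this sketch).
    per_node = np.divide(same_count, in_degree,
                         out=np.zeros(n), where=in_degree > 0)
    return float(per_node.mean())


# Per-node ratios are [0, 1, 0, 1] for this graph, so the mean is 0.5.
h = node_homophily([0, 1, 2, 3, 0], [1, 2, 3, 0, 2], [0, 0, 1, 1])
```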

[Sparse] Support column-wise softmax (#5377) · 5ffd2a02

czkkkkkk authored Feb 24, 2023



* [Sparse] Support column-wise softmax

* Update python/dgl/sparse/softmax.py
Co-authored-by: Mufei Li <mufeili1996@gmail.com>

---------
Co-authored-by: Mufei Li <mufeili1996@gmail.com>

5ffd2a02
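
Column-wise softmax on a sparse matrix normalizes the stored entries that share a column, so each column's nonzeros sum to 1. The following NumPy sketch works on COO column indices and values. It is an illustration of the operation, not the code in python/dgl/sparse/softmax.py, and the function name is hypothetical.

```python
import numpy as np


def column_softmax(col, val, num_cols):
    """Softmax over the stored entries of each column of a COO matrix."""
    col = np.asarray(col, dtype=np.int64)
    val = np.asarray(val, dtype=np.float64)
    # Subtract each column's maximum before exponentiating, for stability.
    col_max = np.full(num_cols, -np.inf)
    np.maximum.at(col_max, col, val)
    exp = np.exp(val - col_max[col])
    # Sum the exponentials per column, then normalize each entry.
    col_sum = np.bincount(col, weights=exp, minlength=num_cols)
    return exp / col_sum[col]


# Column 0 holds two equal entries, column 1 holds one entry.
out = column_softmax([0, 0, 1], [0.0, 0.0, 5.0], num_cols=2)
```

Only stored entries take part, so implicit zeros do not receive any probability mass.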

23 Feb, 2023 6 commits

New script for customers to validate partitioned graph objects (#5340) · c42fa8a5
kylasa authored Feb 23, 2023
```
* A new script to validate graph partitioning pipeline

* Addressing CI review comments.

* lintrunner patch.
```
c42fa8a5

[DistDGL][Robustness]Uneven distribution of input graph files for nodes/edges and features. (#5227) · bbc538d9

kylasa authored Feb 23, 2023

* Uneven distribution of nodes/edges/features

To handle unevenly sized files for nodes/edges and for node and edge features, we have to synchronize before starting a large no. of messages (either one large message or a burst of messages).

* Applying lintrunner patch.

* Removing tabspaces for lintrunner.

* lintrunner patch.

* removed issues introduced by the merge conflicts. Lots of code was repeated

bbc538d9

[DistDGL][Mem_Optimizations]get_partition_ids, service provided by the distributed lookup service has high memory footprint (#5226) · 61b6edab

kylasa authored Feb 23, 2023

* get_partition_ids, service provided by the distributed lookup service has high memory footprint

The get_partition_ids function, which is used to retrieve the owner processes of a given list of global node ids, has a high memory footprint. Currently this is of the order of 8x the size of the input list.

For massively large datasets, these memory needs are unrealistic and may result in OOM. In the case of CoreGraph, when retrieving the owners of an edge list of size 6 Billion edges, the memory needs can be as high as 8*8*8 = 256 GB.

To limit the amount of memory used by this function, we split the message sent to the distributed lookup service so that each message carries at most 200 million global node ids. This reduces the memory footprint of the entire function to no more than 0.2 * 8 * 8 = 13 GB, which is within reasonable limits.

Because we now send multiple small messages instead of one large message to the distributed lookup service, this may consume more wall-clock time than the earlier implementation.

* lintrunner patch.

* using np.ceil() per suggestion.

* converting the output of np.ceil() as ints.

61b6edab
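
The splitting described in the commit above can be sketched as follows. This is a minimal illustration, not the DGL code: the function names are hypothetical, and the estimate assumes 8 bytes per id and the 8x overhead quoted in the message. The message count uses `np.ceil` converted to `int`, matching the last two bullets.

```python
import numpy as np


def split_requests(global_ids, max_ids_per_message):
    """Split ids into consecutive chunks of at most max_ids_per_message."""
    num_messages = int(np.ceil(len(global_ids) / max_ids_per_message))
    return [
        global_ids[i * max_ids_per_message:(i + 1) * max_ids_per_message]
        for i in range(num_messages)
    ]


def estimated_gb(num_ids, bytes_per_id=8, overhead=8):
    """Rough footprint in GB; both defaults are assumptions of this sketch."""
    return num_ids * bytes_per_id * overhead / 1e9


chunks = split_requests(np.arange(10), max_ids_per_message=4)  # sizes 4, 4, 2
per_message_gb = estimated_gb(200_000_000)  # 0.2 * 8 * 8 = 12.8
```

An empty id list yields zero messages here, so a caller that must still take part in a collective exchange would need to handle that case separately.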

[Bugfix] fixed leak in SpMMCreateBlocks (#5210) · 99937422
Kacper Pietkun authored Feb 23, 2023
```
* fixed leak in SpMMCreateBlocks

* clang format
```
99937422

[Model] Implemented SubgraphX Explainer for Homogeneous graph (#5315) · 45153fc0

Kunal Mukherjee authored Feb 22, 2023



* subgraphx commit

* nits

* newline eof added

* lint fix

* test script updated to use default values

* lint fix

* graphs that are used for test cases are updated to a small graph

* lint formatted

* test parameter adj to complete the test under 20s

* lint fixes

---------
Co-authored-by: kxm180046 <kxm180046@utdallas.edu>

45153fc0

[Sparse] Stack SparseMatrix COO row and column coordinates into one tensor. (#5314) · 73a508e1
czkkkkkk authored Feb 23, 2023

73a508e1
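
Stacking COO coordinates means keeping the row and column index vectors in one array of shape (2, nnz) instead of two separate arrays. A small NumPy illustration of that layout follows; the actual change operates on the library's own tensors, so NumPy here is only a stand-in.

```python
import numpy as np

# Coordinates of four nonzero entries.
row = np.array([0, 1, 2, 2])
col = np.array([1, 0, 0, 2])

# One array of shape (2, nnz): first row holds row ids, second holds column ids.
indices = np.stack([row, col])

# Unpacking along the first axis recovers the two coordinate vectors.
row_again, col_again = indices
```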

22 Feb, 2023 5 commits

[DistDGL] Memory optimization to reduce memory footprint of the Dist Graph partitioning pipeline. (#5130) · 5ea04713

kylasa authored Feb 22, 2023

* Wrap np.argsort() in a function.

Use a Python wrapper for the np.argsort() function for better usage of system memory.

* lintrunner patch.

* lintrunner patch.

* Changes to address code review comments.

5ea04713

[Refactor] Add default ffi namespace capi (#5359) · 7ff04152
peizhou001 authored Feb 22, 2023

7ff04152
[Sparse] Lower the accuracy requirement of test_twirls in test. (#5364) · 30b89e6a
Hongzhi (Steve), Chen authored Feb 22, 2023
```
Co-authored-by: Ubuntu <ubuntu@ip-172-31-28-63.ap-northeast-1.compute.internal>
```
30b89e6a

[Model] Add `dgl.nn.CuGraphSAGEConv` model (#5137) · bcf9923b

Tingyu Wang authored Feb 22, 2023



* add CuGraphSAGEConv model

* fix lint issues

* update model to reflect changes in make_mfg_csr(), move max_in_degree to forward()

* lintrunner

* allow reset_parameters()

* remove norm option, simplify test

* allow full graph fallback option, add example

* address comments

* address reviews

---------
Co-authored-by: Mufei Li <mufeili1996@gmail.com>

bcf9923b

[Release] Bump nightly version (#5357) · d8ca6317
Quan (Andy) Gan authored Feb 22, 2023
```
* bump version

* Update update_version.py
```
d8ca6317

21 Feb, 2023 7 commits
- [Misc] Autoformat python dgl. (#5335) · 9ce80e85
  Hongzhi (Steve), Chen authored Feb 21, 2023
```
* autofix

* sort

* sort

---------
Co-authored-by: Ubuntu <ubuntu@ip-172-31-28-63.ap-northeast-1.compute.internal>
```
  9ce80e85
- [Misc] Fix typo in status.py (#5362) · 02f2526b
  Hongzhi (Steve), Chen authored Feb 21, 2023
```
Co-authored-by: Ubuntu <ubuntu@ip-172-31-28-63.ap-northeast-1.compute.internal>
```
  02f2526b
- [Misc] Treat aborted as failure in Jenkins. (#5361) · 085b19d7
  Hongzhi (Steve), Chen authored Feb 21, 2023
```
Co-authored-by: Ubuntu <ubuntu@ip-172-31-28-63.ap-northeast-1.compute.internal>
```
  085b19d7
- autofix (#5337) · 529b2662
  Hongzhi (Steve), Chen authored Feb 21, 2023
```
Co-authored-by: Ubuntu <ubuntu@ip-172-31-28-63.ap-northeast-1.compute.internal>
```
  529b2662
- [Misc] All overrun on master CI. (#5360) · 5bfa8137
  Hongzhi (Steve), Chen authored Feb 21, 2023
```
* exclude_master

* fix

---------
Co-authored-by: Ubuntu <ubuntu@ip-172-31-28-63.ap-northeast-1.compute.internal>
```
  5bfa8137
- [Misc] Update Jenkins status. (#5356) · e41ce0c6
  Hongzhi (Steve), Chen authored Feb 21, 2023
```
* test

* blabla

* add

* reformat

* balbla

* rollback

* remove

---------
Co-authored-by: Ubuntu <ubuntu@ip-172-31-28-63.ap-northeast-1.compute.internal>
```
  e41ce0c6
- [CI] use more light-weight node for lint check (#5358) · 197f1d25
  Rhett Ying authored Feb 21, 2023
  
  197f1d25