Commits · 96cd2ee6533a13d2d7a74039380cfe615a3c5b13 · OpenDAS / dgl

06 Nov, 2021 1 commit

[Performance][GPU] Improve _SegmentCopyKernel() (#3470) · 96cd2ee6

ayasar70 authored Nov 06, 2021



* Based on issue #3436. Improving _SegmentCopyKernel's GPU utilization by switching to nonzero-based thread assignment

* fixing lint issues

* Update cub for cuda 11.5 compatibility (#3468)

* fixing type mismatch

* tx is guaranteed to be smaller than nnz, hence removing the last check

* minor: updating comment

* adding three unit tests for csr slice method to cover some corner cases
Co-authored-by: Abdurrahman Yasar <ayasar@nvidia.com>
Co-authored-by: nv-dlasalle <63612878+nv-dlasalle@users.noreply.github.com>
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>
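
The nonzero-based thread assignment mentioned above can be illustrated with a small sequential sketch. This is not the CUDA kernel from the commit; it is a plain-Python model that assumes a CSR layout, and the names `segment_copy`, `indptr`, `data`, and `rows` are hypothetical. Each loop iteration stands in for one GPU thread `tx` that handles exactly one output nonzero and uses a binary search to find the row segment it belongs to.

```python
from bisect import bisect_right


def segment_copy(indptr, data, rows):
    """Copy the CSR segments of the selected rows into one output array."""
    # out_indptr[i] is the output offset where the i-th selected row starts.
    out_indptr = [0]
    for r in rows:
        out_indptr.append(out_indptr[-1] + indptr[r + 1] - indptr[r])
    nnz = out_indptr[-1]

    out = [None] * nnz
    # One "thread" per output nonzero, so tx < nnz holds by construction
    # and no trailing bounds check is needed.
    for tx in range(nnz):
        # Binary search for the selected row whose segment contains tx.
        i = bisect_right(out_indptr, tx) - 1
        src = indptr[rows[i]] + (tx - out_indptr[i])
        out[tx] = data[src]
    return out_indptr, out


# Rows 2, 1 (empty) and 0 of a 4-row CSR matrix.
offsets, values = segment_copy([0, 2, 2, 5, 6], [10, 11, 20, 21, 22, 30], [2, 1, 0])
```

Because work is divided by nonzero rather than by row, rows with very different lengths contribute evenly to the per-thread load, which is one plausible reading of the utilization gain the commit describes.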

96cd2ee6

05 Nov, 2021 2 commits
- fix test dataloader (#3482) · efe0b061
  Xin Yao authored Nov 05, 2021
```
Co-authored-by: Rhett Ying <85214957+Rhett-Ying@users.noreply.github.com>
```
  efe0b061
- [Doc] Evaluation Tutorial for Link Prediction (#3463) · c5ae54bf
  Quan (Andy) Gan authored Nov 05, 2021
```
* link prediction tutorial

* add performance tip

* Update L2_large_link_prediction.py
```
  c5ae54bf
04 Nov, 2021 3 commits

[BugFix] Fix bugs in GPU sampling and enable unit tests for dataloaders on the GPU (#3474) · b717c8bf

Xin Yao authored Nov 05, 2021



* enable unit tests for dataloader on the GPU

* fix compatibility

* copyright

* fix linting
Co-authored-by: nv-dlasalle <63612878+nv-dlasalle@users.noreply.github.com>

b717c8bf

[Feature] aten::Relabel_() for the GPU (#3445) · d3ae7544

Xin Yao authored Nov 04, 2021



* relabel gpu

* unittest for Relabel_ on the GPU

* finish Relabel_ for the GPU

* copyright

* re-enable the unittest for edge_subgraph on the GPU

* fix unittest for tensorflow

* use a fixed number of threads
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>
Co-authored-by: nv-dlasalle <63612878+nv-dlasalle@users.noreply.github.com>
Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>

d3ae7544

[Feature] k-hop Subgraph Extraction (#3458) · f46080a4

Mufei Li authored Nov 04, 2021

* Update

* Fix

* Fix

* Update

* Update

* Update

* Fix CI

* Fix

* Fix

* Fix

* Update

* Update

* Update

* Fix

* Fix

* Fix for TF

f46080a4

03 Nov, 2021 3 commits

Update CONTRIBUTORS.md (#3475) · 64f20eea
Shaked Brody authored Nov 03, 2021

64f20eea

[NN][Model] GATv2 (#3473) · e2f33fd5

Shaked Brody authored Nov 03, 2021



* [Model][Core] GATv2

* lint

* gatv2conv.py

* lint

* lint

* style and docs

* lint

* gatv2conv fix
Co-authored-by: Shaked Brody shakedbr@campus.technion.ac.il <shakedbr@tangerine.cslcs.technion.ac.il>
Co-authored-by: Mufei Li <mufeili1996@gmail.com>

e2f33fd5

Update cub for cuda 11.5 compatibility (#3468) · f5102145
nv-dlasalle authored Nov 02, 2021

f5102145

29 Oct, 2021 1 commit
- fix compatibility with PyTorch 1.10 (#3454) · cac25f63
  Quan (Andy) Gan authored Oct 29, 2021
  
  cac25f63
28 Oct, 2021 1 commit
- update contributors (#3451) · 9067565a
  Xin Yao authored Oct 28, 2021
  
  9067565a
27 Oct, 2021 1 commit

[NN] Add EGATConv nn.module (#3425) · 51c65097

Kamil Kamiński authored Oct 27, 2021



* added nn pytorch egatconv

* aligned with test build

* aligned with test build

* fixed white spaces

* fixed white spaces

* fixed white spaces

* added missing egatconv in imports

* added indentation in forward

* GATConv based implementation

* removed **kw_args

* added dgl relative imports

* PR corrections

* added DGL Error to EGATConv imports

* Update test_nn.py
Co-authored-by: Argusmocny <k.kaminski@cent.uw.edu.pl>
Co-authored-by: Mufei Li <mufeili1996@gmail.com>

51c65097

26 Oct, 2021 2 commits
- Fix #3437 (#3440) · a9c83bce
  Jinjing Zhou authored Oct 26, 2021
  
  a9c83bce
- Update README.md (#3442) · 579cd3eb
  Hongyu Cai authored Oct 26, 2021
  
  579cd3eb
21 Oct, 2021 1 commit

[Sampling] Implement dgl.compact_graphs() for the GPU (#3423) · a8c81018

Xin Yao authored Oct 21, 2021

* gpu compact graph template

* cuda compact graph draft

* fix typo

* compact graphs

* pass unit test but fail in training

* example using EdgeDataLoader on the GPU

* refactor cuda_compact_graph and cuda_to_block

* update training scripts

* fix linting

* fix linting

* fix exclude_edges for the GPU

* add --data-cpu & fix copyright

a8c81018

19 Oct, 2021 1 commit
- [Doc] remove duplicate papers (#3393) · 308e52a3
  Cheng Wan authored Oct 19, 2021
```
* remove duplicate paper

* Update README.md
```
  308e52a3
18 Oct, 2021 4 commits

[Fix] Split nccl sparse push into two groups (#3404) · c560040f
nv-dlasalle authored Oct 18, 2021

c560040f

[Performance] Parallelize CSRSliceRows() (#3409) · aa11aaa4

David Min authored Oct 18, 2021



* parallelize CSRRowSlice()

* use parallel_for for the second loop
Co-authored-by: nv-dlasalle <63612878+nv-dlasalle@users.noreply.github.com>
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>

aa11aaa4

[BugFix] Avoid Memory Leak Issue in PyTorch Backend (#3386) · ff94ee80

Cheng Wan authored Oct 18, 2021



* try to avoid memory leak

* try to avoid memory leak

* avoid memory leak with no hope

* Revert "avoid memory leak with no hope"

This reverts commit c77befe9479f46758e744642f66dd209b50eef7d.

* no message

* Update sparse.py

* Update tensor.py
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>

ff94ee80

[Bugfix][Pytorch] Fix model save and load bug of stgcn_wave (#3303) · f7039418
HaoWei-TomTom authored Oct 18, 2021
```
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>
```
f7039418

15 Oct, 2021 2 commits

[Bug] Fix edge exclusion still not working for full neighbor sampling (#3424) · b81bb914
Quan (Andy) Gan authored Oct 15, 2021

b81bb914

[Bugfix] Add UVM specialized IndexSelect kernels which perform boundary checks (#3293) · 4f5c3aa2

David Min authored Oct 15, 2021



* Add pytorch-direct version

* remove

* add documentation for UnifiedTensor

* Revert "add documentation for UnifiedTensor"

This reverts commit 63ba42644d4aba197c1cb4ea4b85fa1bc43b8849.

* add boundary check for UVM IndexSelect

* relocate boundary check index kernels to cuda

* fix function name

* fix indexkernel in nccl api

* fix argument ordering

* simplify code

* Add a comment for the uvm version
Co-authored-by: shhssdm <shhssdm@gmail.com>
Co-authored-by: xiang song(charlie.song) <classicxsong@gmail.com>
Co-authored-by: Minjie Wang <wmjlyjemaine@gmail.com>

4f5c3aa2

14 Oct, 2021 6 commits

[Fix] Use ==/!= to compare constant literals (str, bytes, int, float, tuple) (#3415) · 04ed6126

Christian Clauss authored Oct 14, 2021

* Use ==/!= to compare constant literals (str, bytes, int, float, tuple)

Avoid Syntax Warnings on Python >= 3.8

```
$ python3
>>> "" == ""
True
>>> "" is ""
<stdin>:1: SyntaxWarning: "is" with a literal. Did you mean "=="?
True
```

* Use ==/!= to compare constant literals (str, bytes, int, float, tuple)
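
The difference behind this warning can be shown with a short sketch. `==` compares values, while `is` compares object identity, so an `is` test against a literal only works when the interpreter happens to reuse the same object. The helper name `is_empty_label` is hypothetical and used only for illustration.

```python
def is_empty_label(label):
    # Value comparison: correct for str, bytes, int, float, and tuple.
    return label == ""


built = "".join([])  # an empty string produced at run time
assert is_empty_label(built)

# Identity is a separate question from equality: two equal lists
# are always distinct objects.
a = [1, 2]
b = [1, 2]
print(a == b, a is b)  # prints: True False
```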

04ed6126

[PyTorch][Bugfix] Use uint8 instead of bool in pytorch to be compatible with... · b81efb2b

nv-dlasalle authored Oct 14, 2021


[PyTorch][Bugfix] Use uint8 instead of bool in pytorch to be compatible with nightly version (#3406)

* Use uint8 instead of bool in pytorch

* Handle type aliases

* Fix syntax error
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>

b81efb2b

[DOCS] Add training on CPU sections to docs (#3398) · a47ab71d
mszarma authored Oct 14, 2021

a47ab71d

[Bugfix] three bugs related to using DGL as a subdirectory (third_party) of another project. (#3379) · 18863069

zexi yuan authored Oct 14, 2021

* [Bugfix] fix a compile error for Debug-BuildType on Windows Platform

When using CMakeLists.txt to build the "Debug" BuildType on the Windows Platform, it has three compile errors (C4716) in the file "dgl\src\runtime\shared_mem.cc":

'dgl::runtime::SharedMemory::CreateNew': must return a value
'dgl::runtime::SharedMemory::Open': must return a value
'dgl::runtime::SharedMemory::Exist': must return a value

* [Bugfix] cmake error "cannot find load file" when DGL is used as a subdirectory on Linux

When DGL is used as a subdirectory in a CMake project, "CMAKE_SOURCE_DIR" here returns the parent CMake scope's directory, which is not the expected directory.
It may be better to use "CMAKE_CURRENT_SOURCE_DIR" to set "GKLIB_PATH".

* [Bugfix] cmd cmake error when DGL is used as a subdirectory

When DGL is a subdirectory of another project, the WORKING_DIRECTORY of "add_custom_command" at line 255 of "CMakeLists.txt" is incorrect, which causes a cmake "setlocal" error.

18863069

[Fix] Fix edge ID exclusion not working in EdgeDataLoader (#3412) · 5d4f6bca
Quan (Andy) Gan authored Oct 14, 2021

5d4f6bca
[Bug] Do not skip graphconv even no edge exists (#3416) · 8798872f
Rhett Ying authored Oct 14, 2021

8798872f

12 Oct, 2021 2 commits
- [BugFix] add count_nonzero() into SA_Client (#3417) · 7c7b60be
  Rhett Ying authored Oct 12, 2021
  
  7c7b60be
- [Bug] check dtype before convert to gk (#3414) · 2d88db5a
  Rhett Ying authored Oct 12, 2021
  
  2d88db5a
11 Oct, 2021 2 commits
- [README] Add GNNLens (#3411) · d9472873
  Mufei Li authored Oct 11, 2021
```
* Update README.md

* Update README.md
```
  d9472873
- backward now stores DGLGraph index, not DGLGraph object with attached data (#3410) · 532eaa87
  Israt Nisa authored Oct 11, 2021
```
Co-authored-by: Israt Nisa <nisisrat@amazon.com>
```
  532eaa87
07 Oct, 2021 1 commit

[Model] Refine GraphSAINT (#3328) · aef96dfa

K authored Oct 07, 2021

* The start of experiments of Jiahang Li on GraphSAINT.

* a nightly build

Check the basic pipeline of the code. Next, check the details of the samplers, the GCN layer (forward propagation), and the loss (backward propagation).

* a night build

* Implement GraphSAINT with torch.dataloader

There are still some bugs with sampling in the training procedure.

* Test validity

Validity tests succeeded on the ppi_node experiments; other setups are not yet tested.
1. Online sampling on the ppi_node experiments performs perfectly.
2. Sampling is a bit slow because of the operations on [dgl.subgraphs]; the next step is to improve this part by parallelizing the conversion.
3. Figure out why the offline+online sampling method performs badly, which does not make sense.
4. Run experiments on other setups.

* Implement saint with torch.dataloader

Use torch.dataloader to speed up SAINT sampling in the experiments. Apart from the Amazon dataset, which is too large, we have run experiments on the other four datasets: ppi, flickr, reddit and yelp. Preliminary results show that the time consumed and the metrics reach a reasonable level. The next step is to use a more accurate profiler, line_profiler, to measure the time consumed, and to adjust num_workers to speed up sampling on certain datasets.

* a nightly build

* Update .gitignore

* reorganize codes

Reorganize some codes and comments.

* a nightly build

* Update .gitignore

* fix bugs

Fix the bugs that kept fully offline sampling and the author's version from working

* reorganize files and codes

Reorganize files and codes then do some experiments to test the performance of offline sampling and online sampling

* do some experiments and update README

* a nightly build

* Update README.md

* delete unnecessary files

* Update README.md

* a nightly update

1. handle directory named 'graphsaintdata'
2. control graph shift between gpu and cpu related to large dataset ('amazon')
3. remove parameter 'train'
4. refine annotations of the sampler
5. update README.md including updating dataset info, dependencies info, etc

* a nightly update

explain config differences in TEST part
remove a sampling time variant
make 'online' an argument
change 'norm' to 'sampler'
explain parameters in README.md

* Update README.md

* a nightly build

* make online an argument
* refine README.md
* refine codes of `collate_fn` in sampler.py, in training phase only return one subgraph, no need to check if the number of subgraphs larger than 1

* Update sampler.py

check the problem on flickr is about overfitting.

* a nightly update

Fix the overfitting problem on the `flickr` dataset. We need to restrict the number of subgraphs (and hence the number of iterations) used in each epoch of the training phase; otherwise the model might overfit when validating at the end of each epoch. The limit is computed with a formula specified by the author.

* Set up a new flag `full` specifying if the number of subgraphs used in training phase equals to that of pre-sampled subgraphs

* Modify code and annotations related to the new flag

* Add a new parameter called `node_budget` in the base class `SAINTSampler` to compute the specific formula

* set `gpu` as a command line argument

* Update README.md

* Finish the experiments on Flickr, which is done after adding new flag `full`

* a nightly update

* use half of edges in the original graph to do sampling
* test dgl.random.choice with or without replacement with half of edges
~ next, test whether moving the probability calculation out of __getitem__ speeds up sampling, and try to implement the author's sampling method

* employ cython to implement edge sampling for per edge

* employ cython to implement edge sampling for per edge
* doing experiments to test consumed time and performance
** the consumed time decreased to approximately 480s, and the performance decreased by about 5 points.
* deprecate cython implementation

* Revert "employ cython to implement edge sampling for per edge"

* This reverts commit 4ba4f092
* Deprecate cython implementation
* Reserve half-edges mechanism

* a nightly update

* delete unnecessary annotations
Co-authored-by: Mufei Li <mufeili1996@gmail.com>

aef96dfa

30 Sep, 2021 1 commit
- [BugFix] extract gz into target dir (#3389) · f9fd7fd7
  Rhett Ying authored Sep 30, 2021
  
  f9fd7fd7
29 Sep, 2021 1 commit

[Feature] enable create/set/free cuda stream for internal use (#3334) · e234fcfa

Rhett Ying authored Sep 29, 2021

* [Feature] enable create/set/free cuda stream for internal use

* add unit test

* fix unit test failure on mxnet and tf

* refactor stream wrapper

* fix lint error

* fix lint error

e234fcfa

28 Sep, 2021 1 commit
- [Feature] Implement one thread multiple socket (#3200) · 5cf48fc6
  Jingcheng Yu authored Sep 28, 2021
```
Co-authored-by: JingchengYu94 <jingchengyu94@gmail.com>
```
  5cf48fc6
23 Sep, 2021 2 commits

[Distributed] Allow user to pass-in extra env parameters when launching a... · 179d6aab

xiang song(charlie.song) authored Sep 23, 2021


[Distributed] Allow user to pass-in extra env parameters when launching a distributed training task. (#3375)

* Allow user to pass-in extra env parameters when launching a distributed training task.

* Update

* upd
Co-authored-by: xiangsx <xiangsx@ip-10-3-59-214.eu-west-1.compute.internal>

179d6aab

Fix torch import in example (#3372) · 367a3a34
Junwen Yao authored Sep 22, 2021

367a3a34

22 Sep, 2021 1 commit
- [Feature] Graceful handling of exceptions thrown within OpenMP blocks (#3353) · a04a8d06
  Quan (Andy) Gan authored Sep 22, 2021
```
* graceful c++ exception in OpenMP

* credits

* add test
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>
```
  a04a8d06
21 Sep, 2021 1 commit

[Feature] Exclude edges in sample_neighbors (#2971) · bc14829f

mszarma authored Sep 21, 2021



* [Feature] Exclude edges in sample_neighbors

Extending the sample_neighbors and sample_frontier
APIs to support the exclude_edges parameter.

exclude_edges supports tensor and dict data.
The feature enables excluding certain edges
during neighborhood sampling.
exclude_edges contains the EIDs of edges
that will be excluded
during neighbor picking for seed nodes.

Added test case for heterograph and homograph
RFC issue id: 2944
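
The behaviour described above can be sketched in plain Python. This is not DGL's implementation and does not call DGL; the function `sample_in_edges`, its `in_edges` mapping of node to `(eid, src)` pairs, and the `rng` argument are all hypothetical. The sketch assumes excluded edge IDs are filtered out before neighbors are picked for each seed node.

```python
import random


def sample_in_edges(in_edges, seeds, fanout, exclude_eids=(), rng=None):
    """Pick up to `fanout` in-edges per seed node, skipping excluded edge IDs."""
    rng = rng or random.Random()
    excluded = set(exclude_eids)
    picked = {}
    for v in seeds:
        # Drop excluded edges first so they can never be selected.
        candidates = [(eid, src) for eid, src in in_edges.get(v, [])
                      if eid not in excluded]
        if len(candidates) > fanout:
            candidates = rng.sample(candidates, fanout)
        picked[v] = candidates
    return picked


# Node 0 has three in-edges, node 1 has one; edges 1 and 3 are excluded.
graph = {0: [(0, 1), (1, 2), (2, 3)], 1: [(3, 0)]}
result = sample_in_edges(graph, seeds=[0, 1], fanout=2, exclude_eids=[1, 3],
                         rng=random.Random(0))
```

Filtering before sampling means the fanout is filled from the remaining edges only, so a seed whose edges are all excluded ends up with no sampled neighbors.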

* compatibility

* fix

* fix
Co-authored-by: Quan Gan <coin2028@hotmail.com>

bc14829f