Commits · aef96dfa34519e09fc956789bd7edb6307f404f7 · OpenDAS / dgl

07 Oct, 2021 1 commit

[Model] Refine GraphSAINT (#3328) · aef96dfa

K authored Oct 07, 2021

* The start of experiments of Jiahang Li on GraphSAINT.

* a nightly build

Check the basic pipeline of the code. Next, check the details of the samplers, the GCN layer (forward propagation), and the loss (backward propagation).

* a nightly build

* Implement GraphSAINT with torch.dataloader

There are still some bugs with sampling in the training procedure.

* Test validity

Succeeded in testing validity on the ppi_node experiments; other setups are not yet tested.
1. Online sampling on the ppi_node experiments performs perfectly.
2. Sampling is a bit slow because of the operations on [dgl.subgraphs]; the next step is to improve this by parallelizing the conversion.
3. Figure out why the offline+online sampling method performs badly, which does not make sense.
4. Run experiments on other setups.

* Implement saint with torch.dataloader

Use torch.dataloader to speed up SAINT sampling in the experiments. Except for Amazon, which is too large, we have run experiments on the other four datasets: ppi, flickr, reddit and yelp. Preliminary results show that the time consumed and the metrics reach an acceptable level. The next step is to use a more accurate profiler, line_profiler, to measure the time consumed, and to adjust num_workers to speed up sampling on certain datasets.

* a nightly build

* Update .gitignore

* reorganize codes

Reorganize some codes and comments.

* a nightly build

* Update .gitignore

* fix bugs

Fix the bugs that kept fully offline sampling and the author's version from working

* reorganize files and codes

Reorganize files and codes then do some experiments to test the performance of offline sampling and online sampling

* do some experiments and update README

* a nightly build

* Update README.md

* delete unnecessary files

* Update README.md

* a nightly update

1. handle directory named 'graphsaintdata'
2. control graph shift between gpu and cpu related to large dataset ('amazon')
3. remove parameter 'train'
4. refine annotations of the sampler
5. update README.md including updating dataset info, dependencies info, etc

* a nightly update

explain config differences in TEST part
remove a sampling time variant
make 'online' an argument
change 'norm' to 'sampler'
explain parameters in README.md

* Update README.md

* a nightly build

* make online an argument
* refine README.md
* refine the code of `collate_fn` in sampler.py: in the training phase it returns only one subgraph, so there is no need to check whether the number of subgraphs is larger than 1

* Update sampler.py

Check that the problem on flickr is caused by overfitting.

* a nightly update

Fix the overfitting problem on the `flickr` dataset. We need to restrict the number of subgraphs (and hence the number of iterations) used in each epoch of the training phase; otherwise the model might overfit by the time it is validated at the end of each epoch. The limit is computed with a formula specified by the author.

* Set up a new flag `full` specifying whether the number of subgraphs used in the training phase equals the number of pre-sampled subgraphs

* Modify code and annotations related to the new flag

* Add a new parameter called `node_budget` in the base class `SAINTSampler` to compute the specific formula

* set `gpu` as a command line argument

* Update README.md

* Finish the experiments on Flickr, which were done after adding the new flag `full`

* a nightly update

* use half of the edges in the original graph for sampling
* test dgl.random.choice with and without replacement on half of the edges
~ next, test whether moving the probability calculation out of __getitem__ speeds up sampling, and try to implement the author's sampling method

* employ cython to implement edge sampling for per edge

* employ cython to implement edge sampling for per edge
* doing experiments to test consumed time and performance
** the time consumed decreased to approximately 480s, but the performance decreased by about 5 points.
* deprecate cython implementation

* Revert "employ cython to implement edge sampling for per edge"

* This reverts commit 4ba4f092
* Deprecate cython implementation
* Reserve half-edges mechanism

* a nightly update

* delete unnecessary annotations
Co-authored-by: Mufei Li <mufeili1996@gmail.com>
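
The commit message above mentions limiting the number of subgraphs used per training epoch through a `full` flag and a `node_budget` parameter, but it does not state the formula. The sketch below is only an illustration under the assumption that the limit is the number of training nodes divided by the node budget, rounded up. The function name and its arguments are hypothetical and are not taken from the DGL example code.

```python
import math


def num_subgraphs_per_epoch(num_train_nodes, node_budget, num_presampled, full=False):
    """Hypothetical helper: how many subgraphs to iterate over in one epoch.

    ASSUMPTION: when `full` is False, the limit is
    ceil(num_train_nodes / node_budget). The commit message only says the
    formula is "specified by the author" and does not spell it out.
    """
    if node_budget <= 0:
        raise ValueError("node_budget must be positive")
    if full:
        # Use every pre-sampled subgraph in each epoch.
        return num_presampled
    # Roughly one pass over the training nodes per epoch.
    return math.ceil(num_train_nodes / node_budget)


if __name__ == "__main__":
    # Illustrative numbers only; these are not measurements from the commit.
    print(num_subgraphs_per_epoch(1000, 300, num_presampled=50))
    print(num_subgraphs_per_epoch(1000, 300, num_presampled=50, full=True))
```

With `full=False` the number of iterations per epoch depends only on the graph size and the budget, which is one way to keep an epoch from revisiting the training nodes many times before validation.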

aef96dfa

23 Sep, 2021 1 commit
- Fix torch import in example (#3372) · 367a3a34
  Junwen Yao authored Sep 22, 2021
  
  367a3a34
21 Sep, 2021 1 commit

[Doc] Added md5sum info for OGB-LSC dataset (#3332) · ac9261b2

Vikram Sharma authored Sep 21, 2021

* Added md5sum for the large dataset files

md5sum helps in validating the correctness of large dataset files once downloaded. 

Refer: https://github.com/snap-stanford/ogb/issues/253
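
As a minimal sketch of the validation step described above, the following uses Python's standard `hashlib` to compute a file's MD5 digest in chunks and compare it with a published checksum. The helper names `md5sum` and `verify_md5` are hypothetical; the file path and expected digest must come from the dataset documentation and are not given here.

```python
import hashlib


def md5sum(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of the file at `path`.

    The file is read in chunks so that large dataset files do not have to
    fit in memory.
    """
    digest = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_md5(path, expected):
    """Return True if the file's MD5 digest matches `expected` (hex string)."""
    return md5sum(path) == expected.strip().lower()
```

A mismatch usually means the download was truncated or corrupted, in which case the file should be downloaded again before use.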

ac9261b2

20 Sep, 2021 1 commit
- Enable faster validation for pytorch graphsage example (#3361) · 01a22144
  nv-dlasalle authored Sep 19, 2021
  
  01a22144
13 Sep, 2021 2 commits

[Model] PCT (#3339) · 3fef5d27

esang authored Sep 13, 2021



* publish pct

* add train_cls

* add readme

* update opt for point transformer

* update the example index

* update for comments
Co-authored-by: Tong He <hetong007@gmail.com>

3fef5d27

[Bugfix] Fix Correct&Smooth (#3329) · 26b63180

skepsun authored Sep 13, 2021



* Update model.py

fix typo

* Update main.py

fix autoscale

* Update README.md
Co-authored-by: Mufei Li <mufeili1996@gmail.com>

26b63180

02 Sep, 2021 1 commit
- Fix distributed device mapping problem. (#3313) · 21a40279
  xiang song(charlie.song) authored Sep 02, 2021
  Co-authored-by: Ubuntu <ubuntu@ip-172-31-2-66.ec2.internal>
  21a40279
27 Aug, 2021 1 commit

[Model] Point transformer (#3284) · 4fb50be4

esang authored Aug 27, 2021



* some modifications for pointnet2

* temporarily save changes

* move files to new directory point_transformer

* implement point transformer for classification

* restore train_cls in pointnet

* implement point transformer for partseg

* fix point transformer for nan loss

* modify point transformer for cls

* modify training setting

* update transformer for cls

* update code

* update code for latest performance

* update the example index

* some minor changes
Co-authored-by: Tong He <hetong007@gmail.com>

4fb50be4

23 Aug, 2021 1 commit
- fix relgraphconv bug (#3256) · b4cd60a9
  Quan (Andy) Gan authored Aug 23, 2021
  
  b4cd60a9
20 Aug, 2021 1 commit

GeniePath model add a Tanh. (#3269) · f5b410b7

Peiqi Yin authored Aug 20, 2021



* Update model.py

* Update README.md
Co-authored-by: Zihao Ye <expye@outlook.com>

f5b410b7

19 Aug, 2021 1 commit

[Model] add model example EvolveGCN. (#3190) · ea06688e

maqy authored Aug 19, 2021



* add evolveGCN example

* small fix

* fix defect

* fix defect
Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>

ea06688e

16 Aug, 2021 1 commit
- [Model] Fix diffpool loss (#3233) · 68c0cfbb
  Peiqi Yin authored Aug 16, 2021
  
  68c0cfbb
11 Aug, 2021 1 commit

[example] Create EEG-GCNN example. (#3186) · 738b75f4

JOHNW02 authored Aug 11, 2021

* Create EEG-GCNN example.

* Update README.md

* Remove gitignore file.

* Update README.md

* change 'datas' to 'datasets'.

* Change train.py to main.py

* Added an entry in the indexing page.

* State "simplified version"; change how to run.

* Fix bug in contact

* Remove paper link in reference.

* Create working branch

* Add normalization of x.

* Update paper link and tags

* Update paper link in readme

* Update readme; add patient level indices

* Update readme. Add comments to models

* Update README.md

* change to with; specify location for ch and el; move note

* fix bug for note

* Add args for models; clean code.

* delete = in readme

* Add reference for spec_coh_values

738b75f4

02 Aug, 2021 1 commit

[Feature] Add removed edges in distributed graph partitioning to handle heterogeneous graph (#3137) · b1319200

Ankit Garg authored Aug 03, 2021



* Added code to rectify (TypeError: unhashable type: 'slice') when copying file

* 1) added distributed preprocessing code to create ParMetis input from CSV files
2) added code to run pm_dglpart on multiple machines
3) added support for recreating a heterogeneous graph from a homogeneous graph based on dropped edges, as ParMetis currently only supports homogeneous graphs

* move to pandas

* Added comments and remove drop_duplicates as it was redundant

* Addressed PR comments

* Rename variable

* Added comment

* Added comment

* updated README
Co-authored-by: Ankit Garg <gaank@amazon.com>
Co-authored-by: Da Zheng <zhengda1936@gmail.com>

b1319200

30 Jul, 2021 2 commits

[model] add model example GeniePath (#3199) · 39764da4

Kay Liu authored Jul 30, 2021



* [model] add model example GeniePath

* improvements based on feedback

* improvements based on feedback
Co-authored-by: zhjwy9343 <6593865@qq.com>

39764da4

fix (#3167) · 6f93c6aa

KounianhuaDu authored Jul 30, 2021


Co-authored-by: xiang song(charlie.song) <classicxsong@gmail.com>
Co-authored-by: zhjwy9343 <6593865@qq.com>

6f93c6aa

29 Jul, 2021 1 commit

[Model] add model example CARE-GNN (#3187) · a107993f

Kay Liu authored Jul 29, 2021



* [Model] add model example CARE-GNN

* update README

* improvements based on the review feedback

* fix missing item()
Co-authored-by: zhjwy9343 <6593865@qq.com>

a107993f

28 Jul, 2021 2 commits

[Doc] Fix WeightBasis documentation (#3189) · 2583ec59
Jinjing Zhou authored Jul 28, 2021
* fix

* fix type
Co-authored-by: Minjie Wang <wmjlyjemaine@gmail.com>
2583ec59

[New Feature] Per edge type sampler for to_homogeneous graphs. (#3131) · ba7e7cf9

xiang song(charlie.song) authored Jul 28, 2021



* fix.

* fix.

* fix.

* fix.

* Fix test

* Deprecate old DistEmbedding impl, use synchronized embedding impl

* Basic implementation of heterogeneous sampling on homogeneous graphs

* make pass

* Pass C++ test

* Add python test code

* lint

* lint

* Add MultiLayerEtypeNeighborSampler

* Add unit test for single machine dataloader

* Add dist dataloader test for edge type sampler

* Fix lint

* fix

* support for per etype sample

* Fix some bugs and enable distributed training with per edge sample

* fix

* Now distributed training works

* turn off some mxnet

* turn off mxnet for some dist test

* fix

* upd

* upd according to the comments

* Fix

* Fix test and now distributed works.

* upd

* upd

* Fix

* Fix bug

* remove dead code.

* upd

* Fix

* upd

* Fix
Co-authored-by: Ubuntu <ubuntu@ip-172-31-71-112.ec2.internal>
Co-authored-by: Ubuntu <ubuntu@ip-172-31-2-66.ec2.internal>
Co-authored-by: Da Zheng <zhengda1936@gmail.com>

ba7e7cf9

24 Jul, 2021 2 commits

[Model] add model example GCN-based Anti-Spam (#3145) · 59a7d0d1

Kay Liu authored Jul 24, 2021



* add model example GCN-based Anti-Spam

* update example index

* add usage info

* improvements as per comments

* fix image invisible problem

* add image file
Co-authored-by: zhjwy9343 <6593865@qq.com>

59a7d0d1

train test on face use concat bce (#3180) · c0719ec5
Tianjun Xiao authored Jul 24, 2021

c0719ec5

23 Jul, 2021 1 commit
- change numbers to concat bce (#3175) · 65ecbb94
  Tianjun Xiao authored Jul 23, 2021
  
  65ecbb94
20 Jul, 2021 1 commit
- [Doc] Update the example folder README · 9d56d386
  Minjie Wang authored Jul 20, 2021
  
  9d56d386
17 Jul, 2021 1 commit

[Distributed] Distributed heterograph training (#3069) · 34426a98

Da Zheng authored Jul 17, 2021



* support hetero RGCN.

* fix.

* simplify code.

* sample_neighbors return heterograph directly.

* avoid using to_heterogeneous.

* compute canonical etypes in advance.

* fix tests.

* fix.

* fix distributed data loader for heterograph.

* use NodeDataLoader.

* fix bugs in partitioning on heterogeneous graphs.

* fix lint.

* fix tests.

* fix.

* fix.

* fix bugs.

* fix tests.

* fix.

* enable coo for distributed.

* fix.

* fix.

* fix.

* fix.

* fix.
Co-authored-by: Ubuntu <ubuntu@ip-172-31-71-112.ec2.internal>
Co-authored-by: Zheng <dzzhen@3c22fba32af5.ant.amazon.com>

34426a98

16 Jul, 2021 1 commit

[Feature][Performance][GPU] Introducing UnifiedTensor for efficient zero-copy... · 905c0aa5

David Min authored Jul 17, 2021

[Feature][Performance][GPU] Introducing UnifiedTensor for efficient zero-copy host memory access from GPU (#3086)

* Add pytorch-direct version

* Initial commit of unified tensor

* Merge branch 'master' of https://github.com/davidmin7/dgl



* Remove unnecessary things

* Fix error message

* Fix/Add descriptions

* whitespace fix

* add unpin

* disable IndexSelectCPUFromGPU with no CUDA

* add a newline for unified_tensor.py

* Apply changes based on feedback

* add 'os' module

* skip unified tensor unit test for cpu only

* Update tests/pytorch/test_unified_tensor.py
Co-authored-by: xiang song(charlie.song) <classicxsong@gmail.com>

* reflect feedback
Co-authored-by: shhssdm <shhssdm@gmail.com>
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>
Co-authored-by: xiang song(charlie.song) <classicxsong@gmail.com>

905c0aa5

15 Jul, 2021 2 commits

fix tags (#3140) · 88f20eec
Jinjing Zhou authored Jul 15, 2021
Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>
88f20eec

[Bug fix] Various fix from bug bash (#3133) · 3f6f6941

Mufei Li authored Jul 15, 2021



* Update

* Update

* Update dependencies

* Update

* Update

* Fix ogbn-products gat

* Update

* Update

* Reformat

* Fix typo in node2vec_random_walk

* Specify file encoding

* Working for 6.7

* Update

* Fix subgraph

* Fix doc for sample_neighbors_biased

* Fix hyperlink

* Add example for udf cross reducer

* Fix

* Add example for slice_batch

* Replace dgl.bipartite

* Fix GATConv

* Fix math rendering

* Fix doc
Co-authored-by: Ubuntu <ubuntu@ip-172-31-28-17.us-west-2.compute.internal>
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>
Co-authored-by: Ubuntu <ubuntu@ip-172-31-22-156.us-west-2.compute.internal>

3f6f6941

13 Jul, 2021 3 commits

[Distributed] Deprecate old DistEmbedding impl, use synchronized embedding impl (#3111) · d7390763

xiang song(charlie.song) authored Jul 14, 2021



* fix.

* fix.

* fix.

* fix.

* Fix test

* Deprecate old DistEmbedding impl, use synchronized embedding impl

* update doc
Co-authored-by: Ubuntu <ubuntu@ip-172-31-71-112.ec2.internal>
Co-authored-by: Ubuntu <ubuntu@ip-172-31-2-66.ec2.internal>
Co-authored-by: Da Zheng <zhengda1936@gmail.com>
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>

d7390763

Update a comment in pytorch HGT example (#3101) · ee6bc951

Wilfried L. Bounsi authored Jul 13, 2021


Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>

ee6bc951

[Feature] Add left normalizer for GCN (#3114) · b576e617
Quan (Andy) Gan authored Jul 13, 2021
* add left normalizer for gcn

* fix

* fixes and some bug stuff
b576e617

12 Jul, 2021 3 commits

[BugFix] fix problems found in bug bash (#3116) · 0ce92a86

Kay Liu authored Jul 12, 2021



* fix breakline in fakenews.py

* fix inconsistent argument name

* modify incorrect example and deprecated graph type

* modify docstring and example in knn_graph

* fix incorrect node type
Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>

0ce92a86

fix docs (#3126) · 36418292
Jinjing Zhou authored Jul 12, 2021
Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>

36418292

Fix module name (#3128) · fc55225f
Tomohiro Endo authored Jul 12, 2021

fc55225f

09 Jul, 2021 1 commit

[Performance] Add a warning for ChebConv (#3099) · 5798ee8d

Quan (Andy) Gan authored Jul 09, 2021



* add a warning for chebconv

* fix and docstrings

* update bgnn

* fix
Co-authored-by: Minjie Wang <wmjlyjemaine@gmail.com>
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>

5798ee8d

07 Jul, 2021 1 commit
- [bugfix] fix a bug found in v0.7 bug bash in the 'grace' model (#3109) · 0d1dcdcd
  Hengrui Zhang authored Jul 07, 2021
  * Update aug.py

  * Update aug.py
  Co-authored-by: Mufei Li <mufeili1996@gmail.com>
  0d1dcdcd
06 Jul, 2021 1 commit
- [Example] fix faiss issue on k 3 for hilander model (#3105) · 22e8c120
  Tianjun Xiao authored Jul 06, 2021
  * fix faiss issue on k 3

  * Add Hilander paper link
  Co-authored-by: Tong He <hetong007@gmail.com>
  22e8c120
04 Jul, 2021 1 commit

[Model] Official implementation for HiLANDER model. (#3087) · 9c41c22d

Tong He authored Jul 04, 2021



* add hilander model implementation draft

* use focal loss

* fix

* change data root

* add necessary scripts

* update download links

* update

* update example table

* fix

* update readme with numbers

* add empty folder

* only eval at the end

* set up hilander

* inform results may fluctuate

* address comments
Co-authored-by: sneakerkg <xiaotj1990327@gmail.com>
Co-authored-by: Ubuntu <ubuntu@ip-172-31-19-212.us-east-2.compute.internal>

9c41c22d

02 Jul, 2021 1 commit

[Distributed] Fix bugs in partitioning on heterogeneous graphs. (#3085) · 0884d024

Da Zheng authored Jul 02, 2021



* fix bugs in partitioning on heterogeneous graphs.

* fix.

* fix.

* fix example.

* fix.

* fix test.

* fix.

* fix.

* fix.

* fix tests.
Co-authored-by: Ubuntu <ubuntu@ip-172-31-71-112.ec2.internal>
Co-authored-by: Zheng <dzzhen@3c22fba32af5.ant.amazon.com>

0884d024

29 Jun, 2021 1 commit

[Example][Bugfix] Fix RGCN example datasplitting (#3037) · 02e5d47d

nv-dlasalle authored Jun 28, 2021



* Fix example datasplitting

* Remove left-over split parameter

* Remove unused parameter

* Set epoch when using multiple GPUs
Co-authored-by: xiang song(charlie.song) <classicxsong@gmail.com>

02e5d47d

28 Jun, 2021 1 commit
- cuda() to to(device) (#3064) · 7f4086da
  Tomohiro Endo authored Jun 28, 2021
  Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>
  7f4086da