Commits · 701b4fccc2eed979ae3db801fabb6bf7bc03940c · OpenDAS / dgl

30 Jan, 2022 1 commit

[Sampling] New sampling pipeline plus asynchronous prefetching (#3665) · 701b4fcc

Quan (Andy) Gan authored Jan 30, 2022

* initial update

* more

* more

* multi-gpu example

* cluster gcn, finalize homogeneous

* more explanation

* fix

* bunch of fixes

* fix

* RGAT example and more fixes

* shadow-gnn sampler and some changes in unit test

* fix

* wth

* more fixes

* remove shadow+node/edge dataloader tests for possible ux changes

* lints

* add legacy dataloading import just in case

* fix

* update pylint for f-strings

* fix

* lint

* lint

* lint again

* cherry-picking commit fa9f494

* oops

* fix

* add sample_neighbors in dist_graph

* fix

* lint

* fix

* fix

* fix

* fix tutorial

* fix

* fix

* fix

* fix warning

* remove debug

* add get_foo_storage apis

* lint

701b4fcc

25 Jan, 2022 1 commit

feature: add a parse parameter degree_as_nlabel for pytorch-gin demo (#3676) · 8f99b131

PengZhang authored Jan 25, 2022

* feature: add a parse parameter degree_as_nlabel for pytorch-gin demo

* fix some typo

* [fix]: allow to benchmark all of the 9 dataset.

* [Feature] add epoch number to log

* [Feature]:simply list the command lines for all datasets (https://github.com/dmlc/dgl/pull/3676#discussion_r790270705

) and run a test.

* Update README.md
Co-authored-by: Ubuntu <ubuntu@ip-172-31-10-175.ap-northeast-1.compute.internal>
Co-authored-by: Mufei Li <mufeili1996@gmail.com>

8f99b131

23 Jan, 2022 1 commit
- [Bugfix] Improve CompGCN (#3663) · 9a6b81ef
  nxznm authored Jan 23, 2022
```
Co-authored-by: Mufei Li <mufeili1996@gmail.com>
```
  9a6b81ef
21 Jan, 2022 1 commit
- [Example] fix auc in caregnn example (#3647) · ed4134ed
  Zekuan (Kay) Liu authored Jan 21, 2022
```
Co-authored-by: zhjwy9343 <6593865@qq.com>
```
  ed4134ed
20 Jan, 2022 1 commit
- Update examples/pytorch/graphsage/experimental/README.md · 14ab462f
  Da Zheng authored Jan 20, 2022
```
Co-authored-by: Minjie Wang <minjie.wang@nyu.edu>
```
  14ab462f
19 Jan, 2022 1 commit
- fix. · 3b1978a3
  Da Zheng authored Jan 19, 2022
  
  3b1978a3
15 Jan, 2022 1 commit

fix. (#3652) · 8d14a739

Da Zheng authored Jan 14, 2022


Co-authored-by: Ubuntu <ubuntu@ip-172-31-30-164.us-west-2.compute.internal>

8d14a739

24 Dec, 2021 1 commit

Add 'nccl' backend in train_dist.py and fix pad_data function cuda bug (#3607) · 4889c578

xcwan authored Dec 24, 2021



* Add nccl backend  and fix pad_data function cuda bug

* Update train_dist.py

* Update train_dist.py
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>

4889c578

20 Dec, 2021 2 commits

[Example] An example that re-creates the PyG OGB performance on ogbnmag (#3563) · 42897c36

Zak Jost authored Dec 20, 2021



* Adding initial files of example

* Removing old timing code

* Improving doc strings and fixing some minor bugs

* Merging from upstream and addressing PR comments
Co-authored-by: zakjost <jostza@amazon.com>
Co-authored-by: Mufei Li <mufeili1996@gmail.com>

42897c36

Update train_dist.py (#3594) · 421c3622
Jinjing Zhou authored Dec 20, 2021
```
Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>
```
421c3622

15 Dec, 2021 1 commit

[DistGNN, Graph partitioning] Libra partition (#3376) · 78e0dae6

Vasimuddin Md authored Dec 15, 2021



* added distgnn plus libra codebase

* Dist application codes

* added comments in partition code. changed the interface of partitioning call.

* updated readme

* create libra partitioning branch for the PR

* removed disgnn files for first PR

* updated kernel.cc

* added libra_partition.cc and moved libra code from kernel.cc to libra_partition.cc

* fixed lint error; merged libra2dgl.py and main_Libra.py to libra_partition.py; added graphsage/distgnn folder and partition script.

* removed libra2dgl.py

* fixed the lint error and cleaned the code.

* revisions due to PR comments. added distgnn/tools contains partitions routines

* update 2 PR revision I

* fixed errors; also improved the runtime by 10x.

* fixed minor lint error

* fixed some more lints

* PR revision II changed the interface of libra partition function

* rewrite docstring
Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>

78e0dae6

06 Dec, 2021 1 commit
- Fix for distributed training (#3542) · 987db374
  Jinjing Zhou authored Dec 06, 2021
```
* tmp fix

* add description
```
  987db374
30 Nov, 2021 1 commit

[Model] RGCN with new heterograph API (#3025) · 490c5a8d

Israt Nisa authored Nov 30, 2021



* rgcn with new heterograph API

* added new apply_edge()

* optimized forward pass

* renaming from *hetero to *heteroAPI
Co-authored-by: Israt Nisa <nisisrat@amazon.com>
Co-authored-by: Da Zheng <zhengda1936@gmail.com>

490c5a8d

29 Nov, 2021 1 commit
- Fix tgn example (#3543) · da53275a
  Jinjing Zhou authored Nov 29, 2021
  
  da53275a
24 Nov, 2021 1 commit
- [BugFix] fix dimension unmatch issue and legacy issue of torchtext (#3539) · cd6d1138
  Rhett Ying authored Nov 24, 2021
  
  cd6d1138
23 Nov, 2021 1 commit
- [Bugfix] issue #3527 (#3528) · 4bf70f09
  Harsh Sinha authored Nov 22, 2021
```
* Fix issue 3527

* Changed default device

* Added g to device
```
  4bf70f09
19 Nov, 2021 1 commit
- [NN] JumpingKnowledge (#3512) · 9e7fbf95
  Mufei Li authored Nov 19, 2021
```
* Update

* Fix
```
  9e7fbf95
17 Nov, 2021 1 commit

[Examples] RGCN Heterogeneous on ogbn-mag (#3371) · 81915f55

Krzysztof Sadowski authored Nov 17, 2021



* upload

* cleanup of unused code

* default gpu training/inference

* layer norm instead of batch norm

* fix for default inference mode

* simplified embedding forward method
Co-authored-by: xiang song(charlie.song) <classicxsong@gmail.com>
Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>
Co-authored-by: Mufei Li <mufeili1996@gmail.com>

81915f55

16 Nov, 2021 1 commit

[Sampling] Cluster-GCN and ShaDow-GNN DataLoader (#3487) · b8ce0f41

Quan (Andy) Gan authored Nov 16, 2021

* first commit

* next commit

* third commit

* add ShaDow-GNN sampler and unit tests

* fixes

* lint

* cr*p

* lint

* fix lint

* fixes and more unit tests

* more tests

* fix docs

* lint

* fix

* fix

* fix

* fixes

* fix doc

b8ce0f41

10 Nov, 2021 1 commit

[BugFix] fix #3429 and update results of caregnn (#3441) · 7c771d0d

Yuchen authored Nov 10, 2021



* squeeze node labels in FraudDataset

* fix RLModule

* update results in README.md

* fix KeyError in full graph training
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>

7c771d0d

08 Nov, 2021 2 commits

[Model] Lda subgraph (#3206) · fe6e01ad

yifeim authored Nov 08, 2021



* add word_ids and simplify

* simplify

* add word_ids to be removed later

* remove word_ids

* seems to work

* tweak

* transpose word_z

* add word_ids example

* check api compatibility

* improve compatibility

* update doc

* tweak verbose

* restore word_z layout; tweak

* tweak

* tweak doc

* word_cT

* use log_weight and some other tweaks

* rewrite README

* update equations

* rewrite for clarity and pass tests

* tweak

* bugfix import

* fix unit test

* fix mult to be the same as old versions

* tweak

* could be a bugfix

* 0/0=nan

* add doc_subgraph utility function

* minor cache optimization

* minor cache tweak

* add environmental variable to trade cache speed for memory

* update README

* tweak

* add sparse update pass unit test

* simplify sparse update

* improve low-memory efficiency

* tweak

* add sample expectation scores to allow resampling

* simplify

* update comment

* avoid edge cases

* bugfix pred scores

* simplify

* add save function
Co-authored-by: Yifei Ma <yifeim@amazon.com>
Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>

fe6e01ad

Remove self-loops and duplicate edges before ParMETIS and restore when... · 2a757d4a

Rhett Ying authored Nov 08, 2021


Remove self-loops and duplicate edges before ParMETIS and restore when converting to DGLGraph (#3472)

* save self-loops and duplicated edges separately.

* [BugFix] sort graph by dgl.ETYPE

* fix bugs in verify script

* fix verify logic

* refine README
Co-authored-by: Da Zheng <zhengda1936@gmail.com>

2a757d4a

03 Nov, 2021 1 commit

[NN][Model] GATv2 (#3473) · e2f33fd5

Shaked Brody authored Nov 03, 2021



* [Model][Core] GATv2

* lint

* gatv2conv.py

* lint

* lint

* style and docs

* lint

* gatv2conv fix
Co-authored-by: Shaked Brody shakedbr@campus.technion.ac.il <shakedbr@tangerine.cslcs.technion.ac.il>
Co-authored-by: Mufei Li <mufeili1996@gmail.com>

e2f33fd5

26 Oct, 2021 1 commit
- Update README.md (#3442) · 579cd3eb
  Hongyu Cai authored Oct 26, 2021
  
  579cd3eb
21 Oct, 2021 1 commit

[Sampling] Implement dgl.compact_graphs() for the GPU (#3423) · a8c81018

Xin Yao authored Oct 21, 2021

* gpu compact graph template

* cuda compact graph draft

* fix typo

* compact graphs

* pass unit test but fail in training

* example using EdgeDataLoader on the GPU

* refactor cuda_compact_graph and cuda_to_block

* update training scripts

* fix linting

* fix linting

* fix exclude_edges for the GPU

* add --data-cpu & fix copyright

a8c81018

18 Oct, 2021 1 commit
- [Bugfix][Pytorch] Fix model save and load bug of stgcn_wave (#3303) · f7039418
  HaoWei-TomTom authored Oct 18, 2021
```
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>
```
  f7039418
14 Oct, 2021 1 commit

[Fix] Use ==/!= to compare constant literals (str, bytes, int, float, tuple) (#3415) · 04ed6126

Christian Clauss authored Oct 14, 2021

* Use ==/!= to compare constant literals (str, bytes, int, float, tuple)

Avoid Syntax Warnings on Python >= 3.8

$ `python3`
```
>>> "" == ""
True
>>> "" is ""
<stdin>:1: SyntaxWarning: "is" with a literal. Did you mean "=="?
True
```

* Use ==/!= to compare constant literals (str, bytes, int, float, tuple)

04ed6126

07 Oct, 2021 1 commit

[Model] Refine GraphSAINT (#3328) · aef96dfa

K authored Oct 07, 2021

* The start of experiments of Jiahang Li on GraphSAINT.

* a nightly build

Check the basic pipeline of codes. Next to check the details of samplers , GCN layer (forward propagation) and loss (backward propagation)

* a night build

* Implement GraphSAINT with torch.dataloader

There're still some bugs with sampling in training procedure

* Test validity

Succeed in testing validity on ppi_node experiments without testing other setup.
1. Online sampling on ppi_node experiments performs perfectly.
2. Sampling speed is a bit slow because the operations on [dgl.subgraphs], next step is to improve this part by putting the conversion into parallelism
3. Figuring out why offline+online sampling method performs bad, which does not make sense
4. Doing experiments on other setup

* Implement saint with torch.dataloader

Use torch.dataloader to speed up saint sampling with experiments. Except experiments on too large dataset Amazon, we've done some experiments on other four datasets including ppi, flickr, reddit and yelp. Preliminary experimental results show consumed time and metrics reach not bad level. Next step is to employ more accurate profiler which is the line_profiler to test consumed period, and adjust num_workers to speed up sampling procedures on same certain datasets faster.

* a nightly build

* Update .gitignore

* reorganize codes

Reorganize some codes and comments.

* a nightly build

* Update .gitignore

* fix bugs

Fix bugs about why fully offline sampling and author's version don't work

* reorganize files and codes

Reorganize files and codes then do some experiments to test the performance of offline sampling and online sampling

* do some experiments and update README

* a nightly build

* Update README.md

* delete unnecessary files

* Update README.md

* a nightly update

1. handle directory named 'graphsaintdata'
2. control graph shift between gpu and cpu related to large dataset ('amazon')
3. remove parameter 'train'
4. refine annotations of the sampler
5. update README.md including updating dataset info, dependencies info, etc

* a nightly update

explain config differences in TEST part
remove a sampling time variant
make 'online' an argument
change 'norm' to 'sampler'
explain parameters in README.md

* Update README.md

* a nightly build

* make online an argument
* refine README.md
* refine codes of `collate_fn` in sampler.py, in training phase only return one subgraph, no need to check if the number of subgraphs larger than 1

* Update sampler.py

check the problem on flickr is about overfitting.

* a nightly update

Fix the overfitting problem of `flickr` dataset. We need to restrict the number of subgraphs (also the number of iterations) used in each epoch of training phase. Or it might overfit when validating at the end of each epoch. The method to limit the number is a formula specified by the author.

* Set up a new flag `full` specifying if the number of subgraphs used in training phase equals to that of pre-sampled subgraphs

* Modify codes and annotations related the new flag

* Add a new parameter called `node_budget` in the base class `SAINTSampler` to compute the specific formula

* set `gpu` as a command line argument

* Update README.md

* Finish the experiments on Flickr, which is done after adding new flag `full`

* a nightly update

* use half of edges in the original graph to do sampling
* test dgl.random.choice with or without replacement with half of edges
~ next is to test what if put the calculating probability part out of __getitem__ can speed up sampling and try to implement sampling method of author

* employ cython to implement edge sampling for per edge

* employ cython to implement edge sampling for per edge
* doing experiments to test consumed time and performance
** the consumed time decreased to approximately 480s, the performance decrease about 5 points.
* deprecate cython implementation

* Revert "employ cython to implement edge sampling for per edge"

* This reverts commit 4ba4f092
* Deprecate cython implementation
* Reserve half-edges mechanism

* a nightly update

* delete unnecessary annotations
Co-authored-by: Mufei Li <mufeili1996@gmail.com>

aef96dfa

23 Sep, 2021 1 commit
- Fix torch import in example (#3372) · 367a3a34
  Junwen Yao authored Sep 22, 2021
  
  367a3a34
21 Sep, 2021 1 commit

[Doc] Added md5sum info for OGB-LSC dataset (#3332) · ac9261b2

Vikram Sharma authored Sep 21, 2021

* Added md5sum for the large dataset files

md5sum helps in validating the correctness of large dataset files once downloaded. 

Refer: https://github.com/snap-stanford/ogb/issues/253

ac9261b2

20 Sep, 2021 1 commit
- Enable faster validation for pytorch graphsage example (#3361) · 01a22144
  nv-dlasalle authored Sep 19, 2021
  
  01a22144
13 Sep, 2021 2 commits

[Model] PCT (#3339) · 3fef5d27

esang authored Sep 13, 2021



* publish pct

* add train_cls

* add readme

* update opt for point transformer

* update the example index

* update for comments
Co-authored-by: Tong He <hetong007@gmail.com>

3fef5d27

[Bugfix] Fix Correct&Smooth (#3329) · 26b63180

skepsun authored Sep 13, 2021



* Update model.py

fix typo

* Update main.py

fix autoscale

* Update README.md
Co-authored-by: Mufei Li <mufeili1996@gmail.com>

26b63180

02 Sep, 2021 1 commit
- Fix distributed device mapping problem. (#3313) · 21a40279
  xiang song(charlie.song) authored Sep 02, 2021
```
Co-authored-by: Ubuntu <ubuntu@ip-172-31-2-66.ec2.internal>
```
  21a40279
27 Aug, 2021 1 commit

[Model] Point transformer (#3284) · 4fb50be4

esang authored Aug 27, 2021



* some modifications for pointnet2

* temporarily save changes

* move files to new directory point_transformer

* implement point transformer for classification

* restore train_cls in pointnet

* implement point transformer for partseg

* fix point transformer for nan loss

* modify point transformer for cls

* modify training setting

* update transformer for cls

* update code

* update code for latest performance

* update the example index

* some minor changes
Co-authored-by: Tong He <hetong007@gmail.com>

4fb50be4

23 Aug, 2021 1 commit
- fix relgraphconv bug (#3256) · b4cd60a9
  Quan (Andy) Gan authored Aug 23, 2021
  
  b4cd60a9
20 Aug, 2021 1 commit

GeniePath model add a Tanh. (#3269) · f5b410b7

Peiqi Yin authored Aug 20, 2021



* Update model.py

* Update README.md
Co-authored-by: Zihao Ye <expye@outlook.com>

f5b410b7

19 Aug, 2021 1 commit

[Model] add model example EvolveGCN. (#3190) · ea06688e

maqy authored Aug 19, 2021



* add evolveGCN example

* small fix

* fix defect

* fix defect
Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>

ea06688e

16 Aug, 2021 1 commit
- [Model] Fix diffpool loss (#3233) · 68c0cfbb
  Peiqi Yin authored Aug 16, 2021
  
  68c0cfbb
11 Aug, 2021 1 commit

[example] Create EEG-GCNN example. (#3186) · 738b75f4

JOHNW02 authored Aug 11, 2021

* Create EEG-GCNN example.

* Update README.md

* Remove gitignore file.

* Update README.md

* change 'datas' to 'datasets'.

* Change train.py to main.py

* Added an entry in the indexing page.

* State "simplified version"; change how to run.

* Fix bug in contact

* Remove paper link in reference.

* Create working branch

* Add normalization of x.

* Update paper link and tags

* Update paper link in readme

* Update readme; add patient level indices

* Update readme. Add comments to models

* Update README.md

* change to with; specify location for ch and el; move note

* fix bug for note

* Add args for models; clean code.

* delete = in readme

* Add reference for spec_coh_values

738b75f4