1. 30 Jan, 2022 1 commit
    • Quan (Andy) Gan's avatar
      [Sampling] New sampling pipeline plus asynchronous prefetching (#3665) · 701b4fcc
      Quan (Andy) Gan authored
      * initial update
      
      * more
      
      * more
      
      * multi-gpu example
      
      * cluster gcn, finalize homogeneous
      
      * more explanation
      
      * fix
      
      * bunch of fixes
      
      * fix
      
      * RGAT example and more fixes
      
      * shadow-gnn sampler and some changes in unit test
      
      * fix
      
      * wth
      
      * more fixes
      
      * remove shadow+node/edge dataloader tests for possible ux changes
      
      * lints
      
      * add legacy dataloading import just in case
      
      * fix
      
      * update pylint for f-strings
      
      * fix
      
      * lint
      
      * lint
      
      * lint again
      
      * cherry-picking commit fa9f494
      
      * oops
      
      * fix
      
      * add sample_neighbors in dist_graph
      
      * fix
      
      * lint
      
      * fix
      
      * fix
      
      * fix
      
      * fix tutorial
      
      * fix
      
      * fix
      
      * fix
      
      * fix warning
      
      * remove debug
      
      * add get_foo_storage apis
      
      * lint
      701b4fcc
  2. 25 Jan, 2022 1 commit
  3. 23 Jan, 2022 1 commit
  4. 21 Jan, 2022 1 commit
  5. 20 Jan, 2022 1 commit
  6. 19 Jan, 2022 1 commit
  7. 15 Jan, 2022 1 commit
  8. 24 Dec, 2021 1 commit
  9. 20 Dec, 2021 2 commits
  10. 15 Dec, 2021 1 commit
    • Vasimuddin Md's avatar
      [DistGNN, Graph partitioning] Libra partition (#3376) · 78e0dae6
      Vasimuddin Md authored
      
      
      * added distgnn plus libra codebase
      
      * Dist application codes
      
      * added comments in partition code. changed the interface of partitioning call.
      
      * updated readme
      
      * create libra partitioning branch for the PR
      
      * removed disgnn files for first PR
      
      * updated kernel.cc
      
      * added libra_partition.cc and moved libra code from kernel.cc to libra_partition.cc
      
      * fixed lint error; merged libra2dgl.py and main_Libra.py to libra_partition.py; added graphsage/distgnn folder and partition script.
      
      * removed libra2dgl.py
      
      * fixed the lint error and cleaned the code.
      
      * revisions due to PR comments. added distgnn/tools contains partitions routines
      
      * update 2 PR revision I
      
      * fixed errors; also improved the runtime by 10x.
      
      * fixed minor lint error
      
      * fixed some more lints
      
      * PR revision II changed the interface of libra partition function
      
      * rewrite docstring
      Co-authored-by: default avatarQuan (Andy) Gan <coin2028@hotmail.com>
      78e0dae6
  11. 06 Dec, 2021 1 commit
  12. 30 Nov, 2021 1 commit
  13. 29 Nov, 2021 1 commit
  14. 24 Nov, 2021 1 commit
  15. 23 Nov, 2021 1 commit
  16. 19 Nov, 2021 1 commit
  17. 17 Nov, 2021 1 commit
  18. 16 Nov, 2021 1 commit
  19. 10 Nov, 2021 1 commit
  20. 08 Nov, 2021 2 commits
    • yifeim's avatar
      [Model] Lda subgraph (#3206) · fe6e01ad
      yifeim authored
      
      
      * add word_ids and simplify
      
      * simplify
      
      * add word_ids to be removed later
      
      * remove word_ids
      
      * seems to work
      
      * tweak
      
      * transpose word_z
      
      * add word_ids example
      
      * check api compatibility
      
      * improve compatibility
      
      * update doc
      
      * tweak verbose
      
      * restore word_z layout; tweak
      
      * tweak
      
      * tweak doc
      
      * word_cT
      
      * use log_weight and some other tweaks
      
      * rewrite README
      
      * update equations
      
      * rewrite for clarity and pass tests
      
      * tweak
      
      * bugfix import
      
      * fix unit test
      
      * fix mult to be the same as old versions
      
      * tweak
      
      * could be a bugfix
      
      * 0/0=nan
      
      * add doc_subgraph utility function
      
      * minor cache optimization
      
      * minor cache tweak
      
      * add environmental variable to trade cache speed for memory
      
      * update README
      
      * tweak
      
      * add sparse update pass unit test
      
      * simplify sparse update
      
      * improve low-memory efficiency
      
      * tweak
      
      * add sample expectation scores to allow resampling
      
      * simplify
      
      * update comment
      
      * avoid edge cases
      
      * bugfix pred scores
      
      * simplify
      
      * add save function
      Co-authored-by: default avatarYifei Ma <yifeim@amazon.com>
      Co-authored-by: default avatarQuan (Andy) Gan <coin2028@hotmail.com>
      Co-authored-by: default avatarJinjing Zhou <VoVAllen@users.noreply.github.com>
      fe6e01ad
    • Rhett Ying's avatar
      Remove self-loops and duplicate edges before ParMETIS and restore when... · 2a757d4a
      Rhett Ying authored
      
      Remove self-loops and duplicate edges before ParMETIS and restore when converting to DGLGraph (#3472)
      
      * save self-loops and duplicated edges separately.
      
      * [BugFix] sort graph by dgl.ETYPE
      
      * fix bugs in verify script
      
      * fix verify logic
      
      * refine README
      Co-authored-by: default avatarDa Zheng <zhengda1936@gmail.com>
      2a757d4a
  21. 03 Nov, 2021 1 commit
  22. 26 Oct, 2021 1 commit
  23. 21 Oct, 2021 1 commit
    • Xin Yao's avatar
      [Sampling] Implement dgl.compact_graphs() for the GPU (#3423) · a8c81018
      Xin Yao authored
      * gpu compact graph template
      
      * cuda compact graph draft
      
      * fix typo
      
      * compact graphs
      
      * pass unit test but fail in training
      
      * example using EdgeDataLoader on the GPU
      
      * refactor cuda_compact_graph and cuda_to_block
      
      * update training scripts
      
      * fix linting
      
      * fix linting
      
      * fix exclude_edges for the GPU
      
      * add --data-cpu & fix copyright
      a8c81018
  24. 18 Oct, 2021 1 commit
  25. 14 Oct, 2021 1 commit
  26. 07 Oct, 2021 1 commit
    • K's avatar
      [Model] Refine GraphSAINT (#3328) · aef96dfa
      K authored
      
      
      * The start of experiments of Jiahang Li on GraphSAINT.
      
      * a nightly build
      
      * a nightly build
      
      Check the basic pipeline of codes. Next to check the details of samplers , GCN layer (forward propagation) and loss (backward propagation)
      
      * a night build
      
      * Implement GraphSAINT with torch.dataloader
      
      There're still some bugs with sampling in training procedure
      
      * Test validity
      
      Succeed in testing validity on ppi_node experiments without testing other setup.
      1. Online sampling on ppi_node experiments performs perfectly.
      2. Sampling speed is a bit slow because the operations on [dgl.subgraphs], next step is to improve this part by putting the conversion into parallelism
      3. Figuring out why offline+online sampling method performs bad, which does not make sense
      4. Doing experiments on other setup
      
      * Implement saint with torch.dataloader
      
      Use torch.dataloader to speed up saint sampling with experiments. Except experiments on too large dataset Amazon, we've done some experiments on other four datasets including ppi, flickr, reddit and yelp. Preliminary experimental results show consumed time and metrics reach not bad level. Next step is to employ more accurate profiler which is the line_profiler to test consumed period, and adjust num_workers to speed up sampling procedures on same certain datasets faster.
      
      * a nightly build
      
      * Update .gitignore
      
      * reorganize codes
      
      Reorganize some codes and comments.
      
      * a nightly build
      
      * Update .gitignore
      
      * fix bugs
      
      Fix bugs about why fully offline sampling and author's version don't work
      
      * reorganize files and codes
      
      Reorganize files and codes then do some experiments to test the performance of offline sampling and online sampling
      
      * do some experiments and update README
      
      * a nightly build
      
      * a nightly build
      
      * Update README.md
      
      * delete unnecessary files
      
      * Update README.md
      
      * a nightly update
      
      1. handle directory named 'graphsaintdata'
      2. control graph shift between gpu and cpu related to large dataset ('amazon')
      3. remove parameter 'train'
      4. refine annotations of the sampler
      5. update README.md including updating dataset info, dependencies info, etc
      
      * a nightly update
      
      explain config differences in TEST part
      remove a sampling time variant
      make 'online' an argument
      change 'norm' to 'sampler'
      explain parameters in README.md
      
      * Update README.md
      
      * a nightly build
      
      * make online an argument
      * refine README.md
      * refine codes of `collate_fn` in sampler.py, in training phase only return one subgraph, no need to check if the number of subgraphs larger than 1
      
      * Update sampler.py
      
      check the problem on flickr is about overfitting.
      
      * a nightly update
      
      Fix the overfitting problem of `flickr` dataset. We need to restrict the number of subgraphs (also the number of iterations) used in each epoch of training phase. Or it might overfit when validating at the end of each epoch. The method to limit the number is a formula specified by the author.
      
      * Set up a new flag `full` specifying if the number of subgraphs used in training phase equals to that of pre-sampled subgraphs
      
      * Modify codes and annotations related the new flag
      
      * Add a new parameter called `node_budget` in the base class `SAINTSampler` to compute the specific formula
      
      * set `gpu` as a command line argument
      
      * Update README.md
      
      * Finish the experiments on Flickr, which is done after adding new flag `full`
      
      * a nightly update
      
      * use half of edges in the original graph to do sampling
      * test dgl.random.choice with or without replacement with half of edges
      ~ next is to test what if put the calculating probability part out of __getitem__ can speed up sampling and try to implement sampling method of author
      
      * employ cython to implement edge sampling for per edge
      
      * employ cython to implement edge sampling for per edge
      * doing experiments to test consumed time and performance
      ** the consumed time decreased to approximately 480s, the performance decrease about 5 points.
      * deprecate cython implementation
      
      * Revert "employ cython to implement edge sampling for per edge"
      
      * This reverts commit 4ba4f092
      * Deprecate cython implementation
      * Reserve half-edges mechanism
      
      * a nightly update
      
      * delete unnecessary annotations
      Co-authored-by: default avatarMufei Li <mufeili1996@gmail.com>
      aef96dfa
  27. 23 Sep, 2021 1 commit
  28. 21 Sep, 2021 1 commit
  29. 20 Sep, 2021 1 commit
  30. 13 Sep, 2021 2 commits
  31. 02 Sep, 2021 1 commit
  32. 27 Aug, 2021 1 commit
    • esang's avatar
      [Model] Point transformer (#3284) · 4fb50be4
      esang authored
      
      
      * some modifications for pointnet2
      
      * temporarily save changes
      
      * move files to new directory point_transformer
      
      * implement point transformer for classification
      
      * restore train_cls in pointnet
      
      * implement point transformer for partseg
      
      * fix point transformer for nan loss
      
      * modify point transformer for cls
      
      * modify training setting
      
      * update transformer for cls
      
      * update code
      
      * update code for latest performance
      
      * update the example index
      
      * some minor changes
      Co-authored-by: default avatarTong He <hetong007@gmail.com>
      4fb50be4
  33. 23 Aug, 2021 1 commit
  34. 20 Aug, 2021 1 commit
  35. 19 Aug, 2021 1 commit
  36. 16 Aug, 2021 1 commit
  37. 11 Aug, 2021 1 commit
    • JOHNW02's avatar
      [example] Create EEG-GCNN example. (#3186) · 738b75f4
      JOHNW02 authored
      * Create EEG-GCNN example.
      
      * Update README.md
      
      * Remove gitignore file.
      
      * Update README.md
      
      * change 'datas' to 'datasets'.
      
      * Change train.py to main.py
      
      * Added an entry in the indexing page.
      
      * State "simplified version"; change how to run.
      
      * Fix bug in contact
      
      * Remove paper link in reference.
      
      * Create working branch
      
      * Add normalization of x.
      
      * Update paper link and tags
      
      * Update paper link in readme
      
      * Update readme; add patient level indices
      
      * Update readme. Add comments to models
      
      * Update README.md
      
      * change to with; specify location for ch and el; move note
      
      * fix bug for note
      
      * Add args for models; clean code.
      
      * delete = in readme
      
      * Add reference for spec_coh_values
      738b75f4