- 12 Apr, 2022 2 commits
-
-
Quan (Andy) Gan authored
* cleaned pl node classification example * conform to PL's method of updating the dataloader * update * lint * fix test * fix
-
Serge Panev authored
Signed-off-by: Serge Panev <spanev@nvidia.com>
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>
Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>
-
- 09 Apr, 2022 1 commit
-
-
Mufei Li authored
* Update README.md * Update README.md * Update README.md Co-authored-by:Rhett Ying <85214957+Rhett-Ying@users.noreply.github.com>
-
- 05 Apr, 2022 1 commit
-
-
nv-dlasalle authored
[Examples] Update graphsage multi-gpu example to use multiple GPUs for validation and testing. (#3827) * Update graphsage multi-gpu example to use multiple GPUs for validation and testing. * Remove argmax * Fix rebase error * Add more documentation to example and simplify * Switch to name shared memory * Add comment about how training is distributed * Restore iteration count * fix munmap error reporting for better error messages Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>
-
- 25 Mar, 2022 1 commit
-
-
Quan (Andy) Gan authored
* fix distributed multi-GPU example device * try Join * update version requirement in README * use model.join * fix docs Co-authored-by:Jinjing Zhou <VoVAllen@users.noreply.github.com>
-
- 11 Mar, 2022 1 commit
-
-
nv-dlasalle authored
* Add pure_gpu example of graphsage * move to advanced directory Co-authored-by:Quan Gan <coin2028@hotmail.com>
-
- 07 Mar, 2022 1 commit
-
-
Quan (Andy) Gan authored
-
- 03 Mar, 2022 1 commit
-
-
Rhett Ying authored
-
- 01 Mar, 2022 1 commit
-
-
Rhett Ying authored
-
- 27 Feb, 2022 1 commit
-
-
Quan (Andy) Gan authored
* huuuuge update * remove * lint * lint * fix * what happened to nccl * update multi-gpu unsupervised graphsage example * replace most of the dgl.mp.process with torch.mp.spawn * update if condition for use_uva case * update user guide * address comments * incorporating suggestions from @jermainewang * oops * fix tutorial to pass CI * oops * fix again Co-authored-by:Xin Yao <xiny@nvidia.com>
-
- 09 Feb, 2022 1 commit
-
-
Xin Yao authored
* implement pin_memory/unpin_memory/is_pinned for dgl.graph * update python docstring * update c++ docstring * add test * fix the broken UnifiedTensor * XPU_SWITCH for kDLCPUPinned * a rough version ready for testing * eliminate extra context parameter for pin/unpin * update train_sampling * fix linting * fix typo * multi-gpu uva sampling case * disable new format materialization for pinned graphs * update python doc for pin_memory_ * fix unit test * UVA sampling for link prediction * dispatch most csr ops * update graphsage example to combine uva sampling and UnifiedTensor * update graphsage example to combine uva sampling and UnifiedTensor * update graphsage example to combine uva sampling and UnifiedTensor * update doc * update examples * change unitgraph and heterograph's PinMemory to in-place * update examples for multi-gpu uva sampling * update doc * fix linting * fix cpu build * fix is_pinned for DistGraph * fix is_pinned for DistGraph * update graphsage unsupervised example * update doc for gpu sampling * update some check for sampling device switching * fix linting * adapt for new dataloader * fix linting * fix * fix some name issue * adjust device check * add unit test for uva sampling & fix some zero_copy bug * fix linting * update num_threads in graphsage examples
Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>
-
- 07 Feb, 2022 1 commit
-
-
Jinjing Zhou authored
-
- 30 Jan, 2022 1 commit
-
-
Quan (Andy) Gan authored
* initial update * more * more * multi-gpu example * cluster gcn, finalize homogeneous * more explanation * fix * bunch of fixes * fix * RGAT example and more fixes * shadow-gnn sampler and some changes in unit test * fix * wth * more fixes * remove shadow+node/edge dataloader tests for possible ux changes * lints * add legacy dataloading import just in case * fix * update pylint for f-strings * fix * lint * lint * lint again * cherry-picking commit fa9f494 * oops * fix * add sample_neighbors in dist_graph * fix * lint * fix * fix * fix * fix tutorial * fix * fix * fix * fix warning * remove debug * add get_foo_storage apis * lint
-
- 20 Jan, 2022 1 commit
-
-
Da Zheng authored
Co-authored-by:Minjie Wang <minjie.wang@nyu.edu>
-
- 19 Jan, 2022 1 commit
-
-
Da Zheng authored
-
- 24 Dec, 2021 1 commit
-
-
xcwan authored
* Add nccl backend and fix pad_data function cuda bug * Update train_dist.py * Update train_dist.py Co-authored-by:Jinjing Zhou <VoVAllen@users.noreply.github.com>
-
- 20 Dec, 2021 1 commit
-
-
Jinjing Zhou authored
Co-authored-by:Quan (Andy) Gan <coin2028@hotmail.com>
-
- 15 Dec, 2021 1 commit
-
-
Vasimuddin Md authored
* added distgnn plus libra codebase * Dist application codes * added comments in partition code. changed the interface of partitioning call. * updated readme * create libra partitioning branch for the PR * removed distgnn files for first PR * updated kernel.cc * added libra_partition.cc and moved libra code from kernel.cc to libra_partition.cc * fixed lint error; merged libra2dgl.py and main_Libra.py to libra_partition.py; added graphsage/distgnn folder and partition script. * removed libra2dgl.py * fixed the lint error and cleaned the code. * revisions due to PR comments. added distgnn/tools contains partitions routines * update 2 PR revision I * fixed errors; also improved the runtime by 10x. * fixed minor lint error * fixed some more lints * PR revision II changed the interface of libra partition function * rewrite docstring Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>
-
- 06 Dec, 2021 1 commit
-
-
Jinjing Zhou authored
* tmp fix * add description
-
- 21 Oct, 2021 1 commit
-
-
Xin Yao authored
* gpu compact graph template * cuda compact graph draft * fix typo * compact graphs * pass unit test but fail in training * example using EdgeDataLoader on the GPU * refactor cuda_compact_graph and cuda_to_block * update training scripts * fix linting * fix linting * fix exclude_edges for the GPU * add --data-cpu & fix copyright
-
- 23 Sep, 2021 1 commit
-
-
Junwen Yao authored
-
- 20 Sep, 2021 1 commit
-
-
nv-dlasalle authored
-
- 02 Sep, 2021 1 commit
-
-
xiang song(charlie.song) authored
Co-authored-by:Ubuntu <ubuntu@ip-172-31-2-66.ec2.internal>
-
- 16 Jul, 2021 1 commit
-
-
David Min authored
[Feature][Performance][GPU] Introducing UnifiedTensor for efficient zero-copy host memory access from GPU (#3086) * Add pytorch-direct version * Initial commit of unified tensor * Merge branch 'master' of https://github.com/davidmin7/dgl * Remove unnecessary things * Fix error message * Fix/Add descriptions * whitespace fix * add unpin * disable IndexSelectCPUFromGPU with no CUDA * add a newline for unified_tensor.py * Apply changes based on feedback * add 'os' module * skip unified tensor unit test for cpu only * Update tests/pytorch/test_unified_tensor.py Co-authored-by: xiang song(charlie.song) <classicxsong@gmail.com> * reflect feedback
Co-authored-by: shhssdm <shhssdm@gmail.com>
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>
Co-authored-by: xiang song(charlie.song) <classicxsong@gmail.com>
-
- 15 Jul, 2021 1 commit
-
-
Mufei Li authored
* Update * Update * Update dependencies * Update * Update * Fix ogbn-products gat * Update * Update * Reformat * Fix typo in node2vec_random_walk * Specify file encoding * Working for 6.7 * Update * Fix subgraph * Fix doc for sample_neighbors_biased * Fix hyperlink * Add example for udf cross reducer * Fix * Add example for slice_batch * Replace dgl.bipartite * Fix GATConv * Fix math rendering * Fix doc
Co-authored-by: Ubuntu <ubuntu@ip-172-31-28-17.us-west-2.compute.internal>
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>
Co-authored-by: Ubuntu <ubuntu@ip-172-31-22-156.us-west-2.compute.internal>
-
- 13 Jul, 2021 1 commit
-
-
xiang song(charlie.song) authored
* fix. * fix. * fix. * fix. * Fix test * Deprecate old DistEmbedding impl, use synchronized embedding impl * update doc
Co-authored-by: Ubuntu <ubuntu@ip-172-31-71-112.ec2.internal>
Co-authored-by: Ubuntu <ubuntu@ip-172-31-2-66.ec2.internal>
Co-authored-by: Da Zheng <zhengda1936@gmail.com>
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>
-
- 28 Jun, 2021 1 commit
-
-
WangYQ authored
-
- 25 Jun, 2021 1 commit
-
-
Quan (Andy) Gan authored
* update docstrings and tidy code * add docs * address comments * Update __init__.py * address comments
-
- 16 Jun, 2021 1 commit
-
-
Da Zheng authored
* add. * fix. * fix. * fix. * fix. * add tests. * support node split and edge split. * support 1 partition. * add tests. * fix. * fix test. * use hierarchical partition. * add check.
Co-authored-by: Zheng <dzzhen@3c22fba32af5.ant.amazon.com>
Co-authored-by: Ubuntu <ubuntu@ip-172-31-22-57.us-west-2.compute.internal>
Co-authored-by: Ubuntu <ubuntu@ip-172-31-71-112.ec2.internal>
-
- 13 Jun, 2021 1 commit
-
-
nv-dlasalle authored
[Performance] Perform to_block on the GPU when the dataloader is created with a GPU `device`. (#3016) * add output device for dataloading * Update dataloader * Get sampler device from dataloader * Fix line length * Update examples * Fix to_block GPU for empty relation types * Handle the case where the DistGraph has None for the underlying graph Co-authored-by:Da Zheng <zhengda1936@gmail.com>
-
- 10 Jun, 2021 1 commit
-
-
nv-dlasalle authored
* Make multigpu graphsage use whole dataset * Specify squeeze dimension * Remove squeeze dimension Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>
-
- 26 May, 2021 1 commit
-
-
Da Zheng authored
* explicitly set the graph format. * fix. * fix. * fix launch script. * fix readme.
Co-authored-by: Zheng <dzzhen@3c22fba32af5.ant.amazon.com>
Co-authored-by: xiang song(charlie.song) <classicxsong@gmail.com>
Co-authored-by: Ubuntu <ubuntu@ip-172-31-71-112.ec2.internal>
-
- 17 May, 2021 1 commit
-
-
Quan (Andy) Gan authored
-
- 14 May, 2021 1 commit
-
-
Quan (Andy) Gan authored
* standardizing thread_wrapped_func * lints * Update __init__.py
-
- 11 May, 2021 1 commit
-
-
Quan (Andy) Gan authored
* pytorch lightning initial examples * revert most changes in dataloader to favor #2886. * address comments
-
- 03 May, 2021 1 commit
-
-
xiang song(charlie.song) authored
* Draft for sparse emb * add some notes * Fix * Add sparse optim for dist pytorch * Update test * Fix * upd * upd * Fix * Fix * Fix bug * add transductive example * Fix example * Some fix * Upd * Fix lint * lint * lint * lint * upd * Fix lint * lint * upd * remove dead import * update * lint * update unit test * update example * Add adam optimizer * Add unit test and update data * upd * upd * upd * Fix docstring and fix some bug in example code * Update rgcn readme
Co-authored-by: Ubuntu <ubuntu@ip-172-31-57-25.ec2.internal>
Co-authored-by: Ubuntu <ubuntu@ip-172-31-24-210.ec2.internal>
Co-authored-by: Ubuntu <ubuntu@ip-172-31-2-66.ec2.internal>
-
- 15 Apr, 2021 1 commit
-
-
nv-dlasalle authored
* Start on uniform GPU sampling * Save more work * Get cu file compiling * Update sampling * More changes * Get GPU sampling for uniform probabilities solved * Fix batch tensor migration * Fix * update kernels * expand blocking * Undo testing change * Cut down on sampling overhead * Fix replacement * Update unit tests * Add option to gpu sample in graphsage * Copy only csc to gpu * Add ogbn support * Fix linting * Remove nvtx from sample * Improve documentation and error checking * Expand documentation * Update assert checking * delete extra space * Use standard dataloader when dataset is a dictionary * ogb -> ogbn * Fix edge selection determinism * Fix typos * Remove nvtx * Add comment for self.fanout_arrays and assert * Fix linting * Migrate to scalarbatcher * Fix indentation * Fix batcher * Fix indexing * Only use databatcher for GPU * Convert to DGL NDArray to PyTorch Tensor * Add optimization for PyTorch's F.tensor() for list of GPU tensors Co-authored-by:Da Zheng <zhengda1936@gmail.com>
-
- 08 Apr, 2021 1 commit
-
-
Da Zheng authored
Co-authored-by: Ubuntu <ubuntu@ip-172-31-73-81.ec2.internal>
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>
-
- 30 Mar, 2021 1 commit
-
-
Da Zheng authored
* remove num_workers. * remove num_workers. * remove num_workers. * remove num-servers. * update error message. * update docstring. * fix docs. * fix tests. * fix test. * fix. * print messages in test. * fix. * fix test. * fix. Co-authored-by:Ubuntu <ubuntu@ip-172-31-9-132.us-west-1.compute.internal>
-
- 22 Mar, 2021 1 commit
-
-
Kaiqiang Xu authored
Method `dataset.num_labels` has been deprecated and replaced by `dataset.num_classes`. Updating the method name to avoid runtime warning. Co-authored-by:Minjie Wang <wmjlyjemaine@gmail.com>
-