- 11 Aug, 2022 3 commits
-
-
Minjie Wang authored
-
kylasa authored
* Adding launch script and wrapper script to trigger distributed graph partitioning pipeline as defined in the UX document 1. dispatch_data.py is a wrapper script which builds the command and triggers the distributed partitioning pipeline 2. distgraphlaunch.py is the main python script which triggers the pipeline and to simplify its usage dispatch_data.py is included as a wrapper script around it. * Added code to auto-detect python version and retrieve some parameters from the input metadata json file 1. Auto detect python version 2. Read the metadata json file and extract some parameters to pass to the user defined command which is used to trigger the pipeline. * Updated the json file name to metadata.json file per UX documentation 1. Renamed json file name per UX documentation. * address comments * fix * fix doc * use unbuffered logging to cure anxiety * cure more anxiety * Update tools/dispatch_data.py Co-authored-by:
Minjie Wang <minjie.wang@nyu.edu> * oops Co-authored-by:
Quan Gan <coin2028@hotmail.com> Co-authored-by:
Minjie Wang <minjie.wang@nyu.edu>
-
Quan (Andy) Gan authored
* first commit * update * huh * fix * update * revert core * fix * update * rewrite * oops * address comments * add graph name * address comments * remove sample metadata file * address comments * fix * remove * add docs
-
- 10 Aug, 2022 4 commits
-
-
YJ-Zhao authored
* rgcn-ogbn-mag * Add link in README.md * correct code-format,add the reset_parameters function to the HeteroEmbedding module * add the annotation in hetero.py * add a unit test * modify format * Update Co-authored-by:
Mufei Li <mufeili1996@gmail.com> Co-authored-by:
Ubuntu <ubuntu@ip-172-31-50-143.us-west-2.compute.internal>
-
Chang Liu authored
* Fix infograph example * Update * Revert the changes and update Doc * Update * Split lines to pass CI-lint * Update * Update Co-authored-by:Mufei Li <mufeili1996@gmail.com>
-
Minjie Wang authored
-
Minjie Wang authored
-
- 09 Aug, 2022 2 commits
-
-
Xin Yao authored
-
Quan (Andy) Gan authored
* bunch of fixes * Update test_edge_softmax_hetero.py * Update test_edge_softmax_hetero.py Co-authored-by:Minjie Wang <wmjlyjemaine@gmail.com>
-
- 07 Aug, 2022 2 commits
-
-
kylasa authored
* Fix for node_subgraph function, which seems to generate segmentation fault for very large partitions 1. Removed three graph dgl objects and we create the final dgl object directly by maintaining the following constraints a) nodes are reordered so that local nodes are placed in the beginning of the nodes list compared to non-local nodes. b)Edges order are maintained as passed into this function. c) src/dst end points are mapped to target values based on the reshuffle'd nodes order. * Code changes addressing CI comments for this PR 1. Used Da's suggested map to map nodes from old to new order. This is much simpler and mem. efficient. * Addressing CI Comments. 1. Reduced the amount of documentation to reflect the actual implementation. 2. named the mapping object appropriately.
-
Chang Liu authored
* Fix doc and default settings for RelGraphConv * Add unit test * Split msg in two lines to pass CI-lint
-
- 06 Aug, 2022 1 commit
-
-
kylasa authored
* Alltoall Fix to bypass gloo - alltoallv bug which is preventing further testing 1. Replaced alltoallv gloo wrapper call with alltoall message. 2. All the messages are padded to be of same length 3. Receiving side unpads the messages and continues processing. * Code changes to address CI comments 1. Removed unused functions from gloo_wrapper.py 2. Changed the function signature of alltoallv_cpu_data as suggested. 3. Added docstring to include more description of the functionality inside alltoallv_cpu_data. Included more asserts to validate the assumptions. * Changed the function name appropriately Changed the function name from "alltoallv_cpu_data" to alltoallv_cpu which I believe is appropriate because underlying functionality is providing alltoallv which is basically alltoall_cpu + padding * Added code and text to address the review comments. 1. Changed the function name to indicate the local use of this function. 2. Changed docstring to indicate the assumptions made by alltoallv_cpu function. * Removed unused function from import statement Removed unused/removed function from import statement.
-
- 03 Aug, 2022 1 commit
-
-
Rhett Ying authored
-
- 02 Aug, 2022 1 commit
-
-
Xin Yao authored
* test ddp dataloader * add pure_gpu for edgedataloader * resolve ddp issue
-
- 01 Aug, 2022 3 commits
-
-
Rhett Ying authored
-
Xin Yao authored
* enable use for weighted neighbor sampler and biased random walk * add unit tests * fix for mxnet/tf * fix typo
-
Chang Liu authored
* Refactor GIN example * Update * Update README * Minor update * README update Co-authored-by:Mufei Li <mufeili1996@gmail.com>
-
- 30 Jul, 2022 2 commits
-
-
Rhett Ying authored
-
Rhett Ying authored
* [CI] separate distributed tests from torch cpu tests * remove TF related env
-
- 29 Jul, 2022 1 commit
-
-
Xin Yao authored
* add weighted sampling without replacement (A-Chao) * improve Algorithm A-Chao with block-wise prefix sum * correctly fill out_idxs * implement weighted sampling with replacement * small fix * merge host-side code of weighted/uniform sampling * enable unit tests for cuda weighted sampling * move thrust/cub wrapper to the cmake file * update docs accordingly * fix linting * fix linting * fix unit test * Bump external CUB/Thrust versions * Fix code style and update description of algorithm design * [Feature] GPU support weighted graph neighbor sampling commit by pengqirong(OPPO) * merge pengqirong's implementation * revert the change to cub and thrust * fix linting * use DeviceSegmentedSort for better performance * add more comments * add necessary notes * add necessary notes * resolve some comments * define THRUST_CUB_WRAPPED_NAMESPACE * fix doc Co-authored-by:彭齐荣 <657017034@qq.com>
-
- 28 Jul, 2022 2 commits
-
-
Rhett Ying authored
* [DistTest] fix incorrect shell if statement * fix incorrect use of dist.initialize()
-
Wey Gu authored
* Fixed device type in inference * change buffer_device instead
-
- 27 Jul, 2022 3 commits
-
-
Pengfei Xia authored
* Update * Update functional.py * Update * Update test_transform.py * Update * Update functional.py * Update functional.py * Update functional.py * Update functional.py * Update * Update * Update functional.py * Update functional.py * Update functional.py * Update functional.py * Update module.py * Update test_transform.py * Update test_transform.py Co-authored-by:Mufei Li <mufeili1996@gmail.com>
-
Zhuobin Huang authored
Co-authored-by:Xin Yao <xiny@nvidia.com>
-
Rhett Ying authored
* [Log] fix confusing error log in TCPSocket::Bind() * fix lint
-
- 26 Jul, 2022 2 commits
-
-
Chang Liu authored
* Modify to repro crash * Revert to orig. scenario and add fix * Update Co-authored-by:Xin Yao <xiny@nvidia.com>
-
Dewvin authored
* [Feature] Add CUDA Weighted Randomwalk Sampling * [Feature] Add CUDA Weighted Randomwalk Sampling * [Feature] Add CUDA Weighted Randomwalk Sampling * [Feature] Add CUDA Weighted Randomwalk Sampling * fix empty prob array && enable non-uniform for restart && enable unit tests * update doc and guide for randomwalk and pinsage * update comments Co-authored-by:
zhenliangqiu <ubuntu@ip-172-31-24-245.ap-southeast-1.compute.internal> Co-authored-by:
xiny <xiny@nvidia.com>
-
- 25 Jul, 2022 2 commits
-
-
Serge Panev authored
Signed-off-by:
Serge Panev <spanev@nvidia.com> Co-authored-by:
Mufei Li <mufeili1996@gmail.com>
-
Rhett Ying authored
-
- 23 Jul, 2022 1 commit
-
-
kylasa authored
* Code changes to address the updated file format support for massively large graphs. 1. Updated the docstring for the starting function 'gen_dist_partitions" to describe the newly proposed file format for input dataset. 2. Code which was dependent on the structure of the old-metadata json object has been updated to read from the newly proposed metadata file. 3. Fixed some errors when appropriate functions were invoked and the calling function expects return values from the invoked furnction. 4. This modified code has been tested on "mag" dataset using 4-way partitions and verified the results * Code changes to address the CI review comments 1. Improved docstrings for some functions. 2. Added a new function in the utils.py to compute the id ranges and this is used in multiple places. * Added TODO to indicate the redundant data structure. Because of the new file format changes, one of the dictionaries (node_feature_tids, node_tids) will be redundant. Added TODO text so that this will be removed in the next iteration of code changes.
-
- 22 Jul, 2022 1 commit
-
-
Chang Liu authored
* Refactor gat example * Add ppi support * Minor update * Update * Update * Change valid_xxx to val_xxx * Readme Update * Update Co-authored-by:Mufei Li <mufeili1996@gmail.com>
-
- 21 Jul, 2022 2 commits
-
-
Mufei Li authored
Co-authored-by:Ubuntu <ubuntu@ip-172-31-53-142.us-west-2.compute.internal>
-
Tianyue Cao authored
* upload PSS * upload PSS * upload PSS * pss code reformat * fix bug * update README * update train bash * remove vit * update README * delete InfoPlotter * delete Smooth_AP_loss.py * update README * update README Co-authored-by:Tianjun Xiao <xiaotj1990327@gmail.com>
-
- 20 Jul, 2022 1 commit
-
-
Serge Panev authored
Signed-off-by:
Serge Panev <spanev@nvidia.com> Co-authored-by:
Rhett Ying <85214957+Rhett-Ying@users.noreply.github.com>
-
- 19 Jul, 2022 3 commits
-
-
Chang Liu authored
* Fix node_classification.py * Minor update Co-authored-by:nv-dlasalle <63612878+nv-dlasalle@users.noreply.github.com>
-
Serge Panev authored
Signed-off-by:
Serge Panev <spanev@nvidia.com> Co-authored-by:
Rhett Ying <85214957+Rhett-Ying@users.noreply.github.com>
-
Rhett Ying authored
* [Doc] fix docstring typo * Update sparse_emb.py * Update sparse_emb.py * update link
-
- 15 Jul, 2022 1 commit
-
-
Quan (Andy) Gan authored
-
- 14 Jul, 2022 2 commits
-
-
Mufei Li authored
* Update * Update * Update * Update Co-authored-by:
Ubuntu <ubuntu@ip-172-31-53-142.us-west-2.compute.internal> Co-authored-by:
Xin Yao <xiny@nvidia.com>
-
Xin Yao authored
* Fix test_csrmm for tensor core * unset allow tf32 flag * update test unified tensor * skip fp16 for CPU
-