- 27 Jun, 2022 1 commit
-
-
Rhett Ying authored
* [Dist] enable USE_EPOLL in default * fix build issue on windows * fix build issue on windows * fix build issue on windows * fix build issue on windows * fix build issue on windows * fix build issue
-
- 24 Jun, 2022 2 commits
-
-
PotatoChipsNinja authored
Co-authored-by:Xin Yao <xiny@nvidia.com>
-
nv-dlasalle authored
* Add uva by default to embedding * More updates * Update optimizer * Add new uva functions * Expose new pinned memory function * Add unit tests * Update formatting * Fix unit test * Handle auto UVA case when training is on CPU * Allow per-embedding decisions for whether to use UVA * Address spares_optim.py comments * Remove unused templates * Update unit test * Use dgl allocate memory for pinning * allow automatically unpin * workaround for d2h copy with a different dtype * fix linting * update error message * update copyright Co-authored-by:
Xin Yao <xiny@nvidia.com> Co-authored-by:
Minjie Wang <wmjlyjemaine@gmail.com>
-
- 23 Jun, 2022 5 commits
-
-
Lucas Prieto authored
Co-authored-by:
Mufei Li <mufeili1996@gmail.com> Co-authored-by:
Xin Yao <xiny@nvidia.com>
-
Chang Liu authored
* Remove all torchtext legacy-related APIs * Remove unused BagOfWordsPretrained class, and fix some typos Co-authored-by:Mufei Li <mufeili1996@gmail.com>
-
Xin Yao authored
* Explicitly unpin tensoradapter allocated arrays * Undo unrelated change * Add unit test * update unit test * add pinned_by_dgl flag to NDArray::Container * use dgl.ndarray for holding the pinning status * update multi-gpu uva inference * reinterpret cast NDArray::Container* to DLTensor* in MoveAsDLTensor * update unpin column and examples * add unit test for unpin column Co-authored-by:
Dominique LaSalle <dlasalle@nvidia.com> Co-authored-by:
nv-dlasalle <63612878+nv-dlasalle@users.noreply.github.com>
-
Triston authored
* Fix a cub compile error for CUDA 11.5 * Fix comparison of integer expressions of different signedness in coo_sort.cu file * Fix comparison of integer expressions of different signedness in cuda_compact_graph.cu file * Remove never referenced variable in spmm.cu * Fix comparison of integer expressions of different signedness in rowwise_pick.h file * Fix comparison of integer expressions of different signedness in choice.cc file * Remove never referenced variable col_data in spat_op_impl_coo.cc * Remove never referenced variable allowed in global_uniform.cc * Fix comparison of integer expressions of different signedness in graph.cc * Fix comparison of integer expressions of different signedness in graph_apis.cc * Fix the un-used ctx variable in ndarray_partition.cc file for cpu only build * Fix comparison of integer expressions of different signedness in libra_partition.cc * Fix comparison of integer expressions of different signedness in graph_op.cc Co-authored-by:
Triston Cao <tristonc@nvidia.com> Co-authored-by:
Quan (Andy) Gan <coin2028@hotmail.com>
-
Rhett Ying authored
-
- 22 Jun, 2022 3 commits
-
-
Mufei Li authored
* Update citation_graph.py * Update * Update * Update Co-authored-by:Minjie Wang <wmjlyjemaine@gmail.com>
-
Quan (Andy) Gan authored
* fix * fix * Update utils.py
-
maqy authored
* fix unstable sort * add torch version check * reformat * split too long comments * Update dataloader.py Co-authored-by:Quan (Andy) Gan <coin2028@hotmail.com>
-
- 21 Jun, 2022 1 commit
-
-
Mufei Li authored
* Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update
-
- 20 Jun, 2022 3 commits
-
-
Rhett Ying authored
* [Dist] defer to load node/edge feats * fix lint * Update python/dgl/distributed/partition.py Co-authored-by:
Minjie Wang <minjie.wang@nyu.edu> * Update python/dgl/distributed/partition.py Co-authored-by:
Minjie Wang <minjie.wang@nyu.edu> * fix lint Co-authored-by:
Minjie Wang <minjie.wang@nyu.edu>
-
Anthony Mahanna authored
* Update README.md * fix: ArangoDB hyperlink Co-authored-by:Minjie Wang <wmjlyjemaine@gmail.com>
-
Rhett Ying authored
-
- 17 Jun, 2022 1 commit
-
-
RuisiZhang authored
* add dist tutorial * add predictor in dist prediction * refine after rendering * change links * Update 2_link_prediction.py Co-authored-by:
Mufei Li <mufeili1996@gmail.com> Co-authored-by:
Quan (Andy) Gan <coin2028@hotmail.com> Co-authored-by:
Rhett Ying <85214957+Rhett-Ying@users.noreply.github.com> Co-authored-by:
Minjie Wang <wmjlyjemaine@gmail.com>
-
- 16 Jun, 2022 1 commit
-
-
Rhett Ying authored
* [Dist] set socket as default backend for RPC * add tests both for socket and tensorpipe
-
- 15 Jun, 2022 3 commits
-
-
Yunxin Sang authored
-
ndickson-nvidia authored
* * Added functions from dgl.transforms.functional that were missing from the list for documentation in dgl.rst * * Sorted transform ops list in dgl.rst in alphabetical order Co-authored-by:Xin Yao <xiny@nvidia.com>
-
Serge Panev authored
Signed-off-by:
Serge Panev <spanev@nvidia.com> Co-authored-by:
Rhett Ying <85214957+Rhett-Ying@users.noreply.github.com>
-
- 14 Jun, 2022 5 commits
-
-
nv-dlasalle authored
* Disable non-atomic atomic operations * Improve error message * Make error message more friendly
-
彭齐荣 authored
* Fix fail to create_shared_mem_array in ddp spawn train #4110 Fix fail to create_shared_mem_array in ddp spawn train #4110 * [Bugfix] Fix fail to create_shared_mem_array in ddp spawn train #4110 [Bugfix] Fix fail to create_shared_mem_array in ddp spawn train #4110 Replace random.seed() to random_ = random.Random() * Update pytorch.py Co-authored-by:Quan (Andy) Gan <coin2028@hotmail.com>
-
Rhett Ying authored
-
RecLusIve-F authored
* Add Flickr and Yelp dataset * Update flickr.py * update * Update yelp.py * Update yelp.py * update * Update yelp.py * Update test_data.py * Update yelp.py * update * Update test_data.py * Update yelp.py Co-authored-by:Mufei Li <mufeili1996@gmail.com>
-
Rhett Ying authored
* [Dist] master port should be fixed for all trainers * add tests for tools/launch.py
-
- 12 Jun, 2022 2 commits
-
-
Huarui HE authored
* add argument reorder=False for citation_graph * add description of the argument reorder * add reordered/un_reordered save_path * add version number postfix Co-authored-by:Mufei Li <mufeili1996@gmail.com>
-
Quan (Andy) Gan authored
-
- 11 Jun, 2022 1 commit
-
-
Xin Yao authored
* Wrap all CUDA runtime API/CUB calls with macro * remove the usage of explicit cudaMalloc in favor of AllocWorkspace * fix typo Co-authored-by:Israt Nisa <neesha295@gmail.com>
-
- 09 Jun, 2022 3 commits
-
-
Rhett Ying authored
-
Rhett Ying authored
-
Chang Liu authored
[Bugfix] Fix example case: examples/pytorch/ogb/ogbn-proteins and examples/pytorch/ogb/ogbn-products (#4080) * [Bugfix] Fix ogbn-gat-proteins/products examples * Remove unused BatchSampler definition * Remove comments to ease reading/reviewing * Remove dataloader wrapper
-
- 08 Jun, 2022 4 commits
-
-
Rhett Ying authored
* [ist] enable time out when fetching msg * fix lint error * minor refinements * improve minor log * fix dist test * fix timeout issue in tensorpipe
-
Rhett Ying authored
* [DistTest] add python test of RPC * remove return
-
Rhett Ying authored
* [DistTest] add basic pipeline for dist test across machines * move launch remote cmd to separate file * add test for rpc * fix function naming rule Co-authored-by:Minjie Wang <wmjlyjemaine@gmail.com>
-
ndickson-nvidia authored
-
- 07 Jun, 2022 2 commits
-
-
ndickson-nvidia authored
* * Added specialization of cublasGemm function for `__half` type, to try to address https://github.com/dmlc/dgl/issues/3988 * * Added USE_FP16 guard * * Added test cases to test_segment_mm, to test newly-added FP16 specialization of cublasGemm * * Replaced for loop in test_segment_mm with pytest.mark.parametrize, as recommended Co-authored-by:
Xin Yao <xiny@nvidia.com>
-
Tudor Andrei Dumitrascu authored
* Update link to correct destination. * Update 4_rgcn.py * Update 4_rgcn.py * Update tutorials/models/1_gnn/4_rgcn.py * Update tutorials/models/1_gnn/4_rgcn.py * Update tutorials/models/1_gnn/4_rgcn.py Co-authored-by:Rhett Ying <85214957+Rhett-Ying@users.noreply.github.com>
-
- 06 Jun, 2022 3 commits
-
-
ndickson-nvidia authored
* * Added support for common operations on FP16 (`half` or `__half`) for older GPU architectures * Fixed an issue with previous check for FP16 support * * Removing FP16 type checks, since they should no longer be needed * * Fixed AtomicAdd to be atomic for `float` and `double` for old GPU architectures. Unfortunately, it seems that atomicCAS for unsigned short seems to be unavailable until architecture 70, so half will have to stay non-atomic on old GPUs. * * Fixed non-atomic version of `AtomicAdd<half>` for older GPUs to return old value instead value of new
-
Quan (Andy) Gan authored
Co-authored-by:Xin Yao <xiny@nvidia.com>
-
Xin Yao authored
Co-authored-by:
nv-dlasalle <63612878+nv-dlasalle@users.noreply.github.com> Co-authored-by:
Israt Nisa <neesha295@gmail.com>
-