- 24 Jun, 2022 1 commit
-
-
nv-dlasalle authored
* Add uva by default to embedding * More updates * Update optimizer * Add new uva functions * Expose new pinned memory function * Add unit tests * Update formatting * Fix unit test * Handle auto UVA case when training is on CPU * Allow per-embedding decisions for whether to use UVA * Address spares_optim.py comments * Remove unused templates * Update unit test * Use dgl allocate memory for pinning * allow automatically unpin * workaround for d2h copy with a different dtype * fix linting * update error message * update copyright Co-authored-by:
Xin Yao <xiny@nvidia.com> Co-authored-by:
Minjie Wang <wmjlyjemaine@gmail.com>
-
- 23 Jun, 2022 2 commits
-
-
Xin Yao authored
* Explicitly unpin tensoradapter allocated arrays * Undo unrelated change * Add unit test * update unit test * add pinned_by_dgl flag to NDArray::Container * use dgl.ndarray for holding the pinning status * update multi-gpu uva inference * reinterpret cast NDArray::Container* to DLTensor* in MoveAsDLTensor * update unpin column and examples * add unit test for unpin column Co-authored-by:
Dominique LaSalle <dlasalle@nvidia.com> Co-authored-by:
nv-dlasalle <63612878+nv-dlasalle@users.noreply.github.com>
-
Triston authored
* Fix a cub compile error for CUDA 11.5 * Fix comparison of integer expressions of different signedness in coo_sort.cu file * Fix comparison of integer expressions of different signedness in cuda_compact_graph.cu file * Remove never referenced variable in spmm.cu * Fix comparison of integer expressions of different signedness in rowwise_pick.h file * Fix comparison of integer expressions of different signedness in choice.cc file * Remove never referenced variable col_data in spat_op_impl_coo.cc * Remove never referenced variable allowed in global_uniform.cc * Fix comparison of integer expressions of different signedness in graph.cc * Fix comparison of integer expressions of different signedness in graph_apis.cc * Fix the un-used ctx variable in ndarray_partition.cc file for cpu only build * Fix comparison of integer expressions of different signedness in libra_partition.cc * Fix comparison of integer expressions of different signedness in graph_op.cc Co-authored-by:
Triston Cao <tristonc@nvidia.com> Co-authored-by:
Quan (Andy) Gan <coin2028@hotmail.com>
-
- 20 Jun, 2022 1 commit
-
-
Rhett Ying authored
-
- 14 Jun, 2022 1 commit
-
-
nv-dlasalle authored
* Disable non-atomic atomic operations * Improve error message * Make error message more friendly
-
- 11 Jun, 2022 1 commit
-
-
Xin Yao authored
* Wrap all CUDA runtime API/CUB calls with macro * remove the usage of explicit cudaMalloc in favor of AllocWorkspace * fix typo Co-authored-by:Israt Nisa <neesha295@gmail.com>
-
- 08 Jun, 2022 1 commit
-
-
Rhett Ying authored
* [ist] enable time out when fetching msg * fix lint error * minor refinements * improve minor log * fix dist test * fix timeout issue in tensorpipe
-
- 07 Jun, 2022 1 commit
-
-
ndickson-nvidia authored
* * Added specialization of cublasGemm function for `__half` type, to try to address https://github.com/dmlc/dgl/issues/3988 * * Added USE_FP16 guard * * Added test cases to test_segment_mm, to test newly-added FP16 specialization of cublasGemm * * Replaced for loop in test_segment_mm with pytest.mark.parametrize, as recommended Co-authored-by:
Xin Yao <xiny@nvidia.com>
-
- 06 Jun, 2022 3 commits
-
-
ndickson-nvidia authored
* * Added support for common operations on FP16 (`half` or `__half`) for older GPU architectures * Fixed an issue with previous check for FP16 support * * Removing FP16 type checks, since they should no longer be needed * * Fixed AtomicAdd to be atomic for `float` and `double` for old GPU architectures. Unfortunately, it seems that atomicCAS for unsigned short seems to be unavailable until architecture 70, so half will have to stay non-atomic on old GPUs. * * Fixed non-atomic version of `AtomicAdd<half>` for older GPUs to return old value instead value of new
-
Quan (Andy) Gan authored
Co-authored-by:Xin Yao <xiny@nvidia.com>
-
Xin Yao authored
Co-authored-by:
nv-dlasalle <63612878+nv-dlasalle@users.noreply.github.com> Co-authored-by:
Israt Nisa <neesha295@gmail.com>
-
- 28 May, 2022 3 commits
-
-
Quan (Andy) Gan authored
* change warning message * Update tensordispatch.cc
-
Quan (Andy) Gan authored
This reverts commit fdd1fe19.
-
Quan (Andy) Gan authored
-
- 26 May, 2022 1 commit
-
-
nv-dlasalle authored
* Enable FP16 for GPU builds in CI * Limit default GPU archs to pascal and above * Disable FP16 dispatching for cuda architectures less than 60 * Fix linting * Fix typos
-
- 25 May, 2022 1 commit
-
-
Minjie Wang authored
* cython nogil * move APIs to internal and add unit test * fix lint * disable callback array test
-
- 17 May, 2022 1 commit
-
-
paoxiaode authored
* Change the curand_init parameter * Change the curand_init parameter * commit * commit * change the curandState and launch dim of CSRRowwiseSample kernel * commit * keep _CSRRowWiseSampleReplaceKernel in sync Co-authored-by:nv-dlasalle <63612878+nv-dlasalle@users.noreply.github.com>
-
- 16 May, 2022 2 commits
-
-
nv-dlasalle authored
* Explicitly unpin tensoradapter allocated arrays * Undo unrelated change * Add unit test * update unit test
-
Xin Yao authored
* remove unnecessary induced vertices in EdgeSubgraph * add unit test
-
- 12 May, 2022 1 commit
-
-
nv-dlasalle authored
-
- 11 May, 2022 1 commit
-
-
Rhett Ying authored
* [Dist] Enable maximum try times for socket backend via DGL_DIST_MAX_TRY_TIMES * reset env before/after test * print log for info when trying to connect * fix * print log in python instead of cpp
-
- 27 Apr, 2022 1 commit
-
-
Rhett Ying authored
* [Feature] enable socket net_type for rpc * fix lint * fix lint * fix build issue on windows * fix test failure on windows * fix test failure * fix cpp unit test failure * net_type blocking max_try_times * fix other comments * fix lint * fix comment * fix lint * fix cpp
-
- 26 Apr, 2022 1 commit
-
-
ayasar70 authored
* Based on issue #3436. Improving _SegmentCopyKernel s GPU utilization by switching to nonzero based thread assignment * fixing lint issues * Update cub for cuda 11.5 compatibility (#3468) * fixing type mismatch * tx guaranteed to be smaller than nnz. Hence removing last check * minor: updating comment * adding three unit tests for csr slice method to cover some corner cases * timing repeatkernel * clean * clean * clean * updating _SegmentMaskColKernel * Working on requests: removing sorted array check and adding comments to utility functions * fixing lint issue * Optimizing disjoint union kernel * Trying to resolve compilation issue on CI * [EMPTY] Relevant commit message here * applying revision requests on cpu/disjoint_union.cc * removing unnecessary casts * remove extra space Co-authored-by:
Abdurrahman Yasar <ayasar@nvidia.com> Co-authored-by:
nv-dlasalle <63612878+nv-dlasalle@users.noreply.github.com> Co-authored-by:
Jinjing Zhou <VoVAllen@users.noreply.github.com> Co-authored-by:
Quan (Andy) Gan <coin2028@hotmail.com>
-
- 12 Apr, 2022 1 commit
-
-
Quan (Andy) Gan authored
* cleaned pl node classification example * conform to PL's method of updating the dataloader * update * lint * fix test * fix
-
- 11 Apr, 2022 1 commit
-
-
Xin Yao authored
* enable uva for pinsage sampler * unit test * modify some checks on the python side * remove legacy random walk code * update unit test * update unit test * fix unit test * adjust checks * move some checks to c++ * move max_nodes check to cuda kernel * fix ci for tf Co-authored-by:
Quan (Andy) Gan <coin2028@hotmail.com> Co-authored-by:
nv-dlasalle <63612878+nv-dlasalle@users.noreply.github.com>
-
- 09 Apr, 2022 1 commit
-
-
Rhett Ying authored
* [BugFix] record/restore pin status when pickle/unpickle * disable test on TF * set version as expected * unpin memory in test
-
- 05 Apr, 2022 1 commit
-
-
nv-dlasalle authored
[Examples] Update graphsage multi-gpu example to use mutliple GPUs for validation and testing. (#3827) * Update graphsage multi-gpu example to use mutliple GPUs for validation and testing. * Remove argmax * Fix rebase error * Add more documentation to example and simplify * Switch to name shared memory * Add comment about how training is distributed * Restore iteration count * fix munmap error reporting for better error messages Co-authored-by:Quan (Andy) Gan <coin2028@hotmail.com>
-
- 31 Mar, 2022 1 commit
-
-
Quan (Andy) Gan authored
* fix uva with partial node types * lint * skip tensorflow unit test
-
- 27 Mar, 2022 1 commit
-
-
Cheng Wan authored
* upd * upd * upd * upd * upd * fix OpenMP compatibility issues * typo * partition * misc * fix typo * num_parts=1 * import torch * long * print info * print info * print info * upd * remove debug code * revert partition.py * fix cut count * fix cut count * Revert "fix cut count" This reverts commit 10926b4fd48f45c8f1ddb58be7db6c22e653effd. * Revert "fix cut count" This reverts commit 76465283bef093a2b4209ad70dd15d2437b2ec8a. * type of deprecate * typo in deprecate info * fix typo * use cv for partitioning * CE * no message * revert * typo * add objtype * no message * fix bug * fix bug * fix bug * ? * semicolon * drop tensors * no message * backward * backward * max op * store X.shape * th * test * Revert "test" This reverts commit 92b3b2f64a3a1128590098fa03ce429c5466e6ce. * test * tolist * debug * to cuda * tuple * fix bug * remove X * no message * fix bug * workload balance * Revert "workload balance" This reverts commit d7f8e4a16ba2a7eabb4a9bb945523bfe6623e723. * reverse * Revert "reverse" This reverts commit 8a71cf25685aa7d889b9b8881b46f7a16b7d6e6d. * Revert "Revert "reverse"" This reverts commit 196b143932d5cf9813576ece7c990b63d322d063. * Revert "Revert "Revert "reverse""" This reverts commit cf9e89a07013582056e7cde235e51331aca7fa9c. * no message * Merge commit '5498cf05' # Conflicts: # python/dgl/distributed/partition.py * Revert "Merge commit '5498cf05 '" This reverts commit f79be2ad777897c7025b28308454cad81ad6bb27. * fix bug * third party * no message * try to avoid memory leak * try to avoid memory leak * avoid memory leak with no hope * Revert "avoid memory leak with no hope" This reverts commit c77befe9479f46758e744642f66dd209b50eef7d. * no message * Revert "no message" This reverts commit 478cb28fe25fb1002b2f1dc202bb9bdaad8b2a56. * del * Revert "del" This reverts commit 1b468e45ce646b400ff3ffa61a0b2da058b3bdfd. * no message * no message * Revert "no message" This reverts commit 92e4f5561ed42da0606618b2fff9f1ad5ed439d9. * third party * document * Update metis_partition.cc * Update metis_partition_hetero.cc * Update metis_partition_hetero.cc * Update partition.py * Update partition.py * Update partition.py Co-authored-by:
yzh119 <expye@outlook.com> Co-authored-by:
chwan-rice <54331508+chwan-rice@users.noreply.github.com> Co-authored-by:
Jinjing Zhou <VoVAllen@users.noreply.github.com> Co-authored-by:
Da Zheng <zhengda1936@gmail.com>
-
- 24 Mar, 2022 2 commits
-
-
Quan (Andy) Gan authored
* fix * remove setcxx methods * move pin flag to CSR and COO matrix Co-authored-by:Jinjing Zhou <VoVAllen@users.noreply.github.com>
-
Rhett Ying authored
Co-authored-by:Jinjing Zhou <VoVAllen@users.noreply.github.com>
-
- 10 Mar, 2022 1 commit
-
-
paoxiaode authored
* Change the curand_init parameter * Change the curand_init parameter * commit * commit Co-authored-by:nv-dlasalle <63612878+nv-dlasalle@users.noreply.github.com>
-
- 01 Mar, 2022 1 commit
-
-
Quan (Andy) Gan authored
* fix * explain * oops
-
- 28 Feb, 2022 2 commits
-
-
Quan (Andy) Gan authored
* split files * fix
-
Quan (Andy) Gan authored
* Update randomwalk_gpu.cu * Update randomwalk_gpu.cu
-
- 27 Feb, 2022 1 commit
-
-
Quan (Andy) Gan authored
* huuuuge update * remove * lint * lint * fix * what happened to nccl * update multi-gpu unsupervised graphsage example * replace most of the dgl.mp.process with torch.mp.spawn * update if condition for use_uva case * update user guide * address comments * incorporating suggestions from @jermainewang * oops * fix tutorial to pass CI * oops * fix again Co-authored-by:Xin Yao <xiny@nvidia.com>
-
- 23 Feb, 2022 2 commits
-
-
sanchit-misra authored
-
Minjie Wang authored
* WIP: TypedLinear and new RelGraphConv * wip * further simplify RGCN * a bunch of tweak for performance; add basic cpu support * update on segmm * wip: segment.cu * new backward kernel works * fix a bunch of bugs in kernel; leave idx_a for future * add nn test for typed_linear * rgcn nn test * bugfix in corner case; update RGCN README * doc * fix cpp lint * fix lint * fix ut * wip: hgtconv; presorted flag for rgcn * hgt code and ut; WIP: some fix on reorder graph * better typed linear init * fix ut * fix lint; add docstring
-
- 21 Feb, 2022 1 commit
-
-
Quan (Andy) Gan authored
* fixes * fix * more fixes * update * oops * lint? * temporarily revert - will fix in another PR * more fixes * skipping mxnet test * address comments * fix DDP * fix edge dataloader exclusion problems * stupid bug * fix * use_uvm option * fix * fixes * fixes * fixes * fixes * add evaluation for cluster gcn and ddp * stupid bug again * fixes * move sanity checks to only support DGLGraphs * pytorch lightning compatibility fixes * remove * poke * more fixes * fix * fix * disable test * docstrings * why is it getting a memory leak? * fix * update * updates and temporarily disable forkingpickler * update * fix? * fix? * oops * oops * fix * lint * huh * uh * update * fix * made it memory efficient * refine exclude interface * fix tutorial * fix tutorial * fix graph duplication in CPU dataloader workers * lint * lint * Revert "lint" This reverts commit 805484dd553695111b5fb37f2125214a6b7276e9. * Revert "lint" This reverts commit 0bce411b2b415c2ab770343949404498436dc8b2. * Revert "fix graph duplication in CPU dataloader workers" This reverts commit 9e3a8cf34c175d3093c773f6bb023b155f2bd27f. Co-authored-by:
xiny <xiny@nvidia.com> Co-authored-by:
Jinjing Zhou <VoVAllen@users.noreply.github.com>
-
- 18 Feb, 2022 1 commit
-
-
ayasar70 authored
* Based on issue #3436. Improving _SegmentCopyKernel s GPU utilization by switching to nonzero based thread assignment * fixing lint issues * Update cub for cuda 11.5 compatibility (#3468) * fixing type mismatch * tx guaranteed to be smaller than nnz. Hence removing last check * minor: updating comment * adding three unit tests for csr slice method to cover some corner cases * timing repeatkernel * clean * clean * clean * updating _SegmentMaskColKernel * Working on requests: removing sorted array check and adding comments to utility functions * fixing lint issue Co-authored-by:
Abdurrahman Yasar <ayasar@nvidia.com> Co-authored-by:
nv-dlasalle <63612878+nv-dlasalle@users.noreply.github.com> Co-authored-by:
Jinjing Zhou <VoVAllen@users.noreply.github.com>
-