- 26 Jan, 2022 1 commit
-
-
Rhett Ying authored
* [Feature] long live server for multiple client groups * generate globally unique name for DistTensor within DGL automatically
-
- 21 Jan, 2022 1 commit
-
-
Xin Yao authored
* implement pin_memory/unpin_memory/is_pinned for dgl.graph * update python docstring * update c++ docstring * add test * fix the broken UnifiedTensor * eliminate extra context parameter for pin/unpin * fix linting * fix typo * disable new format materialization for pinned graphs * update python doc for pin_memory_ * fix unit test * update doc * change unitgraph and heterograph's PinMemory to in-place * update comments for NDArray's PinMemory_ and PinData * update doc Co-authored-by:Jinjing Zhou <VoVAllen@users.noreply.github.com>
-
- 19 Jan, 2022 1 commit
-
-
Rhett Ying authored
* [Fix] reduce error msg, refine fetch logic of available ports * un-initialize client before sending shutdown request * fix import error * print connect failure log only in debug mode * enable DMLC_LOG_DEBUG=1 in CI
-
- 17 Jan, 2022 2 commits
-
-
Quan (Andy) Gan authored
* oops * test
-
Quan (Andy) Gan authored
* fix GPU global negative sampling code * Update negative_sampling.cu
-
- 11 Jan, 2022 2 commits
-
-
MaoYuan Xian authored
* Pass the std:min argument's type, to avoid the compilation error. * Update parallel_for.h * Update negative_sampling.cc Co-authored-by:Quan (Andy) Gan <coin2028@hotmail.com>
-
Rhett Ying authored
* [Feature] enable TP::Receiver wait for any numbers of senders * fix random unit test failure * avoid endless future wait * fix unit test failure * fix seg fault when finalize wait in receiver * [Feature] refactor sender connect logic and remove unnecessary sleeps in unit tests * fix lint * release RPCContext resources before process exits * [Debug] TPReceiver wait start log * [Debug] add log in get port * [Debug] add log * [ReDebug] revert time sleep in unit tests * [Debug] remove sleep for test_distri,test_mp * [debug] add more log * [debug] add listen_booted_ flag * [debug] restore commented code for queue * [debug] sleep more in rpc_client * restore change in tests * Revert "restore change in tests" This reverts commit 41a18926d181ec2517069389bfc41de2cc949280. * Revert "[debug] sleep more in rpc_client" This reverts commit a908e758eabca0a6ce62eb2e59baea02a840ac67. * Revert "[debug] restore commented code for queue" This reverts commit d3f993b3746e6bb6e2cc2f90204dd7e9461c6301. * Revert "[debug] add listen_booted_ flag" This reverts commit 244b2167d94942ff2a0acec8823b974975e52580. * Revert "[debug] add more log" This reverts commit 4b78447b0a575a824821dc7e25cca2246e6e30e2. * Revert "[Debug] remove sleep for test_distri,test_mp" This reverts commit e1df1aadcc8b1c2a0013ed77322ac391a8807612. * remove debug code * revert unnecessary change * revert unnecessary changes * always reset RPCContext when get started and reset all data * remove time.sleep in dist tests * fix lint * reset envs before each dist test * reset env properly * add time sleep when start each server * sleep for a while when boot server * replace wait_thread with callback * fix lint * add dglconnect handshake check Co-authored-by:Jinjing Zhou <VoVAllen@users.noreply.github.com>
-
- 10 Jan, 2022 1 commit
-
-
Quan (Andy) Gan authored
-
- 07 Jan, 2022 1 commit
-
-
Quan (Andy) Gan authored
* first commit * a bunch of fixes * add unique * lint * lint * lint * address comments * Update negative_sampler.py * fix * description * address comments and fix * fix * replace unique with replace * test pylint * Update negative_sampler.py
-
- 04 Jan, 2022 1 commit
-
-
Quan (Andy) Gan authored
* support shared memory on windows * Update shared_mem.cc
-
- 19 Dec, 2021 1 commit
-
-
hirayaku authored
* fix CopyVectorToNDArray * Fix lint Co-authored-by:Jinjing Zhou <VoVAllen@users.noreply.github.com>
-
- 16 Dec, 2021 1 commit
-
-
Israt Nisa authored
[Feature] Add CUDA support for `min` and `max` reducer in heterogeneous API for unary message functions (#3566) * CUDA support max/min reducer on forward pass * docstring * concised UpdateGradMinMax_hetero * reorganized UpdateGradMinMax_hetero * CUDA kernels for max/min reducer * variable name * lint check * changed CUDA 2D thread mapping to 1D * removed legacy cusparse for min/max reducer * git CI issue * restarting git CI * adding namespace std Co-authored-by:
Israt Nisa <nisisrat@amazon.com> Co-authored-by:
Quan (Andy) Gan <coin2028@hotmail.com>
-
- 15 Dec, 2021 2 commits
-
-
lixiaobai authored
* Feat: support API "randomwalk_topk" in library * Feat: use the new API "randomwalk_topk" for PinSAGESampler * Minor * Minor * Refactor: modified codes as checker required * Minor * Minor * Minor * Minor * Fix: checking errors in RandomWalkTopk * Refactor: modified the docstring for randomwalk_topk * change randomwalk_topk to internal * fix * rename * Minor for pinsage.py * Feat: support randomwalk and SelectPinSageNeighbors on GPU Port RandomWalk algorithm on GPU, and port SelectPinSageNeighbors on GPU. * Feat: support GPU on python APIs * Feat: remove perf print information in FrequenchHashmap * Fix: modified the code format Modified the code format as task_lint.sh suggested * Feat: let test script support PinSAGESampler on GPU Let test script support PinSAGESampler on GPU, minor of "restart_prob". * Minor * Minor * Minor * Refactor: use the atomic operations from the array module * Minor: change the long lines * Refactor: modified the get_node_types for gpu * Feat: update the contributor date * Perf: remove unnecessary stream sync * Feat: support other random walk But the non-uniform choice is still not supported. * Fix: add CUDA switch for random walk Co-authored-by:Quan Gan <coin2028@hotmail.com>
-
Vasimuddin Md authored
* added distgnn plus libra codebase * Dist application codes * added comments in partition code. changed the interface of partitioning call. * updated readme * create libra partitioning branch for the PR * removed disgnn files for first PR * updated kernel.cc * added libra_partition.cc and moved libra code from kernel.cc to libra_partition.cc * fixed lint error; merged libra2dgl.py and main_Libra.py to libra_partition.py; added graphsage/distgnn folder and partition script. * removed libra2dgl.py * fixed the lint error and cleaned the code. * revisions due to PR comments. added distgnn/tools contains partitions routines * update 2 PR revision I * fixed errors; also improved the runtime by 10x. * fixed minor lint error * fixed some more lints * PR revision II changed the interface of libra partition function * rewrite docstring Co-authored-by:Quan (Andy) Gan <coin2028@hotmail.com>
-
- 08 Dec, 2021 1 commit
-
-
Tianqi Zhang (张天启) authored
* fix setdevice issue * change to curand device API Co-authored-by:Quan (Andy) Gan <coin2028@hotmail.com>
-
- 06 Dec, 2021 2 commits
-
-
Jinjing Zhou authored
* doesn't know whether works * add change * fix * fix * fix * remove * revert * lint * lint * fix * revert * lint * fix * only build rpc on linux * lint * lint * fix build on windows * fix windows * remove old test * fix cmake * Revert "remove old test" This reverts commit f1ea75c777c34cdc1f08c0589676ba6aee1feb29. * fix windows * fix * fix * fix indent * fix indent * address comment * fix * fix * fix * fix * fix * lint * fix indent * fix lint * add introduction * fix * lint * lint * add more logs * fix * update xbyak for C++14 with gcc5 * Remove channels * fix * add test script * fix * remove unused file * fix lint * add timeout
-
Quan (Andy) Gan authored
* first commit * second commit * spaghetti unit tests * rewrite test
-
- 03 Dec, 2021 1 commit
-
-
Israt Nisa authored
* min/max support for forward CPU heterograph * Added etype with each argU values * scatter_add needs fix * added scatter_add_hetero. Grads dont match for max reducer * storing ntype in argX * fixing scatter_add_hetero * hetero matches with torch's scatter add * works copy_e forward+cpu * added backward for copy_rhs * Computes gradient for all node types in one kernel * bug fix * unnitest for max/min on CPU * renamed scatter_add_hetero to update_grad_minmax_hetero * lint check and comment out cuda call for max. Code is for CPU only * lint check * replace inf with zero * minor * lint check * removed LIBXSMM code from hetro code * fixing backward operator of UpdateGradMinMaxHetero * removed backward from update_grad_minmax_hetero * docstring * improved docstring and coding style * Added pass by pointer for output * typos and pass by references * Support for copy_rhs * Added header <string> * fix bug in copy_u_max * Added comments and dimension check of all etypes * skip mxnet check * pass by pointer output arrays * updated docstring Co-authored-by:
Israt Nisa <nisisrat@amazon.com> Co-authored-by:
Quan (Andy) Gan <coin2028@hotmail.com>
-
- 30 Nov, 2021 1 commit
-
-
ayasar70 authored
* Based on issue #3436. Improving _SegmentCopyKernel s GPU utilization by switching to nonzero based thread assignment * fixing lint issues * Update cub for cuda 11.5 compatibility (#3468) * fixing type mismatch * tx guaranteed to be smaller than nnz. Hence removing last check * minor: updating comment * adding three unit tests for csr slice method to cover some corner cases * working on repeat * updating repeat kernel * removing unnecessary parameter * cleaning commented line * cleaning time measures * cleaning time measurement lines Co-authored-by:
Abdurrahman Yasar <ayasar@nvidia.com> Co-authored-by:
nv-dlasalle <63612878+nv-dlasalle@users.noreply.github.com> Co-authored-by:
Jinjing Zhou <VoVAllen@users.noreply.github.com>
-
- 29 Nov, 2021 1 commit
-
-
lixiaobai authored
* Feat: support API "randomwalk_topk" in library * Feat: use the new API "randomwalk_topk" for PinSAGESampler * Minor * Minor * Refactor: modified codes as checker required * Minor * Minor * Minor * Minor * Fix: checking errors in RandomWalkTopk * Refactor: modified the docstring for randomwalk_topk * change randomwalk_topk to internal * fix * rename * Minor for pinsage.py Co-authored-by:Quan Gan <coin2028@hotmail.com>
-
- 17 Nov, 2021 1 commit
-
-
Israt Nisa authored
* Added SDDMMCOO_hetero support * removed redundant CUDA kernels * added benchmark for regression test * fix * fixed bug for single src node type Co-authored-by:
Israt Nisa <nisisrat@amazon.com> Co-authored-by:
Quan (Andy) Gan <coin2028@hotmail.com>
-
- 15 Nov, 2021 1 commit
-
-
Eric Kim authored
-
- 10 Nov, 2021 1 commit
-
-
Rhett Ying authored
* [BugFix] fix in/out degree computation * add unit tests
-
- 06 Nov, 2021 1 commit
-
-
ayasar70 authored
* Based on issue #3436. Improving _SegmentCopyKernel s GPU utilization by switching to nonzero based thread assignment * fixing lint issues * Update cub for cuda 11.5 compatibility (#3468) * fixing type mismatch * tx guaranteed to be smaller than nnz. Hence removing last check * minor: updating comment * adding three unit tests for csr slice method to cover some corner cases Co-authored-by:
Abdurrahman Yasar <ayasar@nvidia.com> Co-authored-by:
nv-dlasalle <63612878+nv-dlasalle@users.noreply.github.com> Co-authored-by:
Jinjing Zhou <VoVAllen@users.noreply.github.com>
-
- 04 Nov, 2021 2 commits
-
-
Xin Yao authored
* enable unit tests for dataloader on the GPU * fix compatibility * copyright * fix linting Co-authored-by:nv-dlasalle <63612878+nv-dlasalle@users.noreply.github.com>
-
Xin Yao authored
* relabel gpu * unittest for ralebl_ on the GPU * finish Relabel_ for the GPU * copyright * re-enable the unittest for edge_subgrah on the GPU * fix unittest for tensorflow * use a fixed number of threads Co-authored-by:
Jinjing Zhou <VoVAllen@users.noreply.github.com> Co-authored-by:
nv-dlasalle <63612878+nv-dlasalle@users.noreply.github.com> Co-authored-by:
Quan (Andy) Gan <coin2028@hotmail.com>
-
- 03 Nov, 2021 1 commit
-
-
nv-dlasalle authored
-
- 21 Oct, 2021 1 commit
-
-
Xin Yao authored
* gpu compact graph template * cuda compact graph draft * fix typo * compact graphs * pass unit test but fail in training * example using EdgeDataLoader on the GPU * refactor cuda_compact_graph and cuda_to_block * update training scripts * fix linting * fix linting * fix exclude_edges for the GPU * add --data-cpu & fix copyright
-
- 18 Oct, 2021 2 commits
-
-
nv-dlasalle authored
-
David Min authored
* parallelize CSRRowSlice() * use parallel_for for the second loop Co-authored-by:
nv-dlasalle <63612878+nv-dlasalle@users.noreply.github.com> Co-authored-by:
Jinjing Zhou <VoVAllen@users.noreply.github.com>
-
- 15 Oct, 2021 1 commit
-
-
David Min authored
* Add pytorch-direct version * remove * add documentation for UnifiedTensor * Revert "add documentation for UnifiedTensor" This reverts commit 63ba42644d4aba197c1cb4ea4b85fa1bc43b8849. * add boundary check for UVM IndexSelect * relocate boundary check index kernels to cuda * fix function name * fix indexkernel in nccl api * fix argument ordering * simplify code * Add a comment for the uvm version Co-authored-by:
shhssdm <shhssdm@gmail.com> Co-authored-by:
xiang song(charlie.song) <classicxsong@gmail.com> Co-authored-by:
Minjie Wang <wmjlyjemaine@gmail.com>
-
- 14 Oct, 2021 1 commit
-
-
zexi yuan authored
* [Bugfix] fix a compile error for Debug-BuildType on Windows Platform When using CMakeLists.txt to build the "Debug" BuildType on the Windows Platform, it has three compile errors (C4716) in the file "dgl\src\runtime\shared_mem.cc": 'dgl::runtime::SharedMemory::CreateNew': must return a value 'dgl::runtime::SharedMemory::Open': must return a value 'dgl::runtime::SharedMemory::Exist': must return a value * [Bugfix] cmake error "cannot find load file" when DGL as a sub_directory on Linux When using DGL as a subdirectory in a CMake Project, the "CMAKE_SOURCE_DIR" here will return the parent cmake scope dir, which is not a expected dir. Maybe it is better to use "CMAKE_CURRENT_SOURCE_DIR" to set "GKLIB_PATH". * [Bugfix] cmd cmake error when DGL as a subdirectory When DGL as a subdirectory of another project, the WORKING_DIRECTORY of "add_custom_command" will be incorrect at the line 255 of "CMakeLists.txt", such that making a cmake "setlocal" error.
-
- 12 Oct, 2021 1 commit
-
-
Rhett Ying authored
-
- 29 Sep, 2021 1 commit
-
-
Rhett Ying authored
* [Feature] enable create/set/free cuda stream for internal use * add unit test * fix unit test failure on mxnet and tf * refactor stream wrapper * fix lint error * fix lint error
-
- 28 Sep, 2021 1 commit
-
-
Jingcheng Yu authored
Co-authored-by:JingchengYu94 <jingchengyu94@gmail.com>
-
- 22 Sep, 2021 1 commit
-
-
Quan (Andy) Gan authored
* graceful c++ exception in OpenMP * credits * add test Co-authored-by:Jinjing Zhou <VoVAllen@users.noreply.github.com>
-
- 21 Sep, 2021 1 commit
-
-
mszarma authored
* [Feature] Exclude edges in sample_neighbors Extending sample_neighbors and sample_frontier API to support exclude_edges parameter. exclude_edges support tensor and dict data Feature enable excluding certain edges during neighborhood sampling Exclude_edges contains EID's of edges which will be excluded during neighbor picking for seed nodes. Added test case for heterograph and homograph RFC issue id: 2944 * compatibility * fix * fix Co-authored-by:Quan Gan <coin2028@hotmail.com>
-
- 17 Sep, 2021 1 commit
-
-
Rhett Ying authored
-
- 16 Sep, 2021 1 commit
-
-
nv-dlasalle authored
[Performance][Feature] Add `src_nodes` paramter to `to_block()` to avoid cost running unique() when available. (#2973) * Add lhs_nodes are paremeter to to_block * Update unit test * Switch to simplified node conversion * Switch lhs_nodes to be in/out parameter * Update docs Co-authored-by:
Da Zheng <zhengda1936@gmail.com> Co-authored-by:
Jinjing Zhou <VoVAllen@users.noreply.github.com> Co-authored-by:
Quan (Andy) Gan <coin2028@hotmail.com> Co-authored-by:
Minjie Wang <wmjlyjemaine@gmail.com>
-
- 14 Sep, 2021 1 commit
-
-
Rhett Ying authored
* [Performance] improve coo2csr space complexity when row is not sorted * [Perf] replace std::vector<> by NDArray * keep both impl of unsorted coo to csr and choose according to graph density dynamically * refine criteria to choose btw Unsorted algos Co-authored-by:Ubuntu <ubuntu@ip-172-31-34-27.us-west-2.compute.internal>
-