- 15 Nov, 2023 1 commit
-
-
Rhett Ying authored
-
- 12 Sep, 2023 1 commit
-
-
czkkkkkk authored
-
- 01 Sep, 2023 1 commit
-
-
Hugo MacDermott-Opeskin authored
Co-authored-by:Hongzhi (Steve), Chen <chenhongzhi.nkcs@gmail.com>
-
- 15 Aug, 2023 1 commit
-
-
Hongzhi (Steve), Chen authored
Co-authored-by:Ubuntu <ubuntu@ip-172-31-28-63.ap-northeast-1.compute.internal>
-
- 14 Aug, 2023 1 commit
-
-
Muhammed Fatih BALIN authored
-
- 07 Aug, 2023 2 commits
-
-
Hongzhi (Steve), Chen authored
Co-authored-by:Ubuntu <ubuntu@ip-172-31-28-63.ap-northeast-1.compute.internal>
-
Hongzhi (Steve), Chen authored
Co-authored-by:Ubuntu <ubuntu@ip-172-31-28-63.ap-northeast-1.compute.internal>
-
- 03 Aug, 2023 1 commit
-
-
Hongzhi (Steve), Chen authored
Co-authored-by:Ubuntu <ubuntu@ip-172-31-28-63.ap-northeast-1.compute.internal>
-
- 02 Aug, 2023 2 commits
-
-
Hongzhi (Steve), Chen authored
Co-authored-by:Ubuntu <ubuntu@ip-172-31-28-63.ap-northeast-1.compute.internal>
-
Hongzhi (Steve), Chen authored
Co-authored-by:Ubuntu <ubuntu@ip-172-31-28-63.ap-northeast-1.compute.internal>
-
- 01 Aug, 2023 1 commit
-
-
Muhammed Fatih BALIN authored
-
- 24 Jul, 2023 1 commit
-
-
Muhammed Fatih BALIN authored
Co-authored-by:xiny <xiny@nvidia.com>
-
- 02 Jun, 2023 1 commit
-
-
Hongzhi (Steve), Chen authored
Co-authored-by:Ubuntu <ubuntu@ip-172-31-28-63.ap-northeast-1.compute.internal>
-
- 17 Apr, 2023 1 commit
-
-
Xin Yao authored
-
- 22 Mar, 2023 1 commit
-
-
Xin Yao authored
* cleanup unused cmake options * disable BUILD_TORCH for cugraph * resolve comments
-
- 08 Mar, 2023 1 commit
-
-
Xin Yao authored
* expose GeneratePermutation * add sparse_all_to_all_push * add sparse_all_to_all_pull * add unit test * handle world_size=1 * remove python nccl wrapper * remove the nccl dependency * use pinned memory to speedup D2H copy * fix lint * resolve comments * fix lint * fix ut * resolve comments
-
- 05 Jan, 2023 1 commit
-
-
Xin Yao authored
-
- 19 Nov, 2022 1 commit
-
-
Xin Yao authored
* Update CUDA.cmake to align with PyTorch's * add Ada and Hopper * add more comments * resolve comments Co-authored-by:Triston <triston.cao@gmail.com>
-
- 17 Nov, 2022 1 commit
-
-
czkkkkkk authored
-
- 15 Dec, 2021 1 commit
-
-
lixiaobai authored
* Feat: support API "randomwalk_topk" in library * Feat: use the new API "randomwalk_topk" for PinSAGESampler * Minor * Minor * Refactor: modified codes as checker required * Minor * Minor * Minor * Minor * Fix: checking errors in RandomWalkTopk * Refactor: modified the docstring for randomwalk_topk * change randomwalk_topk to internal * fix * rename * Minor for pinsage.py * Feat: support randomwalk and SelectPinSageNeighbors on GPU Port RandomWalk algorithm on GPU, and port SelectPinSageNeighbors on GPU. * Feat: support GPU on python APIs * Feat: remove perf print information in FrequenchHashmap * Fix: modified the code format Modified the code format as task_lint.sh suggested * Feat: let test script support PinSAGESampler on GPU Let test script support PinSAGESampler on GPU, minor of "restart_prob". * Minor * Minor * Minor * Refactor: use the atomic operations from the array module * Minor: change the long lines * Refactor: modified the get_node_types for gpu * Feat: update the contributor date * Perf: remove unnecessary stream sync * Feat: support other random walk But the non-uniform choice is still not supported. * Fix: add CUDA switch for random walk Co-authored-by:Quan Gan <coin2028@hotmail.com>
-
- 08 Nov, 2021 1 commit
-
-
Hongyu Cai authored
Co-authored-by:Minjie Wang <wmjlyjemaine@gmail.com>
-
- 16 Jul, 2021 1 commit
-
-
David Min authored
[Feature][Performance][GPU] Introducing UnifiedTensor for efficient zero-copy host memory access from GPU (#3086) * Add pytorch-direct version * Initial commit of unified tensor * Merge branch 'master' of https://github.com/davidmin7/dgl * Remove unnecessary things * Fix error message * Fix/Add descriptions * whitespace fix * add unpin * disable IndexSelectCPUFromGPU with no CUDA * add a newline for unified_tensor.py * Apply changes based on feedback * add 'os' module * skip unified tensor unit test for cpu only * Update tests/pytorch/test_unified_tensor.py Co-authored-by:
xiang song(charlie.song) <classicxsong@gmail.com> * reflect feedback Co-authored-by:
shhssdm <shhssdm@gmail.com> Co-authored-by:
Jinjing Zhou <VoVAllen@users.noreply.github.com> Co-authored-by:
xiang song(charlie.song) <classicxsong@gmail.com>
-
- 16 Jun, 2021 1 commit
-
-
nv-dlasalle authored
* Update cmake to build Ampere * Fix version check
-
- 25 May, 2021 1 commit
-
-
nv-dlasalle authored
* Add NCCL as a submodule * Allow using third_party/nccl or system nccl * Add nccl_external as a dependency * Fix conditional Co-authored-by:Quan (Andy) Gan <coin2028@hotmail.com>
-
- 20 May, 2021 1 commit
-
-
nv-dlasalle authored
[Feature][Performance] Implement NCCL wrapper for communicating NodeEmbeddings and sparse gradients. (#2825) * Split NCCL wrapper from sparse optimizer and sparse embedding * Add more unit tests for single node nccl * Fix unit test for tf * Switch to device histogram * Fix histgram issues * Finish migration to histogram * Handle cases with zero send/recieve data * Start on partition object * Get compiling * Updates * Add unit tests * Switch to partition object * Fix linting issues * Rename partition file * Add python doc * Fix python assert and finish doxygen comments * Remove stubs for range based partition to satisfy pylint * Wrap unit test in GPU only * Wrap explicit cuda call in ifdef * Merge with partition.py * update docstrings * Cleanup partition_op * Add Workspace object * Switch to using workspace object * Move last remainder based function out of nccl_api * Add error messages * Update docs with examples * Fix linting erros Co-authored-by:xiang song(charlie.song) <classicxsong@gmail.com>
-
- 09 Mar, 2021 1 commit
-
-
Tianqi Zhang (张天启) authored
* finish graph matching gpu version * use C++ shuffle * finish graph matching * fix bug * fix bug * change name and use swap * upt * fix format problem * fix format problem * stronger test * upt * upt * change python api * upt * upt * format check * upt * upt * fix bug Co-authored-by:Tong He <hetong007@gmail.com>
-
- 08 Feb, 2021 1 commit
-
-
nv-dlasalle authored
* Add start of to_block gpu implementation * Pull in more changes from 0.4.2 cuda_to_block * Move more code to IdArray * Refactor DeviceNodeMapMaker * Updates * get compiling * Integrate to_block * Fix ID allocation * Minor fixes * Cleanup cuda calls to use cuda_common * Reduce kernel calls * Lint cleanup * Expand documentation * Remove unused function * Rename variables for consistency * Add doxygen comments * Fix file extension * Remove raw asynccopy for deviceapi * Remove unused function * Fix block/tile configuration * Add cuda_device_common.cuh * Add basic hashtable * Migrate part of hashtable * Refactor to use external hashtable * Make functions members * Format hash table functions * Migrate duplicate filling * Move last function over * Refactor with cu file * lint c++ code * Move context check to C++ code * Use macro switch * Add missing files * Update docstring * update docs * Move atomic functions * Refactor hashtable * Fix linting * Expand docs * Fix mismatched argument names * Switch doxygen comments from using @param to \param Co-authored-by:
Jinjing Zhou <VoVAllen@users.noreply.github.com> Co-authored-by:
Minjie Wang <wmjlyjemaine@gmail.com>
-
- 28 Jan, 2021 1 commit
-
-
Zihao Ye authored
* add tvm as submodule * compilation is ok but calling fails * can call now * pack multiple modules, change names * upd * upd * upd * fix cmake * upd * upd * upd * upd * fix * relative path * upd * upd * upd * singleton * upd * trigger * fix * upd * count reducible * upd * upd * upd * upd * upd * upd * upd * upd * upd * only keep related files * upd * upd * upd * upd * lint * lint * lint * lint * pylint * upd * upd * compilation * fix * upd * upd * upd * upd * upd * upd * upd doc * refactor * fix * upd number Co-authored-by:
Zhi Lin <linzhilynn@gmail.com> Co-authored-by:
Ubuntu <ubuntu@ip-172-31-42-78.us-east-2.compute.internal> Co-authored-by:
Ubuntu <ubuntu@ip-172-31-21-156.us-east-2.compute.internal> Co-authored-by:
Jinjing Zhou <VoVAllen@users.noreply.github.com>
-
- 31 Dec, 2020 1 commit
-
-
Zhi Lin authored
Co-authored-by:Zihao Ye <expye@outlook.com>
-
- 25 Dec, 2020 1 commit
-
-
Quan (Andy) Gan authored
* first commit * some thoughts * move around * more commit * more fixes * now it uses torch allocator * fix symbol export error * fix * fixes * test fix * add script * building separate library per version * fix for vs2019 * more fixes * fix on windows build * update jenkinsfile * auto copy built dlls for windows * lint and installation guide update * fix * specify conda environment * set environment for ci * fix * fix * fix * fix again * revert * fix cmake * fix * switch to using python interpreter path * remove scripts * debug * oops sorry * Update index.rst * Update index.rst * copies automatically, no need for this * do not print message if library not found * tiny fixes * debug on nightly * replace add_compile_definitions to make CMake 3.5 happy * fix linking to wrong lib for multiple pytorch envs * changed building strategy * fix nightly * fix windows * fix windows again * setup bugfix * address comments * change README
-
- 21 Dec, 2020 1 commit
-
-
Zihao Ye authored
-
- 17 Dec, 2020 1 commit
-
-
Zihao Ye authored
* upd cmake * upd * format
-
- 14 Nov, 2020 1 commit
-
-
Minjie Wang authored
-
- 13 Nov, 2020 1 commit
-
-
Quan (Andy) Gan authored
* multiple fixes * fix CI * fiddle * revert stubs * remove stubs * poke * remove linking of driver library * minor Co-authored-by:Minjie Wang <wmjlyjemaine@gmail.com>
-
- 07 Nov, 2020 1 commit
-
-
Minjie Wang authored
* add support for cuda 11 * fix inc bug in pytorch 1.8 * poke ci * fix * small fix * try fix * try fix Co-authored-by:
Jinjing Zhou <VoVAllen@users.noreply.github.com> Co-authored-by:
Quan (Andy) Gan <coin2028@hotmail.com>
-
- 27 Aug, 2020 1 commit
-
-
Zihao Ye authored
* upd * upd * upd * upd * upd * upd * upd * upd
-
- 21 Jun, 2020 1 commit
-
-
Tong He authored
* working framework without actual algorithm logic * rename * fix * fps passes compilation * correct algorithm * add cuda implementation * update random start * before refactor * pass compilation but cuda not working * working * code working, will add docstring * add mxnet support * update docstring * update doc and test * cpplint * cpcplint * pylint * temporary fix * fix for win64 * fix unitetest * fix * fix * remove comment * move to geometry package * remove redundant include * add docstrings and comments * add proof * add validity check
-
- 17 Jul, 2019 1 commit
-
-
Minjie Wang authored
* WIP: array refactoring * WIP: implementation * wip * most csr part * WIP: on coo * WIP: coo * finish refactoring immutable graph * compiled * fix undefined ndarray copy bug; add COOToCSR when coo has no data array * fix bug in COOToCSR * fix bug in CSR constructor * fix bug in in_edges(vid) * fix OutEdges bug * pass test_graph * pass test_graph * fix bug in CSR constructor * fix bug in CSR constructor * fix bug in CSR constructor * fix stupid bug * pass gpu test * remove debug printout * fix lint * rm biparate grpah * fix lint * address comments * fix bug in Clone * cpp utests
-
- 12 Jun, 2019 1 commit
-
-
Quan (Andy) Gan authored
* bump up version * conda+cuda trial * switch conda branch * revert * disable cudnn
-
- 10 Jun, 2019 1 commit
-
-
Quan (Andy) Gan authored
-