Commits · 19096c6a8e7f1fb6f97bd2b43d1e9bde80a7a47f · OpenDAS / dgl

15 Nov, 2023 1 commit
- [dev] enable cuda12.1 build (#6567) · 19096c6a
  Rhett Ying authored Nov 15, 2023
  
  19096c6a
12 Sep, 2023 1 commit
- [CMAKE] Move DGL cuda file declaration to the main CMakeLists.txt (#6300) · edcecdd0
  czkkkkkk authored Sep 12, 2023
  
  edcecdd0
01 Sep, 2023 1 commit
- [Build] Add CMake changes from conda-forge build (#6189) · 4a42027d
  Hugo MacDermott-Opeskin authored Sep 01, 2023
```
Co-authored-by: Hongzhi (Steve), Chen <chenhongzhi.nkcs@gmail.com>
```
  4a42027d
15 Aug, 2023 1 commit
- [Misc] Cleanup old flags, and rely on BUILD_TYPE for all features. (#6154) · 34641092
  Hongzhi (Steve), Chen authored Aug 15, 2023
```
Co-authored-by: Ubuntu <ubuntu@ip-172-31-28-63.ap-northeast-1.compute.internal>
```
  34641092
14 Aug, 2023 1 commit
- [Dev] Change CXX standard to 17 (#6138) · f0d8ca1e
  Muhammed Fatih BALIN authored Aug 13, 2023
  
  f0d8ca1e
07 Aug, 2023 2 commits
- [Misc] Add comment to clarify __dgl_option. (#6106) · cff938c6
  Hongzhi (Steve), Chen authored Aug 07, 2023
```
Co-authored-by: Ubuntu <ubuntu@ip-172-31-28-63.ap-northeast-1.compute.internal>
```
  cff938c6
- [Misc] Support build option: "all". (#6102) · f7fef600
  Hongzhi (Steve), Chen authored Aug 07, 2023
```
Co-authored-by: Ubuntu <ubuntu@ip-172-31-28-63.ap-northeast-1.compute.internal>
```
  f7fef600
03 Aug, 2023 1 commit
- [Misc] Support DGL feature option. (#6088) · d7410cf4
  Hongzhi (Steve), Chen authored Aug 03, 2023
```
Co-authored-by: Ubuntu <ubuntu@ip-172-31-28-63.ap-northeast-1.compute.internal>
```
  d7410cf4
02 Aug, 2023 2 commits
- [Misc] Cleanup unused cmake util. (#6084) · 12ade95c
  Hongzhi (Steve), Chen authored Aug 02, 2023
```
Co-authored-by: Ubuntu <ubuntu@ip-172-31-28-63.ap-northeast-1.compute.internal>
```
  12ade95c
- [Misc] Cleanup duplicated flags. (#6081) · ffd8edeb
  Hongzhi (Steve), Chen authored Aug 02, 2023
```
Co-authored-by: Ubuntu <ubuntu@ip-172-31-28-63.ap-northeast-1.compute.internal>
```
  ffd8edeb
01 Aug, 2023 1 commit
- [Dev] Resolve compile issue with gcc 11.3.0 (#6072) · 224d6a69
  Muhammed Fatih BALIN authored Aug 01, 2023
  
  224d6a69
24 Jul, 2023 1 commit
- [Feature] Gpu cache for node and edge data (#4341) · 69a532c1
  Muhammed Fatih BALIN authored Jul 24, 2023
```
Co-authored-by: xiny <xiny@nvidia.com>
```
  69a532c1
02 Jun, 2023 1 commit
- [Cleanup] Remove featgraph and unused TVM dependency. (#5767) · 9ff56d20
  Hongzhi (Steve), Chen authored Jun 02, 2023
```
Co-authored-by: Ubuntu <ubuntu@ip-172-31-28-63.ap-northeast-1.compute.internal>
```
  9ff56d20
17 Apr, 2023 1 commit
- [Fix] Remove curand host functions (#5552) · bea5c78b
  Xin Yao authored Apr 17, 2023
  
  bea5c78b
22 Mar, 2023 1 commit
- [Cleanup] Cleanup unused CMake options (#5470) · acb955e1
  Xin Yao authored Mar 22, 2023
```
* cleanup unused cmake options

* disable BUILD_TORCH for cugraph

* resolve comments
```
  acb955e1
08 Mar, 2023 1 commit

[Refactor] Replace third_party/nccl with PyTorch's NCCL backend (#4989) · 8d5d8962

Xin Yao authored Mar 08, 2023

* expose GeneratePermutation

* add sparse_all_to_all_push

* add sparse_all_to_all_pull

* add unit test

* handle world_size=1

* remove python nccl wrapper

* remove the nccl dependency

* use pinned memory to speedup D2H copy

* fix lint

* resolve comments

* fix lint

* fix ut

* resolve comments

8d5d8962

05 Jan, 2023 1 commit
- update cmake for cuda12 (#5048) · 7ee550f0
  Xin Yao authored Jan 05, 2023
  
  7ee550f0
19 Nov, 2022 1 commit

[Makefile] Refactor CUDA makefile and add Hopper (SM90) to default build (#4830) · 65b34702

Xin Yao authored Nov 20, 2022



* Update CUDA.cmake to align with PyTorch's

* add Ada and Hopper

* add more comments

* resolve comments
Co-authored-by: Triston <triston.cao@gmail.com>

65b34702

17 Nov, 2022 1 commit
- [Sparse] Link to DGL (#4877) · 06438d70
  czkkkkkk authored Nov 17, 2022
  
  06438d70
15 Dec, 2021 1 commit

[PinSAGESampler] support PinSAGE sampler on GPU (#3567) · dd762a1e

lixiaobai authored Dec 15, 2021



* Feat: support API "randomwalk_topk" in library

* Feat: use the new API "randomwalk_topk" for PinSAGESampler

* Minor

* Minor

* Refactor: modified codes as checker required

* Minor

* Minor

* Minor

* Minor

* Fix: checking errors in RandomWalkTopk

* Refactor: modified the docstring for randomwalk_topk

* change randomwalk_topk to internal

* fix

* rename

* Minor for pinsage.py

* Feat: support randomwalk and SelectPinSageNeighbors on GPU

Port RandomWalk algorithm on GPU,
and port SelectPinSageNeighbors on GPU.

* Feat: support GPU on python APIs

* Feat: remove perf print information in FrequenchHashmap

* Fix: modified the code format

Modified the code format as task_lint.sh suggested

* Feat: let test script support PinSAGESampler on GPU

Let test script support PinSAGESampler on GPU,
minor of "restart_prob".

* Minor

* Minor

* Minor

* Refactor: use the atomic operations from the array module

* Minor: change the long lines

* Refactor: modified the get_node_types for gpu

* Feat: update the contributor date

* Perf: remove unnecessary stream sync

* Feat: support other random walk

But the non-uniform choice is still not supported.

* Fix: add CUDA switch for random walk
Co-authored-by: Quan Gan <coin2028@hotmail.com>

dd762a1e

08 Nov, 2021 1 commit
- [Doc] Fix type in CUDA.cmake (#3479) · 9c41e97c
  Hongyu Cai authored Nov 08, 2021
```
Co-authored-by: Minjie Wang <wmjlyjemaine@gmail.com>
```
  9c41e97c
16 Jul, 2021 1 commit

[Feature][Performance][GPU] Introducing UnifiedTensor for efficient zero-copy... · 905c0aa5

David Min authored Jul 17, 2021

[Feature][Performance][GPU] Introducing UnifiedTensor for efficient zero-copy host memory access from GPU (#3086)

* Add pytorch-direct version

* Initial commit of unified tensor

* Merge branch 'master' of https://github.com/davidmin7/dgl



* Remove unnecessary things

* Fix error message

* Fix/Add descriptions

* whitespace fix

* add unpin

* disable IndexSelectCPUFromGPU with no CUDA

* add a newline for unified_tensor.py

* Apply changes based on feedback

* add 'os' module

* skip unified tensor unit test for cpu only

* Update tests/pytorch/test_unified_tensor.py
Co-authored-by: xiang song(charlie.song) <classicxsong@gmail.com>

* reflect feedback
Co-authored-by: shhssdm <shhssdm@gmail.com>
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>
Co-authored-by: xiang song(charlie.song) <classicxsong@gmail.com>

905c0aa5

16 Jun, 2021 1 commit
- Add Ampere support to cmake files (#3031) · 7d069d62
  nv-dlasalle authored Jun 16, 2021
```
* Update cmake to build Ampere

* Fix version check
```
  7d069d62
25 May, 2021 1 commit

[Bugfix] Include NCCL as a submodule (#2934) · 66eb240d

nv-dlasalle authored May 24, 2021



* Add NCCL as a submodule

* Allow using third_party/nccl or system nccl

* Add nccl_external as a dependency

* Fix conditional
Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>

66eb240d

20 May, 2021 1 commit

[Feature][Performance] Implement NCCL wrapper for communicating NodeEmbeddings... · ae8dbe6d

nv-dlasalle authored May 20, 2021


[Feature][Performance] Implement NCCL wrapper for communicating NodeEmbeddings and sparse gradients. (#2825)

* Split NCCL wrapper from sparse optimizer and sparse embedding

* Add more unit tests for single node nccl

* Fix unit test for tf

* Switch to device histogram

* Fix histgram issues

* Finish migration to histogram

* Handle cases with zero send/recieve data

* Start on partition object

* Get compiling

* Updates

* Add unit tests

* Switch to partition object

* Fix linting issues

* Rename partition file

* Add python doc

* Fix python assert and finish doxygen comments

* Remove stubs for range based partition to satisfy pylint

* Wrap unit test in GPU only

* Wrap explicit cuda call in ifdef

* Merge with partition.py

* update docstrings

* Cleanup partition_op

* Add Workspace object

* Switch to using workspace object

* Move last remainder based function out of nccl_api

* Add error messages

* Update docs with examples

* Fix linting erros
Co-authored-by: xiang song(charlie.song) <classicxsong@gmail.com>

ae8dbe6d

09 Mar, 2021 1 commit

[Feature] Add edge coarsening for homogeneous undirected graphs (#2691) · c88fca50

Tianqi Zhang (张天启) authored Mar 09, 2021



* finish graph matching gpu version

* use C++ shuffle

* finish graph matching

* fix bug

* fix bug

* change name and use swap

* upt

* fix format problem

* fix format problem

* stronger test

* upt

* upt

* change python api

* upt

* upt

* format check

* upt

* upt

* fix bug
Co-authored-by: Tong He <hetong007@gmail.com>

c88fca50

08 Feb, 2021 1 commit

[Sampling] Implement `dgl.to_block()` for the GPU (#2339) · bc3a532f

nv-dlasalle authored Feb 07, 2021



* Add start of to_block gpu implementation

* Pull in more changes from 0.4.2 cuda_to_block

* Move more code to IdArray

* Refactor DeviceNodeMapMaker

* Updates

* get compiling

* Integrate to_block

* Fix ID allocation

* Minor fixes

* Cleanup cuda calls to use cuda_common

* Reduce kernel calls

* Lint cleanup

* Expand documentation

* Remove unused function

* Rename variables for consistency

* Add doxygen comments

* Fix file extension

* Remove raw asynccopy for deviceapi

* Remove unused function

* Fix block/tile configuration

* Add cuda_device_common.cuh

* Add basic hashtable

* Migrate part of hashtable

* Refactor to use external hashtable

* Make functions members

* Format hash table functions

* Migrate duplicate filling

* Move last function over

* Refactor with cu file

* lint c++ code

* Move context check to C++ code

* Use macro switch

* Add missing files

* Update docstring

* update docs

* Move atomic functions

* Refactor hashtable

* Fix linting

* Expand docs

* Fix mismatched argument names

* Switch doxygen comments from using @param to \param
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>
Co-authored-by: Minjie Wang <wmjlyjemaine@gmail.com>

bc3a532f

28 Jan, 2021 1 commit

[feature] Supporting half precision floating data type (fp16). (#2552) · 7bab1365

Zihao Ye authored Jan 28, 2021



* add tvm as submodule

* compilation is ok but calling fails

* can call now

* pack multiple modules, change names

* upd

* upd

* upd

* fix cmake

* upd

* upd

* upd

* upd

* fix

* relative path

* upd

* upd

* upd

* singleton

* upd

* trigger

* fix

* upd

* count reducible

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* only keep related files

* upd

* upd

* upd

* upd

* lint

* lint

* lint

* lint

* pylint

* upd

* upd

* compilation

* fix

* upd

* upd

* upd

* upd

* upd

* upd

* upd doc

* refactor

* fix

* upd number
Co-authored-by: Zhi Lin <linzhilynn@gmail.com>
Co-authored-by: Ubuntu <ubuntu@ip-172-31-42-78.us-east-2.compute.internal>
Co-authored-by: Ubuntu <ubuntu@ip-172-31-21-156.us-east-2.compute.internal>
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>

7bab1365

31 Dec, 2020 1 commit
- [Feature] Tvm integration (#2367) · 4208ce2b
  Zhi Lin authored Dec 31, 2020
```
Co-authored-by: Zihao Ye <expye@outlook.com>
```
  4208ce2b
25 Dec, 2020 1 commit

[Performance] Use allocator from PyTorch if possible (#2328) · 9a7235fa

Quan (Andy) Gan authored Dec 25, 2020

* first commit

* some thoughts

* move around

* more commit

* more fixes

* now it uses torch allocator

* fix symbol export error

* fix

* fixes

* test fix

* add script

* building separate library per version

* fix for vs2019

* more fixes

* fix on windows build

* update jenkinsfile

* auto copy built dlls for windows

* lint and installation guide update

* fix

* specify conda environment

* set environment for ci

* fix

* fix

* fix

* fix again

* revert

* fix cmake

* fix

* switch to using python interpreter path

* remove scripts

* debug

* oops sorry

* Update index.rst

* Update index.rst

* copies automatically, no need for this

* do not print message if library not found

* tiny fixes

* debug on nightly

* replace add_compile_definitions to make CMake 3.5 happy

* fix linking to wrong lib for multiple pytorch envs

* changed building strategy

* fix nightly

* fix windows

* fix windows again

* setup bugfix

* address comments

* change README

9a7235fa

21 Dec, 2020 1 commit
- [hotfix] Enable AVX optimization by default. (#2438) · 5d3da4bc
  Zihao Ye authored Dec 21, 2020
  
  5d3da4bc
17 Dec, 2020 1 commit
- [hotfix] Make USE_AVX a flag in cmake to avoid compilation error for arm user (#2428) · e379e525
  Zihao Ye authored Dec 17, 2020
```
* upd cmake

* upd

* format
```
  e379e525
14 Nov, 2020 1 commit
- [Build] use different flags for NVCC and CC (#2342) · 77968e30
  Minjie Wang authored Nov 14, 2020
  
  77968e30
13 Nov, 2020 1 commit

[Bug] Multiple fixes for CUDA 11 support (#2333) · 501b2b75

Quan (Andy) Gan authored Nov 13, 2020



* multiple fixes

* fix CI

* fiddle

* revert stubs

* remove stubs

* poke

* remove linking of driver library

* minor
Co-authored-by: Minjie Wang <wmjlyjemaine@gmail.com>

501b2b75

07 Nov, 2020 1 commit

[CUDA] Add CUDA11 support (#2308) · 4fb0241b

Minjie Wang authored Nov 07, 2020



* add support for cuda 11

* fix inc bug in pytorch 1.8

* poke ci

* fix

* small fix

* try fix

* try fix
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>
Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>

4fb0241b

27 Aug, 2020 1 commit
- [Feature] Use new cusparse API to support CUDA 11. (#1979) · 5cff2f1c
  Zihao Ye authored Aug 27, 2020
```
* upd

* upd

* upd

* upd

* upd

* upd

* upd

* upd
```
  5cff2f1c
21 Jun, 2020 1 commit

[Op] Farthest Point Sampler in Cpp and CUDA (#1630) · 3d47693b

Tong He authored Jun 22, 2020

* working framework without actual algorithm logic

* rename

* fix

* fps passes compilation

* correct algorithm

* add cuda implementation

* update random start

* before refactor

* pass compilation but cuda not working

* working

* code working, will add docstring

* add mxnet support

* update docstring

* update doc and test

* cpplint

* cpcplint

* pylint

* temporary fix

* fix for win64

* fix unitetest

* fix

* fix

* remove comment

* move to geometry package

* remove redundant include

* add docstrings and comments

* add proof

* add validity check

3d47693b

17 Jul, 2019 1 commit

[Refactor] Separating graph and sparse matrix operations (#699) · b0d9e7aa

Minjie Wang authored Jul 17, 2019

* WIP: array refactoring

* WIP: implementation

* wip

* most csr part

* WIP: on coo

* WIP: coo

* finish refactoring immutable graph

* compiled

* fix undefined ndarray copy bug; add COOToCSR when coo has no data array

* fix bug in COOToCSR

* fix bug in CSR constructor

* fix bug in in_edges(vid)

* fix OutEdges bug

* pass test_graph

* pass test_graph

* fix bug in CSR constructor

* fix bug in CSR constructor

* fix bug in CSR constructor

* fix stupid bug

* pass gpu test

* remove debug printout

* fix lint

* rm biparate grpah

* fix lint

* address comments

* fix bug in Clone

* cpp utests

b0d9e7aa

12 Jun, 2019 1 commit

[Release] Bump up version (#636) · 059b1a6d

Quan (Andy) Gan authored Jun 12, 2019

* bump up version

* conda+cuda trial

* switch conda branch

* revert

* disable cudnn

059b1a6d

10 Jun, 2019 1 commit
- update known gpu arch list (#629) · a1513f7c
  Quan (Andy) Gan authored Jun 10, 2019
  
  a1513f7c