Commits · e9b624fe227d2e01d3aff057b4a49f0cae58da13 · OpenDAS / dgl

29 Jul, 2022 1 commit

[Feature] Add CUDA Weighted Neighborhood Sampling (#4064) · 86c81b4e

Xin Yao authored Jul 29, 2022



* add weighted sampling without replacement (A-Chao)

* improve Algorithm A-Chao with block-wise prefix sum

* correctly fill out_idxs

* implement weighted sampling with replacement

* small fix

* merge host-side code of weighted/uniform sampling

* enable unit tests for cuda weighted sampling

* move thrust/cub wrapper to the cmake file

* update docs accordingly

* fix linting

* fix linting

* fix unit test

* Bump external CUB/Thrust versions

* Fix code style and update description of algorithm design

* [Feature] GPU support weighted graph neighbor sampling
commit by pengqirong(OPPO)

* merge pengqirong's implementation

* revert the change to cub and thrust

* fix linting

* use DeviceSegmentedSort for better performance

* add more comments

* add necessary notes

* add necessary notes

* resolve some comments

* define THRUST_CUB_WRAPPED_NAMESPACE

* fix doc
Co-authored-by: 彭齐荣 <657017034@qq.com>
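
For readers unfamiliar with the two sampling schemes named in this commit, here is a minimal CPU sketch in Python. It is not the CUDA implementation from the commit. It assumes the textbook form of Algorithm A-Chao (weighted reservoir sampling without replacement, without the correction for overweight items) and a prefix-sum plus binary-search scheme for sampling with replacement. The function names `a_chao_sample` and `sample_with_replacement` are hypothetical.

```python
import bisect
import random
from itertools import accumulate


def a_chao_sample(weights, k, rng=None):
    """Pick up to k distinct indices, favoring larger weights (simplified A-Chao)."""
    rng = rng or random.Random()
    reservoir = []
    total = 0.0  # running sum of the weights seen so far
    for i, w in enumerate(weights):
        if w < 0:
            raise ValueError("weights must be non-negative")
        total += w
        if len(reservoir) < k:
            # The first k items fill the reservoir unconditionally.
            reservoir.append(i)
        elif total > 0:
            # Item i enters with probability k * w / total (capped at 1)
            # and evicts a uniformly chosen reservoir slot.
            p = min(1.0, k * w / total)
            if rng.random() < p:
                reservoir[rng.randrange(k)] = i
    return reservoir


def sample_with_replacement(weights, k, rng=None):
    """Draw k indices with replacement via an inclusive prefix sum and binary search."""
    rng = rng or random.Random()
    prefix = list(accumulate(weights))
    if not prefix or prefix[-1] <= 0:
        raise ValueError("need at least one positive weight")
    total = prefix[-1]
    last = len(prefix) - 1
    # bisect_right skips zero-weight items, whose prefix value repeats.
    return [min(bisect.bisect_right(prefix, rng.random() * total), last)
            for _ in range(k)]
```

A GPU version would typically compute the prefix sums in parallel per block or per segment; the sequential loops here only illustrate the selection logic.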


07 Jul, 2022 1 commit
- [Performance] Redirect `AllocWorkspace` to PyTorch's allocator if available (#4199) · 9ee7ced5
  Xin Yao authored Jul 07, 2022
  
28 Jun, 2022 1 commit
- [BugFix] fix build issue on mac OS (#4175) · 15188611
  Rhett Ying authored Jun 28, 2022
  * [BugFix] fix build issue on mac OS

  * refine
27 Jun, 2022 1 commit

[Dist] enable USE_EPOLL in default (#4167) · 9d425315

Rhett Ying authored Jun 27, 2022

* [Dist] enable USE_EPOLL in default

* fix build issue on windows

* fix build issue on windows

* fix build issue on windows

* fix build issue on windows

* fix build issue on windows

* fix build issue


08 Jun, 2022 1 commit

[DistTest] add basic pipeline for dist test across machines (#3984) · c1ff4c9b

Rhett Ying authored Jun 08, 2022



* [DistTest] add basic pipeline for dist test across machines

* move launch remote cmd to separate file

* add test for rpc

* fix function naming rule
Co-authored-by: Minjie Wang <wmjlyjemaine@gmail.com>


11 May, 2022 1 commit

Make USE_AVX flag default value OFF (#3983) · 1a6806e2

Vikram Sharma authored May 11, 2022



With the emergence of new ISAs (such as ARM and RISC-V), keeping USE_AVX ON by default makes the default build instructions fail. DGL does not require AVX to function correctly; AVX is only needed to enable certain optimizations. The proposal is therefore to turn it off by default, so that users whose CPUs support AVX can enable it at build time with
`cmake .. -DUSE_AVX=ON`
Co-authored-by: Zihao Ye <expye@outlook.com>
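
The opt-in described in this commit could be automated along the following lines. This is a hypothetical helper, not part of DGL's build scripts. It assumes a Linux-style `/proc/cpuinfo` with a `flags` line, and it only chooses the value passed to the `USE_AVX` option named in the commit message.

```python
def read_cpu_flags(path="/proc/cpuinfo"):
    """Return the CPU feature flags as a set, or an empty set if unavailable."""
    try:
        with open(path) as f:
            for line in f:
                # x86 Linux kernels list features on a line such as "flags : fpu ... avx".
                if line.startswith("flags") and ":" in line:
                    return set(line.split(":", 1)[1].split())
    except OSError:
        pass  # no /proc/cpuinfo (e.g. macOS or Windows): report no flags
    return set()


def cmake_avx_option(cpu_flags):
    """Map detected CPU flags to the USE_AVX option on the cmake command line."""
    return "-DUSE_AVX=ON" if "avx" in cpu_flags else "-DUSE_AVX=OFF"


if __name__ == "__main__":
    print("cmake ..", cmake_avx_option(read_cpu_flags()))
```

On ARM or RISC-V machines no `avx` flag is reported, so the sketch falls back to the new default of OFF.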


07 Feb, 2022 1 commit
- Change standard to c++14 to be compatible to tensorpipe's dependencies (#3712) · 35122075
  nv-dlasalle authored Feb 06, 2022
  Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>
11 Jan, 2022 1 commit
- [Build] Fix compiler crashes when system's libuv-devel is older than required (#3640) · 6dce19d8
  Quan (Andy) Gan authored Jan 11, 2022
  
06 Dec, 2021 1 commit

[RPC] Use tensorpipe for rpc communication (#3335) · a3ce780d

Jinjing Zhou authored Dec 06, 2021

* not sure whether this works

* add change

* fix

* fix

* fix

* remove

* revert

* lint

* lint

* fix

* revert

* lint

* fix

* only build rpc on linux

* lint

* lint

* fix build on windows

* fix windows

* remove old test

* fix cmake

* Revert "remove old test"

This reverts commit f1ea75c777c34cdc1f08c0589676ba6aee1feb29.

* fix windows

* fix

* fix

* fix indent

* fix indent

* address comment

* fix

* fix

* fix

* fix

* fix

* lint

* fix indent

* fix lint

* add introduction

* fix

* lint

* lint

* add more logs

* fix

* update xbyak for C++14 with gcc5

* Remove channels

* fix

* add test script

* fix

* remove unused file

* fix lint

* add timeout


03 Dec, 2021 1 commit
- Bring back thrust for backward compatibility (#3562) · 769718df
  Jinjing Zhou authored Dec 03, 2021
  
02 Dec, 2021 1 commit
- Fix removed submodule (#3560) · 9f445a13
  Jinjing Zhou authored Dec 03, 2021
  
29 Nov, 2021 1 commit
- Fix #3497 (#3546) · 03c2c6d1
  Jinjing Zhou authored Nov 29, 2021
  
14 Oct, 2021 1 commit

[Bugfix] three bugs related to using DGL as a subdirectory(third_party) of another project. (#3379) · 18863069

zexi yuan authored Oct 14, 2021

* [Bugfix] fix a compile error for the Debug build type on Windows

When CMakeLists.txt is used to build the "Debug" build type on Windows, the file "dgl\src\runtime\shared_mem.cc" produces three compile errors (C4716):

'dgl::runtime::SharedMemory::CreateNew': must return a value
'dgl::runtime::SharedMemory::Open': must return a value
'dgl::runtime::SharedMemory::Exist': must return a value

* [Bugfix] cmake error "cannot find load file" when DGL is used as a subdirectory on Linux

When DGL is used as a subdirectory of a CMake project, "CMAKE_SOURCE_DIR" returns the source directory of the parent CMake scope, which is not the expected directory.
It may be better to use "CMAKE_CURRENT_SOURCE_DIR" to set "GKLIB_PATH".

* [Bugfix] cmd cmake error when DGL is used as a subdirectory

When DGL is used as a subdirectory of another project, the WORKING_DIRECTORY of "add_custom_command" at line 255 of "CMakeLists.txt" is incorrect, which causes a cmake "setlocal" error.


28 Sep, 2021 1 commit
- [Feature] Implement one thread multiple socket (#3200) · 5cf48fc6
  Jingcheng Yu authored Sep 28, 2021
  Co-authored-by: JingchengYu94 <jingchengyu94@gmail.com>
06 Sep, 2021 1 commit
- Remove deprecated kernels (#3316) · c81efdf2
  Jinjing Zhou authored Sep 06, 2021
  * remove

  * remove

  * fix

  * remove

  * remove
13 Jul, 2021 2 commits

Remove march=native flag (#3134) · 7c3e1f94
Quan (Andy) Gan authored Jul 13, 2021


[CPU][Kernel] Single socket spmm (#3024) · fac75e16

sanchit-misra authored Jul 13, 2021



* optimizations of spmm for CPU

* Added names of contributors

* Minor code cleanup

* Moved the spmm optimization code to a new header file

* Moved to DGL's logging method

* removed duplicate code between SpMMSumCsr and SpMMCmpCsr

* Changes made to follow Google coding style

* Fixed lint errors in spmm.h

* Fixed some lint errors from spmm_blocking_libxsmm.h

* Fixed lint errors from spmm_blocking_libxsmm.h

* Added comments to SpMMCreateLibxsmmKernel

* to enable building of tests, and other cosmetic changes

* disabling libxsmm on windows

* Put a condition to avoid opt impl for FP64 as libxsmm does not have FP64 support yet

* cosmetic changes and documentation

* cosmetic changes

* to pass lint tests

* replaced multiple allocations for buffers of indices and edges with a single allocation
Co-authored-by: Minjie Wang <wmjlyjemaine@gmail.com>


27 Jun, 2021 1 commit

[Build] Make nccl optional (#3056) · 9664cdff

Jinjing Zhou authored Jun 27, 2021

* fix

* remove nvidiasmi

* fix

* fix docs

* fix

* fix

* 1

* fix

* remove

* skip deprecated kernel

* fix

* Revert "skip deprecated kernel"

This reverts commit c5ceb7f60dbbaf065b81cc3680757fd611d90ad3.

* fix


03 Jun, 2021 1 commit
- [Build] Fix NCCL building crashes when using submodules but with system NCCL installed (#2975) · 7e58236c
  Quan (Andy) Gan authored Jun 03, 2021
  
25 May, 2021 1 commit

[Bugfix] Include NCCL as a submodule (#2934) · 66eb240d

nv-dlasalle authored May 24, 2021



* Add NCCL as a submodule

* Allow using third_party/nccl or system nccl

* Add nccl_external as a dependency

* Fix conditional
Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>


20 May, 2021 1 commit

[Feature][Performance] Implement NCCL wrapper for communicating NodeEmbeddings and sparse gradients. (#2825) · ae8dbe6d

nv-dlasalle authored May 20, 2021

* Split NCCL wrapper from sparse optimizer and sparse embedding

* Add more unit tests for single node nccl

* Fix unit test for tf

* Switch to device histogram

* Fix histogram issues

* Finish migration to histogram

* Handle cases with zero send/receive data

* Start on partition object

* Get compiling

* Updates

* Add unit tests

* Switch to partition object

* Fix linting issues

* Rename partition file

* Add python doc

* Fix python assert and finish doxygen comments

* Remove stubs for range based partition to satisfy pylint

* Wrap unit test in GPU only

* Wrap explicit cuda call in ifdef

* Merge with partition.py

* update docstrings

* Cleanup partition_op

* Add Workspace object

* Switch to using workspace object

* Move last remainder based function out of nccl_api

* Add error messages

* Update docs with examples

* Fix linting errors
Co-authored-by: xiang song(charlie.song) <classicxsong@gmail.com>


09 Apr, 2021 1 commit

[Feature] Add kd-tree implementation (CPU) for kNN (#2767) · e83d0a80

Tianqi Zhang (张天启) authored Apr 09, 2021



* add submodule nanoflann

* finish python API for knn

* finish ndarray adaptor

* finish cpu-kdtree version of knn

* use openmp

* add endline

* upt

* upt

* fix format and code style

* upt

* add warning for gpu-cpu copy

* avoid contiguous copy
Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>
Co-authored-by: Tong He <hetong007@gmail.com>
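
To illustrate what a kd-tree buys for kNN queries, here is a small pure-Python sketch. It is not the nanoflann-based implementation added by this commit. The names `build_kdtree`, `_search` and `knn_query` are hypothetical, and points are assumed to be equal-length tuples of floats.

```python
import heapq


def build_kdtree(items, depth=0):
    """items: list of (index, coords). Split on the median, cycling through axes."""
    if not items:
        return None
    axis = depth % len(items[0][1])
    items = sorted(items, key=lambda it: it[1][axis])
    mid = len(items) // 2
    return {
        "item": items[mid],
        "axis": axis,
        "left": build_kdtree(items[:mid], depth + 1),
        "right": build_kdtree(items[mid + 1:], depth + 1),
    }


def _search(node, query, k, heap):
    """heap holds (-squared_distance, index), so heap[0] is the worst of the best k."""
    if node is None:
        return
    idx, coords = node["item"]
    d2 = sum((a - b) ** 2 for a, b in zip(coords, query))
    if len(heap) < k:
        heapq.heappush(heap, (-d2, idx))
    elif d2 < -heap[0][0]:
        heapq.heapreplace(heap, (-d2, idx))
    diff = query[node["axis"]] - coords[node["axis"]]
    near, far = (node["left"], node["right"]) if diff < 0 else (node["right"], node["left"])
    _search(near, query, k, heap)
    # Visit the far side only if the splitting plane is closer than the current worst hit.
    if len(heap) < k or diff * diff < -heap[0][0]:
        _search(far, query, k, heap)


def knn_query(points, query, k):
    """Return the indices of the k nearest points to query, closest first."""
    tree = build_kdtree(list(enumerate(points)))
    heap = []
    _search(tree, query, k, heap)
    return [idx for _, idx in sorted((-neg_d2, idx) for neg_d2, idx in heap)]
```

For example, `knn_query([(0.0, 0.0), (1.0, 1.0), (2.0, 2.0), (5.0, 5.0), (9.0, 9.0)], (1.2, 1.2), 2)` visits only part of the tree and returns the indices of the two closest points.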


24 Mar, 2021 1 commit

[Feature] Sparse-sparse matrix multiplication, addition, and masking (#2753) · 929d8634

Quan (Andy) Gan authored Mar 24, 2021

* test

* more stuff

* add test

* fixes

* optimize algo

* replace unordered_map with arrays

* lint

* lint x2

* oops

* disable gpu csrmm tests

* remove gpu invocation

* optimize with openmp

* remove python functions

* add back with docstrings

* lint

* lint

* update python interface

* functionize

* functionize

* lint

* lint
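
The "replace unordered_map with arrays" step above refers to a common trick in sparse-sparse multiplication. The sketch below shows one way it can look: a row-by-row CSR product that accumulates into dense arrays instead of a hash map. This is an illustrative Python version under that assumption, not the code from this commit, and the function name `csr_matmul` is hypothetical.

```python
def csr_matmul(a_indptr, a_indices, a_data, b_indptr, b_indices, b_data, n_cols):
    """Compute C = A @ B for CSR inputs; n_cols is the number of columns of B."""
    n_rows = len(a_indptr) - 1
    acc = [0.0] * n_cols     # dense accumulator for the current output row
    marker = [-1] * n_cols   # marker[c] == i means column c is already active in row i
    c_indptr, c_indices, c_data = [0], [], []
    for i in range(n_rows):
        active = []
        for jj in range(a_indptr[i], a_indptr[i + 1]):
            k, v = a_indices[jj], a_data[jj]
            # Scale row k of B by A[i, k] and add it into the accumulator.
            for kk in range(b_indptr[k], b_indptr[k + 1]):
                col = b_indices[kk]
                if marker[col] != i:
                    marker[col] = i
                    acc[col] = 0.0
                    active.append(col)
                acc[col] += v * b_data[kk]
        active.sort()  # emit the row with sorted column indices
        for col in active:
            c_indices.append(col)
            c_data.append(acc[col])
        c_indptr.append(len(c_indices))
    return c_indptr, c_indices, c_data
```

The marker array avoids clearing the accumulator between rows, so the work per output row is proportional to the number of products formed rather than to `n_cols`.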


28 Jan, 2021 1 commit

[feature] Supporting half precision floating data type (fp16). (#2552) · 7bab1365

Zihao Ye authored Jan 28, 2021



* add tvm as submodule

* compilation is ok but calling fails

* can call now

* pack multiple modules, change names

* upd

* upd

* upd

* fix cmake

* upd

* upd

* upd

* upd

* fix

* relative path

* upd

* upd

* upd

* singleton

* upd

* trigger

* fix

* upd

* count reducible

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* upd

* only keep related files

* upd

* upd

* upd

* upd

* lint

* lint

* lint

* lint

* pylint

* upd

* upd

* compilation

* fix

* upd

* upd

* upd

* upd

* upd

* upd

* upd doc

* refactor

* fix

* upd number
Co-authored-by: Zhi Lin <linzhilynn@gmail.com>
Co-authored-by: Ubuntu <ubuntu@ip-172-31-42-78.us-east-2.compute.internal>
Co-authored-by: Ubuntu <ubuntu@ip-172-31-21-156.us-east-2.compute.internal>
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>


31 Dec, 2020 1 commit
- [Feature] Tvm integration (#2367) · 4208ce2b
  Zhi Lin authored Dec 31, 2020
  Co-authored-by: Zihao Ye <expye@outlook.com>
25 Dec, 2020 1 commit

[Performance] Use allocator from PyTorch if possible (#2328) · 9a7235fa

Quan (Andy) Gan authored Dec 25, 2020

* first commit

* some thoughts

* move around

* more commit

* more fixes

* now it uses torch allocator

* fix symbol export error

* fix

* fixes

* test fix

* add script

* building separate library per version

* fix for vs2019

* more fixes

* fix on windows build

* update jenkinsfile

* auto copy built dlls for windows

* lint and installation guide update

* fix

* specify conda environment

* set environment for ci

* fix

* fix

* fix

* fix again

* revert

* fix cmake

* fix

* switch to using python interpreter path

* remove scripts

* debug

* oops sorry

* Update index.rst

* Update index.rst

* copies automatically, no need for this

* do not print message if library not found

* tiny fixes

* debug on nightly

* replace add_compile_definitions to make CMake 3.5 happy

* fix linking to wrong lib for multiple pytorch envs

* changed building strategy

* fix nightly

* fix windows

* fix windows again

* setup bugfix

* address comments

* change README


21 Dec, 2020 1 commit
- [hotfix] Enable AVX optimization by default. (#2438) · 5d3da4bc
  Zihao Ye authored Dec 21, 2020
  
17 Dec, 2020 1 commit
- [hotfix] Make USE_AVX a flag in cmake to avoid compilation error for arm user (#2428) · e379e525
  Zihao Ye authored Dec 17, 2020
  * upd cmake

  * upd

  * format
27 Nov, 2020 1 commit
- disable warn-common on mac (#2374) · 35a3ead2
  Quan (Andy) Gan authored Nov 27, 2020
  
17 Nov, 2020 1 commit

[Performance] Dynamic cpu kernel V3 for SpMMSumCsr all Ops (#2309) · f8ebcd7f

pawelpiotrowicz authored Nov 17, 2020



* support AVX512

* env DGL_CPU_INTEL_KERNEL_ENABLED=1

* env DGL_CPU_INTEL_KERNEL_LOG=1

* Add unittest test_spmm.cc
Co-authored-by: Izabela Mazur <izabela.mazur@intel.com>
Co-authored-by: Michal Szarmach <michal.szarmach@intel.com>

Review patch


14 Nov, 2020 1 commit
- [Build] use different flags for NVCC and CC (#2342) · 77968e30
  Minjie Wang authored Nov 14, 2020
  
13 Nov, 2020 1 commit

[Bug] Multiple fixes for CUDA 11 support (#2333) · 501b2b75

Quan (Andy) Gan authored Nov 13, 2020



* multiple fixes

* fix CI

* fiddle

* revert stubs

* remove stubs

* poke

* remove linking of driver library

* minor
Co-authored-by: Minjie Wang <wmjlyjemaine@gmail.com>


07 Nov, 2020 1 commit

[CUDA] Add CUDA11 support (#2308) · 4fb0241b

Minjie Wang authored Nov 07, 2020



* add support for cuda 11

* fix inc bug in pytorch 1.8

* poke ci

* fix

* small fix

* try fix

* try fix
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>
Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>


30 Oct, 2020 1 commit

[Dataloading] Add class for copying tensors to/from the GPU on a non-default stream (#2284) · f673fc25

nv-dlasalle authored Oct 30, 2020

* Add async transferer class

* Add async ndarray copy interface

* Add python bindings

* Fix comment

* Add python class

* Fix linting issues

* Add python unit test

* Update python interface

* move async_transferer to cuda only directory

* Fix linting issue

* Move out of contrib

* Add doc strings

* Move test compute from backend

* Update comment

* Fix test naming

* Fix argument usage

* Wrap/unwrap backend parameters

* Move to dataloading

* Move to 'dataloading'

* Make GPU/CPU compatible

* Fix unit tests

* Add docs

* Use only backend interface for datamovement in unit test


26 Aug, 2020 1 commit
- Fix build error with hdfs and update dmlc-core (#2107) · 628d9fc5
  Jinjing Zhou authored Aug 26, 2020
  * update dmlc-core for hdfs build

  * add hdfs support

  * default off

  * trigger ci
10 Aug, 2020 1 commit

Fix the performance issue of graph partitioning in new DGLGraph (#1934) · 729ff2ef

Da Zheng authored Aug 09, 2020



* fix perf.

* fix.

* accelerate metis.

* fix lint.

* use gklib.

* fix perf.

* fix.

* update metis.

* update launch script

* handle synchronized API.

* fix.

* fix example.

* fix dataloader.

* temp fix.

* temp fix omp.

* distinguish roles.

* initialize iterator of DistDataloader correctly.

* check the correctness of launch script.

* move feature copy to sampler.

* measure mem/network copy time.

* remove

* Revert "measure mem/network copy time."

This reverts commit 86cefdc14b7815fcf5aad6496af912dba48e4aa6.

* fix.

* fix

* fix.

* fix cmake.

* disable metis in windows.

* disable metis tests in windows.

* remove test for multigraph.

* fix test.

* fix.

* fix cmake.

* fix.

* revert.
Co-authored-by: Ubuntu <ubuntu@ip-172-31-19-115.us-west-2.compute.internal>
Co-authored-by: Ubuntu <ubuntu@ip-172-31-19-1.us-west-2.compute.internal>


09 Jul, 2020 1 commit

[Windows] Compile METIS on Windows (#1771) · 22a6ad6d

Quan (Andy) Gan authored Jul 09, 2020



* make metis compilable on windows

* lint
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>


30 Jun, 2020 1 commit

[Data] Support HeteroGraph save/load (#1526) · 18a26fcf

Jinjing Zhou authored Jun 30, 2020



* 111

* add history version test

* fix

* 111

* save

* ``

* fix1

* 111

* add save heterograph

* lint

* lint

* add tests

* minor fix

* fix

* docs

* add format tests

* use unique_ptr

* fix

* fix interface

* 111

* 111

* fix

* lint

* fix

* add support to s3

* fix

* fix

* fix leak

* fix

* fix docs

* fix

* lint

* fix

* fix

* fix

* address comment

* address comment
Co-authored-by: Minjie Wang <wmjlyjemaine@gmail.com>
Co-authored-by: xiang song(charlie.song) <classicxsong@gmail.com>


28 Jun, 2020 1 commit

[CUDA][Kernel] More CUDA kernels; Standardize the behavior for sorted COO/CSR (#1704) · 870da747

Minjie Wang authored Jun 28, 2020

* add cub; array cumsum

* CSRSliceRows

* fix warning

* operator << for ndarray; CSRSliceRows

* add CSRIsSorted

* add csr_sort

* inplace coosort and outplace csrsort

* WIP: coo is sorted

* mv cuda_utils

* add AllTrue utility

* csr sort

* coo sort

* coo2csr for sorted coo arrays

* CSRToCOO from sorted

* pass tests for the new kernel changes

* cannot use inplace sort

* lint

* try fix msvc error

* Fix g.copy_to and g.asnumbits; ToBlock no longer uses CSC

* stash

* revert some hack

* revert some changes

* address comments

* fix

* fix to_block unittest

* add todo note
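
The "array cumsum" and "coo2csr for sorted coo arrays" items above fit together in a well-known way: when a COO matrix is already sorted by row, the CSR row pointer is just a prefix sum over per-row entry counts, and the column array can be reused unchanged. Below is a minimal CPU sketch of that idea in Python. It is not the CUDA kernel from this commit, and the name `sorted_coo_to_csr` is hypothetical.

```python
from itertools import accumulate


def sorted_coo_to_csr(num_rows, row, col):
    """Convert a row-sorted COO matrix to CSR (indptr, indices)."""
    if any(row[i] > row[i + 1] for i in range(len(row) - 1)):
        raise ValueError("COO entries must be sorted by row")
    counts = [0] * num_rows
    for r in row:
        counts[r] += 1  # number of nonzeros in each row
    # Exclusive prefix sum of the counts gives the start offset of every row.
    indptr = [0] + list(accumulate(counts))
    # Entries are already grouped by row, so the column order needs no permutation.
    return indptr, list(col)
```

With unsorted input, a sort (or a scatter using the offsets) would be needed first, which is why the sorted case is worth a dedicated fast path.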


21 Jun, 2020 1 commit

[Op] Farthest Point Sampler in Cpp and CUDA (#1630) · 3d47693b

Tong He authored Jun 22, 2020

* working framework without actual algorithm logic

* rename

* fix

* fps passes compilation

* correct algorithm

* add cuda implementation

* update random start

* before refactor

* pass compilation but cuda not working

* working

* code working, will add docstring

* add mxnet support

* update docstring

* update doc and test

* cpplint

* cpplint

* pylint

* temporary fix

* fix for win64

* fix unittest

* fix

* fix

* remove comment

* move to geometry package

* remove redundant include

* add docstrings and comments

* add proof

* add validity check
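
As background for this commit, farthest point sampling can be sketched in a few lines of Python. This is the standard greedy formulation, assumed here for illustration and not taken from the C++/CUDA code: the next sample is always the point farthest from everything chosen so far. The function name and the fixed `start` parameter are hypothetical; the commit itself mentions a random start.

```python
def farthest_point_sampling(points, num_samples, start=0):
    """Return indices of num_samples points chosen greedily by farthest distance."""
    n = len(points)
    if not 0 < num_samples <= n:
        raise ValueError("num_samples must be between 1 and len(points)")
    # dist[i] is the squared distance from point i to its nearest selected point.
    dist = [float("inf")] * n
    selected = [start]
    for _ in range(num_samples - 1):
        last = points[selected[-1]]
        for i, p in enumerate(points):
            d = sum((a - b) ** 2 for a, b in zip(p, last))
            if d < dist[i]:
                dist[i] = d
        # The point with the largest nearest-selected distance comes next.
        selected.append(max(range(n), key=dist.__getitem__))
    return selected
```

Each iteration only compares against the most recently selected point, so the total cost is proportional to `num_samples * len(points)` distance evaluations.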
