Commits · 97c3f870c760eb4e9df18cbd37e76cd81df245f4 · OpenDAS / dgl

"src/git@developer.sourcefind.cn:renzhc/diffusers_dcu.git" did not exist on "2120b4eee35bcc0db5f3acd3900fb31188ed0160"

11 Aug, 2023 1 commit
- [Misc] Small fix of cpp tests (#6137) · de344fa4
  Songqing Zhang authored Aug 12, 2023
  
  de344fa4
10 May, 2023 1 commit
- [Performance] Improve COOToCSR implementation (#5508) · e0d2250e
  Andrzej Kotłowski authored May 10, 2023
```
Co-authored-by: Hongzhi (Steve), Chen <chenhongzhi.nkcs@gmail.com>
```
  e0d2250e
06 Apr, 2023 1 commit
- [Feature] Add bfloat16 support for CPU (#5497) · acb4eb7e
  Ilia Taraban authored Apr 06, 2023
```
Co-authored-by: Hongzhi (Steve), Chen <chenhongzhi.nkcs@gmail.com>
```
  acb4eb7e
15 Mar, 2023 1 commit

[Config] Enable libxsmm by default for AVX cpu (#5165) · 87fb7ed0

Daniil Sizov authored Mar 15, 2023

* Enable AVX by default

* Fix linting errors

* Fix win64 build (libxsmm not linked)

Libxsmm on Win64 is not linked, should be disabled by default

* Fix clang format issues

* Change lower supported cpu version to LIBXSMM_X86_AVX2

Change lower supported cpu version to LIBXSMM_X86_AVX2 to address https://github.com/dmlc/dgl/issues/3459

 issue

* Fix unit test

Remove assumption that libxsmm is enabled in the config by default (only true for intel CPUs with AVX2 instructions)

---------
Co-authored-by: Ubuntu <ubuntu@ip-172-31-15-137.us-west-2.compute.internal>
Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>

87fb7ed0

21 Feb, 2023 1 commit
- [Enhancement] Change id hash map (#5304) · ed2e5409
  peizhou001 authored Feb 21, 2023
```
* change concurrent id hash map
```
  ed2e5409
09 Feb, 2023 1 commit
- [Performance]Add concurrent cpu id hashmap (#5241) · f0b7cc96
  peizhou001 authored Feb 09, 2023
```
Add Id hash map
```
  f0b7cc96
07 Nov, 2022 2 commits

[Misc] clang-format auto fix. (#4824) · 8ac27dad

Hongzhi (Steve), Chen authored Nov 07, 2022



* [Misc] clang-format auto fix.

* blabla

* ablabla

* blabla
Co-authored-by: Steve <ubuntu@ip-172-31-34-29.ap-northeast-1.compute.internal>

8ac27dad

[Misc] Replace /*! with /**. (#4823) · bcd37684

Hongzhi (Steve), Chen authored Nov 07, 2022



* replace

* blabla

* balbla

* blabla
Co-authored-by: Steve <ubuntu@ip-172-31-34-29.ap-northeast-1.compute.internal>

bcd37684

06 Nov, 2022 1 commit

[Misc] Replace \xxx with @XXX in structured comment. (#4822) · 619d735d

Hongzhi (Steve), Chen authored Nov 07, 2022



* param

* brief

* note

* return

* tparam

* brief2

* file

* return2

* return

* blabla

* all
Co-authored-by: Steve <ubuntu@ip-172-31-34-29.ap-northeast-1.compute.internal>

619d735d

04 Nov, 2022 1 commit

[Misc] clang-format auto fix. (#4812) · 33a2d9e1

Hongzhi (Steve), Chen authored Nov 04, 2022



* [Misc] clang-format auto fix.

* manual
Co-authored-by: Steve <ubuntu@ip-172-31-34-29.ap-northeast-1.compute.internal>

33a2d9e1

29 Oct, 2022 1 commit

[Sampling] Enable sampling with edge masks in sample_etype_neighbors (#4749) · 2bca4759

Quan (Andy) Gan authored Oct 29, 2022

* sample neighbors with masks

* oops

* refactor again

* remove

* remove debug code

* rename macro

* address comments

* more stuff

* remove

* fix

* try fix unit test

* oops

* fix test

* oops

* change name

* rename a lot of stuff

* oops

* ugh

* misc fixes

* lint

* address a lot of comments

* lint

* lint

* fix

* that was silly

* fix

* fix

* fix

* oops

2bca4759

19 Sep, 2022 1 commit

[Feature] Bump DLPack to v0.7 and decouple DLPack from the core library (#4454) · cded5b80

Xin Yao authored Sep 19, 2022

* rename `DLContext` to `DGLContext`

* rename `kDLGPU` to `kDLCUDA`

* replace DLTensor with DGLArray

* fix linting

* Unify DGLType and DLDataType to DGLDataType

* Fix FFI

* rename DLDeviceType to DGLDeviceType

* decouple dlpack from the core library

* fix bug

* fix lint

* fix merge

* fix build

* address comments

* rename dl_converter to dlpack_convert

* remove redundant comments

cded5b80

15 Sep, 2022 1 commit

[Feature] Import PyTorch's CUDA stream management (#4503) · 9a00cf19

Xin Yao authored Sep 15, 2022

* add set_stream

* add .record_stream for NDArray and HeteroGraph

* refactor dgl stream Python APIs

* test record_stream

* add unit test for record stream

* use pytorch's stream

* fix lint

* fix cpu build

* address comments

* address comments

* add record stream tests for dgl.graph

* record frames and update dataloder

* add docstring

* update frame

* add backend check for record_stream

* remove CUDAThreadEntry::stream

* record stream for newly created formats

* fix bug

* fix cpp test

* fix None c_void_p to c_handle

9a00cf19

06 Sep, 2022 1 commit

[Feature] Unify the cuda stream used in core library (#4480) · 1c9d2a03

Chang Liu authored Sep 05, 2022



* Use an internal cuda stream for CopyDataFromTo

* small fix white space

* Fix to compile

* Make stream optional in copydata for compile

* fix lint issue

* Update cub functions to use internal stream

* Lint check

* Update CopyTo/CopyFrom/CopyFromTo to use internal stream

* Address comments

* Fix backward CUDA stream

* Avoid overloading CopyFromTo()

* Minor comment update

* Overload copydatafromto in cuda device api
Co-authored-by: xiny <xiny@nvidia.com>

1c9d2a03

27 Jul, 2022 1 commit
- [Log] fix confusing error log in TCPSocket::Bind() (#4299) · 069068aa
  Rhett Ying authored Jul 27, 2022
```
* [Log] fix confusing error log in TCPSocket::Bind()

* fix lint
```
  069068aa
08 Jun, 2022 1 commit

[Dist] enable time out when fetching msg (#4043) · cac3720b

Rhett Ying authored Jun 08, 2022

* [ist] enable time out when fetching msg

* fix lint error

* minor refinements

* improve minor log

* fix dist test

* fix timeout issue in tensorpipe

cac3720b

11 May, 2022 1 commit

[Dist] Enable maximum try times for socket backend via DGL_DIST_MAX_T… (#3977) · 22e218d3

Rhett Ying authored May 11, 2022

* [Dist] Enable maximum try times for socket backend via DGL_DIST_MAX_TRY_TIMES

* reset env before/after test

* print log for info when trying to connect

* fix

* print log in python instead of cpp

22e218d3

27 Apr, 2022 1 commit

[Feature] enable socket net_type for rpc (#3951) · 37be02a4

Rhett Ying authored Apr 28, 2022

* [Feature] enable socket net_type for rpc

* fix lint

* fix lint

* fix build issue on windows

* fix test failure on windows

* fix test failure

* fix cpp unit test failure

* net_type blocking max_try_times

* fix other comments

* fix lint

* fix comment

* fix lint

* fix cpp

37be02a4

06 Dec, 2021 1 commit
- [Distributed] Edge-type-specific fanouts for heterogeneous graphs (#3558) · eb08ef38
  Quan (Andy) Gan authored Dec 06, 2021
```
* first commit

* second commit

* spaghetti unit tests

* rewrite test
```
  eb08ef38
10 Nov, 2021 1 commit
- [BugFix] fix in_degree/out_degree computation logic (#3477) · ea8b93f9
  Rhett Ying authored Nov 10, 2021
```
* [BugFix] fix in/out degree computation

* add unit tests
```
  ea8b93f9
06 Nov, 2021 1 commit

[Performance][GPU] Improve _SegmentCopyKernel() (#3470) · 96cd2ee6

ayasar70 authored Nov 06, 2021



* Based on issue #3436. Improving _SegmentCopyKernel s GPU utilization by switching to nonzero based thread assignment

* fixing lint issues

* Update cub for cuda 11.5 compatibility (#3468)

* fixing type mismatch

* tx guaranteed to be smaller than nnz. Hence removing last check

* minor: updating comment

* adding three unit tests for csr slice method to cover some corner cases
Co-authored-by: Abdurrahman Yasar <ayasar@nvidia.com>
Co-authored-by: nv-dlasalle <63612878+nv-dlasalle@users.noreply.github.com>
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>

96cd2ee6

04 Nov, 2021 1 commit

[Feature] aten::Relabel_() for the GPU (#3445) · d3ae7544

Xin Yao authored Nov 04, 2021



* relabel gpu

* unittest for ralebl_ on the GPU

* finish Relabel_ for the GPU

* copyright

* re-enable the unittest for edge_subgrah on the GPU

* fix unittest for tensorflow

* use a fixed number of threads
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>
Co-authored-by: nv-dlasalle <63612878+nv-dlasalle@users.noreply.github.com>
Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>

d3ae7544

28 Sep, 2021 1 commit
- [Feature] Implement one thread multiple socket (#3200) · 5cf48fc6
  Jingcheng Yu authored Sep 28, 2021
```
Co-authored-by: JingchengYu94 <jingchengyu94@gmail.com>
```
  5cf48fc6
17 Sep, 2021 1 commit
- [BugFix] initialize data if null when converting from row sorted coo to csr (#3360) · bacc9047
  Rhett Ying authored Sep 17, 2021
  
  bacc9047
14 Sep, 2021 1 commit

[Performance] improve coo2csr space complexity when row is not sorted (#3326) · f4c79f7f

Rhett Ying authored Sep 14, 2021



* [Performance] improve coo2csr space complexity when row is not sorted

* [Perf] replace std::vector<> by NDArray

* keep both impl of unsorted coo to csr and choose according to graph density dynamically

* refine criteria to choose btw Unsorted algos
Co-authored-by: Ubuntu <ubuntu@ip-172-31-34-27.us-west-2.compute.internal>

f4c79f7f

01 Sep, 2021 2 commits

[Feature] enable to specify stream in UnitGraph::CopyTo() which could lead to async copy (#3297) · 5a245104
Rhett Ying authored Sep 01, 2021
```
Co-authored-by: Minjie Wang <wmjlyjemaine@gmail.com>
```
5a245104

[Feature] Add a HINT for the per edge type sampler of heterogeneous DistGraph... · f4fe518f

xiang song(charlie.song) authored Sep 01, 2021


[Feature] Add a HINT for the per edge type sampler of heterogeneous DistGraph that highlighting the etypes are sorted already. (#3260)

* pass cpp test

* distgraph use sorted edge flag.

* lint

* triger

* update test
Co-authored-by: Ubuntu <ubuntu@ip-172-31-2-66.ec2.internal>

f4fe518f

20 Aug, 2021 1 commit

[Feature][DistDGL] Add NCCL support for range based partitions (#3213) · 7f927939

nv-dlasalle authored Aug 19, 2021

* Implement range based NDArrayPartition

* Finish implement range based partition support

* Add unit test

* Fix whitepace

* Add Kernel suffix

* Fix argument passing

* Add doxygen docs and improve variable naming

* Add unit test

* Add function for converting a partition book

* Add example to partition_op docs

* Fix dtype conversion for mxnet and tensorflow

7f927939

28 Jul, 2021 1 commit

[New Feature] Per edge type sampler for to_homogeneous graphs. (#3131) · ba7e7cf9

xiang song(charlie.song) authored Jul 28, 2021



* fix.

* fix.

* fix.

* fix.

* Fix test

* Deprecate old DistEmbedding impl, use synchronized embedding impl

* Basic imple of heterogeneous on homogenenous sampling

* make pass

* Pass C++ test

* Add python test code

* lint

* lint

* Add MultiLayerEtypeNeighborSampler

* Add unitest for single machine dataloader

* Add dist dataloader test for edge type sampler

* Fix lint

* fix

* support for per etype sample

* Fix some bug and enable distributed training with per edge sample

* fix

* Now distributed training works

* turn off some mxnet

* turn off mxnet for some dist test

* fix

* upd

* upd according to the comments

* Fix

* Fix test and now distributed works.

* upd

* upd

* Fix

* Fix bug

* remove dead code.

* upd

* Fix

* upd

* Fix
Co-authored-by: Ubuntu <ubuntu@ip-172-31-71-112.ec2.internal>
Co-authored-by: Ubuntu <ubuntu@ip-172-31-2-66.ec2.internal>
Co-authored-by: Da Zheng <zhengda1936@gmail.com>

ba7e7cf9

23 Jun, 2021 1 commit

[Feature] Biased Neighbor Sampling (#2987) · e56bbafd

Qidong Su authored Jun 23, 2021



* update

* update

* update

* update

* lint

* lint

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* lint

* update

* clone

* update

* update

* update

* update

* replace idarray with ndarray

* refactor cpp part

* refactor python part

* debug

* refactor interface

* test and doc

* lint and test

* lint

* fix

* fix

* fix

* const

* doc

* fix

* fix

* fix

* fix

* fix & doc

* fix

* fix

* update

* update

* update

* merge

* doc

* doc

* lint

* fix

* more tests

* doc

* fix

* fix

* update

* update

* update

* fix

* fix
Co-authored-by: Minjie Wang <wmjlyjemaine@gmail.com>

e56bbafd

11 Jun, 2021 1 commit

[Feature] Allow using NCCL for communication in dgl.NodeEmbedding and dgl.SparseOptimizer (#2824) · 17d604b5

nv-dlasalle authored Jun 10, 2021



* Split from NCCL PR

* Fix type in comment

* Expand documentation for sparse_all_to_all_push

* Restore previous behavior in example

* Re-work optimizer to use NCCL based on gradient location

* Allow for running with embedding on CPU but using NCCL for gradient exchange

* Optimize single partition case

* Fix pylint errors

* Add missing include

* fix gradient indexing

* Fix line continuation

* Migrate 'first_step'

* Skip tests without enough GPUs to run NCCL

* Improve empty tensor handling for pytorch 1.5

* Fix indentation

* Allow multiple NCCL communicator to coexist

* Improve handling of empty message

* Update python/dgl/nn/pytorch/sparse_emb.py
Co-authored-by: xiang song(charlie.song) <classicxsong@gmail.com>

* Update python/dgl/nn/pytorch/sparse_emb.py
Co-authored-by: xiang song(charlie.song) <classicxsong@gmail.com>

* Keepy empty tensor dimensionaless

* th.empty -> th.tensor

* Preserve shape for empty non-zero dimension tensors

* Use shared state, when embedding is shared

* Add support for gathering an embedding

* Fix typo

* Fix more typos

* Fix backend call

* Use NodeDataLoader to take advantage of ddp

* Update training script to share memory

* Only squeeze last dimension

* Better handle empty message

* Keep embedding on the target device GPU if dgl_sparse if false in RGCN example

* Fix typo in comment

* Add asserts

* Improve documentation in example
Co-authored-by: xiang song(charlie.song) <classicxsong@gmail.com>

17d604b5

10 Jun, 2021 1 commit

[Kernel] Slicing Batched Graphs (#2965) · 5be937a7

Mufei Li authored Jun 10, 2021



* Update

* Update

* Update

* Update

* Update

* Update

* Update

* Update

* Update

* Add files via upload

* Add files via upload

* Add files via upload

* Add files via upload

* Update

* Update

* Add files via upload

* Add files via upload

* Update

* Lint

* Add files via upload

* Lint

* Update

* Update

* Update

* Update

* Update

* Lint Fix

* Lint
Co-authored-by: Ubuntu <ubuntu@ip-172-31-12-161.us-west-2.compute.internal>
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>
Co-authored-by: xiang song(charlie.song) <classicxsong@gmail.com>

5be937a7

20 May, 2021 1 commit

[Feature][Performance] Implement NCCL wrapper for communicating NodeEmbeddings... · ae8dbe6d

nv-dlasalle authored May 20, 2021


[Feature][Performance] Implement NCCL wrapper for communicating NodeEmbeddings and sparse gradients. (#2825)

* Split NCCL wrapper from sparse optimizer and sparse embedding

* Add more unit tests for single node nccl

* Fix unit test for tf

* Switch to device histogram

* Fix histgram issues

* Finish migration to histogram

* Handle cases with zero send/recieve data

* Start on partition object

* Get compiling

* Updates

* Add unit tests

* Switch to partition object

* Fix linting issues

* Rename partition file

* Add python doc

* Fix python assert and finish doxygen comments

* Remove stubs for range based partition to satisfy pylint

* Wrap unit test in GPU only

* Wrap explicit cuda call in ifdef

* Merge with partition.py

* update docstrings

* Cleanup partition_op

* Add Workspace object

* Switch to using workspace object

* Move last remainder based function out of nccl_api

* Add error messages

* Update docs with examples

* Fix linting erros
Co-authored-by: xiang song(charlie.song) <classicxsong@gmail.com>

ae8dbe6d

27 Apr, 2021 1 commit

[Feature] Add cuda support for Sparse Matrix multiplication, summation and masking (#2782) · ab2bd1f1

Israt Nisa authored Apr 27, 2021



* init cuda support

* cuSPARSE err

* passed unittest for csr_mm/SpGEMM. int64 not supported

* Debugging cuSPARSE error 3

* csrgeam only supports int32?

* disabling int64 for cuda

* refactor and add CSRMask

* lint

* oops

* remove todo

* rewrite CSRMask with CSRGetData

* lint

* fix test

* address comments

* lint

* fix

* addresses comments and rename BUG_ON
Co-authored-by: Israt Nisa <nisisrat@amazon.com>
Co-authored-by: Ubuntu <ubuntu@ip-172-31-30-71.ec2.internal>
Co-authored-by: Quan Gan <coin2028@hotmail.com>
Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>
Co-authored-by: Minjie Wang <wmjlyjemaine@gmail.com>

ab2bd1f1

22 Apr, 2021 1 commit

[Sampler] BiasedChoice sampler (#1665) · 6b022d2f

Qidong Su authored Apr 22, 2021



* update

* update

* update

* update

* update

* update

* update

* fix

* fix

* update

* doc

* doc

* fix

* fix
Co-authored-by: Minjie Wang <wmjlyjemaine@gmail.com>

6b022d2f

24 Mar, 2021 1 commit

[Feature] Sparse-sparse matrix multiplication, addition, and masking (#2753) · 929d8634

Quan (Andy) Gan authored Mar 24, 2021

* test

* more stuff

* add test

* fixes

* optimize algo

* replace unordered_map with arrays

* lint

* lint x2

* oops

* disable gpu csrmm tests

* remove gpu invocation

* optimize with openmp

* remove python functions

* add back with docstrings

* lint

* lint

* update python interface

* functionize

* functionize

* lint

* lint

929d8634

27 Jan, 2021 1 commit

[Performance] Improve COO to CSR, and sort columns of CSR only when necessary. (#2391) · 2576647c

nv-dlasalle authored Jan 26, 2021

* Remove double-checking sorted

* Remove sorting of CSR by default

* Update unit test to use unsorted matix

* delete whitespace

* Expand unit tests

* Replace cusparse sort

* Fix row column sorting

* Explicitly don't sort columns

* Fix linting errors

* Fix bit-width calculation

* Fix sorting assertion and unit test

* Fix linting

* Improve CPU COO2CSR

* Remove references

* Rename and add documentation to edge encoding/decoding funcionts

* Fix sorting keys as 64 bit

* Revert cosmetic changes to unit tests

* Update documentation

* Update complexity documentation for coo to csr conversion

* Remove COOIsSorted check in CPU implementation too

2576647c

31 Dec, 2020 1 commit
- [Feature] Tvm integration (#2367) · 4208ce2b
  Zhi Lin authored Dec 31, 2020
```
Co-authored-by: Zihao Ye <expye@outlook.com>
```
  4208ce2b
17 Dec, 2020 1 commit
- [hotfix] Make USE_AVX a flag in cmake to avoid compilation error for arm user (#2428) · e379e525
  Zihao Ye authored Dec 17, 2020
```
* upd cmake

* upd

* format
```
  e379e525
17 Nov, 2020 1 commit

[Performance] Dynamic cpu kernel V3 for SpMMSumCsr all Ops (#2309) · f8ebcd7f

pawelpiotrowicz authored Nov 17, 2020



* support AVX512

* env DGL_CPU_INTEL_KERNEL_ENABLED=1

* env DGL_CPU_INTEL_KERNEL_LOG=1

* Add unittest test_spmm.cc
Co-authored-by: Izabela Mazur <izabela.mazur@intel.com>
Co-authored-by: Michal Szarmach <michal.szarmach@intel.com>

Review patch

f8ebcd7f