Commits · 92a706446b497fd23d0ddfc88fd7383c8a0917f1 · OpenDAS / dgl

01 Mar, 2025 2 commits
- Update macro definitions and sortKerys · 92a70644
  sangwz authored Mar 01, 2025
- code update for v2.2.1+dtk25.04 · 83ea9a8d
  sangwz authored Mar 01, 2025
20 Feb, 2025 1 commit
- dtk 2504 update · 314cedc1
  sangwz authored Feb 20, 2025
16 Oct, 2024 1 commit
- update warpsize to 64 · 3befaca2
  sangwzh authored Oct 16, 2024
15 Oct, 2024 1 commit
- update device pointer getting while using UVA · 910cec0c
  sangwzh authored Oct 15, 2024
25 Sep, 2024 1 commit
- update atomicAdd and csr2coo.hip · 910d6a98
  sangwzh authored Sep 25, 2024
23 Sep, 2024 1 commit
- update dgl codes to hip · 833803f3
  sangwzh authored Sep 23, 2024
13 Sep, 2024 1 commit
- update src and graphbolt code · 6ac701f8
  sangwzh authored Sep 13, 2024
20 Apr, 2024 1 commit
- [CUDA] Remove unused headers for CCCL 2.4 compat (#7329) · 7de2e51b
  Muhammed Fatih BALIN authored Apr 20, 2024
19 Apr, 2024 1 commit
- [Determinism] Enable environment var to use cusparse spmm deterministic algorithm (#7310) · a4e19691
  Triston authored Apr 18, 2024
```
Co-authored-by: Hongzhi (Steve), Chen <chenhongzhi.nkcs@gmail.com>
```
12 Apr, 2024 1 commit
- [CUDA][Bug] CSR transpose bug in CUDA 12 (#7295) · 2ff3006c
  Muhammed Fatih BALIN authored Apr 12, 2024
29 Feb, 2024 1 commit
- [CUDA] Update CCCL to 2.3.0 (#7171) · 73e01d6d
  Muhammed Fatih BALIN authored Feb 29, 2024
23 Nov, 2023 1 commit
- [Misc] Fix signed unsigned comparison warning (#6602) · 5e78e070
  Muhammed Fatih BALIN authored Nov 22, 2023
22 Nov, 2023 1 commit
- [CUDA] Fix issue about integer overflow (#6586) · bfde1422
  Muhammed Fatih BALIN authored Nov 22, 2023
14 Aug, 2023 1 commit
- [Build] Fix bf16/fp16 building issues for CUDA 12.2 (#6074) · 08d18a47
  Xin Yao authored Aug 14, 2023
```
Signed-off-by: Xin Yao <xiny@nvidia.com>
```
10 Aug, 2023 1 commit
- [Bugfix] Fix cusparseCreateCsr format for cuda12 (#6121) · 88964a82
  Chang Liu authored Aug 10, 2023
19 Jul, 2023 1 commit
- [Feature] Adding kappa feature for labor (Cooperative Minibatching) (#6006) · d3bd4c61
  Muhammed Fatih BALIN authored Jul 18, 2023
```
Co-authored-by: Hongzhi (Steve), Chen <chenhongzhi.nkcs@gmail.com>
```
14 Jul, 2023 2 commits
- [Performance][CUDA] Sorting for indices for UVM code path. (#5882) · f5f7e08e
  Muhammed Fatih BALIN authored Jul 14, 2023
- [Performance][CUDA] Faster CSRToCOO (#5648) · 83115794
  Muhammed Fatih BALIN authored Jul 14, 2023
```
Co-authored-by: Hongzhi (Steve), Chen <chenhongzhi.nkcs@gmail.com>
```
13 Jul, 2023 1 commit

[Performance][CUDA] Labor UVA optimization (#5885) · c3aea1b6

Muhammed Fatih BALIN authored Jul 13, 2023


Co-authored-by: Xin Yao <xiny@nvidia.com>
Co-authored-by: Rhett Ying <85214957+Rhett-Ying@users.noreply.github.com>


17 May, 2023 1 commit
- [Performance Improvement] Make GPU sampling and to_block use pinned memory to decrease required synchronization (#5685) · 46af76c3
  nv-dlasalle authored May 17, 2023
23 Mar, 2023 1 commit
- [Performance] Creating out buffers for `segment_mm`|`sddmm` via `torch.empty()` (#5462) · 170203ae
  Xin Yao authored Mar 23, 2023
```
* update for segmentMM

* update for sddmm

* fix a bug
```
08 Mar, 2023 1 commit
- Fix compile error on ubuntu22.04_g++11.3.0 (#5434) · b1ec112e
  Rhett Ying authored Mar 08, 2023
12 Jan, 2023 1 commit
- [Bugfix] Replace global cudaStream in Filter with runtime calls (fix #5153) (#5157) · 751b4c26
  nv-dlasalle authored Jan 12, 2023
```
* Add failing unit test

* Add fix

* Remove extra newline

* skip cpu test
Co-authored-by: Xin Yao <yaox12@outlook.com>
```
09 Dec, 2022 1 commit

[Bugfix] Fix empty tensors possibly being treated as pinned (#5005) · aad3bd04

Xin Yao authored Dec 09, 2022

* fix empty tensor is treated as pinned

* avoid calling cudaHostGetDevicePointer on nullptr

* update empty array

* add a comment


06 Dec, 2022 1 commit

Add support for next cusparse release (#4974) · fb223d47

Chang Liu authored Dec 05, 2022

* Add support for next cusparse release

* Fix lint

* Add switch and tune the performance

* Fix lint issue

* Fine tune the heuristics

* Fix lint issue

* Address comments

* Minor fix

* Address comments


24 Nov, 2022 1 commit
- [Cleanup] Remove duplicated _IndexSelect (#4874) · c59000ac
  Xin Yao authored Nov 24, 2022
22 Nov, 2022 2 commits

[Performance] Leverage hashmap to accelerate CSRSliceMatrix<kDGLCUDA, IdType> (#4924) · aa419895

Ping Gong authored Nov 22, 2022



* Leverage hashmap to accelerate CSRSliceMatrix

* fix lint check

* use `min` in cuda_runtime.ch

* fix hash func

* add some comments and adjust the <grid,block> of the _SegmentMaskColKernel kernel

* set device and stream for thrust::for_each

* use thrust::cuda::par_nosync
Co-authored-by: Xin Yao <xiny@nvidia.com>


[Feature] (La)yer-Neigh(bor) sampling implementation (#4668) · bf264d00

Muhammed Fatih BALIN authored Nov 21, 2022



* adding LABOR sampling

* add ladies and pladies samplers

* fix compile error after rebase

* add reference for ladies sampler

* Improve ladies implementation.

* weighted labor sampling initial implementation draft
fix indentation and small bug in ladies script

* importance_sampling currently doesn't work with weights

* fix weighted importance sampling

* move labor example into its own folder

* lint fixes

* Improve documentation

* remove examples from the main PR

* fix linting by not using c++17 features

* fix documentation of labor_sampler.py

* update documentation for labor.py

* reformat the labor.py file with black

* fix linting errors

* replace exception use with if

* fix typo in error comment

* fixing win64 build for ci

* fixing weighted implementation, works now.

* fix bug in the weighted case and importance_sampling==0

* address part of the reviews

* remove unused code paths from cuda

* remove unused code path from cpu side

* remove extra features of labor making use of random seed.

* fix exclude_edges bug

* remove pcg and seed logic from cpu implementation, seed logic should still work for cuda.

* minor style change

* refactor CPU implementation, take out the importance_sampling probability computation into a function.

* improve CUDAWorkspaceAllocator

* refactor importance_sampling part out to a function

* minor optimization

* fix linting issue

* Revert "remove pcg and seed logic from cpu implementation, seed logic should still work for cuda."

This reverts commit c250e07ac6d7e13f57e79e8a2c2f098d777378c2.

* Revert "remove extra features of labor making use of random seed."

This reverts commit 7f99034353080308f4783f27d9a08bea343fb796.

* fix the documentation

* disable NIDs

* improve the documentation in the code

* use the stream argument in pcg32 instead of skipping ahead t times, can discard the use of hashmap now since it is faster this way.

* fix linting issue

* address another round of reviews

* further optimize CPU LABOR sampling implementation

* fix linting error

* update the comment

* reformat

* rename and rephrase comment

* fix formatting according to new linting specs

* fix compile error due to renaming, fix linting.

* lint

* rename DGLHeteroGraph to DGLGraph to match master

* replace other occurrences of DGLHeteroGraph to DGLGraph
Co-authored-by: Muhammed Fatih BALIN <m.f.balin@gmail.com>
Co-authored-by: Kaan Sancak <kaansnck@gmail.com>
Co-authored-by: Quan Gan <coin2028@hotmail.com>


10 Nov, 2022 1 commit

[Bugfix] Fix that half-precision SpMM produces incorrect results (#4842) · a8f9d5ef

Xin Yao authored Nov 10, 2022

* update accumulator

* rename half to __half

* add bfloat16

* simplify code

* fix another case

* add unit test

* disable half-precision SpMMCoo

* fix lint


08 Nov, 2022 1 commit

[Misc] Minor code style fix. (#4843) · cb5e3489

Hongzhi (Steve), Chen authored Nov 08, 2022



* [Misc] Change the max line length for cpp to 80 in lint.

* blabla

* blabla

* blabla

* ablabla
Co-authored-by: Steve <ubuntu@ip-172-31-34-29.ap-northeast-1.compute.internal>


07 Nov, 2022 4 commits

[Misc] clang-format auto fix. (#4831) · 889798fe

Hongzhi (Steve), Chen authored Nov 07, 2022



* [Misc] clang-format auto fix.

* blabla

* nolint

* blabla
Co-authored-by: Steve <ubuntu@ip-172-31-34-29.ap-northeast-1.compute.internal>


[Misc] Minor code style fix. (#4825) · df089424

Hongzhi (Steve), Chen authored Nov 07, 2022



* blabla

* more

* blabla

* blabla

* ablabla

* blabla
Co-authored-by: Steve <ubuntu@ip-172-31-34-29.ap-northeast-1.compute.internal>


[Misc] clang-format auto fix. (#4824) · 8ac27dad

Hongzhi (Steve), Chen authored Nov 07, 2022



* [Misc] clang-format auto fix.

* blabla

* ablabla

* blabla
Co-authored-by: Steve <ubuntu@ip-172-31-34-29.ap-northeast-1.compute.internal>


[Misc] Replace /*! with /**. (#4823) · bcd37684

Hongzhi (Steve), Chen authored Nov 07, 2022



* replace

* blabla

* balbla

* blabla
Co-authored-by: Steve <ubuntu@ip-172-31-34-29.ap-northeast-1.compute.internal>


06 Nov, 2022 2 commits

[Misc] Replace \xxx with @XXX in structured comment. (#4822) · 619d735d

Hongzhi (Steve), Chen authored Nov 07, 2022



* param

* brief

* note

* return

* tparam

* brief2

* file

* return2

* return

* blabla

* all
Co-authored-by: Steve <ubuntu@ip-172-31-34-29.ap-northeast-1.compute.internal>


[Feature] Add bfloat16 (bf16) support (#4648) · 96297fb8

Xin Yao authored Nov 06, 2022

* add bf16 specializations

* remove SWITCH_BITS

* enable amp for bf16

* remove SWITCH_BITS for cpu kernels

* enable bf16 based on CUDART

* fix compiling for sm<80

* fix cpu build

* enable unit tests

* update doc

* disable test for CUDA < 11.0

* address comments

* address comments


03 Nov, 2022 2 commits

[Misc] clang-format auto fix. (#4804) · 8ae50c42

Hongzhi (Steve), Chen authored Nov 03, 2022



* [Misc] clang-format auto fix.

* manual

* manual

* manual

* manual

* todo

* fix
Co-authored-by: Steve <ubuntu@ip-172-31-34-29.ap-northeast-1.compute.internal>


[Bugfix] Fix that UVA cannot work on old GPUs (#4781) · 16e771c0
Xin Yao authored Nov 03, 2022
```
* get device pointers

* change if condition to IsPinned
```

28 Oct, 2022 1 commit

[Sampling] Enable sampling with edge masks on homogeneous graph (#4748) · 72781efb

Quan (Andy) Gan authored Oct 28, 2022

* sample neighbors with masks

* oops

* refactor again

* remove

* remove debug code

* rename macro

* address comments

* address comment

* address comments

* rename a lot of stuff

* oops
