Commits · 6862e372597d1baeca3ae17e8d7956d6c23755b1 · OpenDAS / dgl

24 May, 2023 1 commit
- Upgrade libxsmm (#5725) · 6862e372
  Andrzej Kotłowski authored May 24, 2023
```
Co-authored-by: Rhett Ying <85214957+Rhett-Ying@users.noreply.github.com>
```
  6862e372
17 May, 2023 1 commit
- [Performance Improvement] Make GPU sampling and to_block use pinned memory to... · 46af76c3
  nv-dlasalle authored May 17, 2023
```
[Performance Improvement] Make GPU sampling and to_block use pinned memory to decrease required synchronization (#5685)
```
  46af76c3
10 May, 2023 2 commits
- [Misc] Disable BF16 LibXSMM SpMM for AVX2 platforms (#5677) · ff9573c4
  Ilia Taraban authored May 10, 2023
  
  ff9573c4
- [Performance] Improve COOToCSR implementation (#5508) · e0d2250e
  Andrzej Kotłowski authored May 10, 2023
```
Co-authored-by: Hongzhi (Steve), Chen <chenhongzhi.nkcs@gmail.com>
```
  e0d2250e
28 Apr, 2023 1 commit
- [Fix] fix libxsmm build issues on Mac OS (#5626) · 78ecd508
  Ilia Taraban authored Apr 28, 2023
  
  78ecd508
26 Apr, 2023 1 commit
- [Fix] restore SpMMSumCsrNaive function for float and double (#5615) · 8ecbfa57
  Ilia Taraban authored Apr 27, 2023
  
  8ecbfa57
10 Apr, 2023 1 commit
- [Enhancement]Set default graph dataloader thread number (#5479) · c51cc82e
  peizhou001 authored Apr 10, 2023
```
Co-authored-by: Ubuntu <ubuntu@ip-172-31-16-19.ap-northeast-1.compute.internal>
```
  c51cc82e
06 Apr, 2023 1 commit
- [Feature] Add bfloat16 support for CPU (#5497) · acb4eb7e
  Ilia Taraban authored Apr 06, 2023
```
Co-authored-by: Hongzhi (Steve), Chen <chenhongzhi.nkcs@gmail.com>
```
  acb4eb7e
23 Mar, 2023 1 commit
- [Performance] Creating out buffers for `segment_mm`|`sddmm` via `torch.empty()` (#5462) · 170203ae
  Xin Yao authored Mar 23, 2023
```
* update for segmentMM

* update for sddmm

* fix a bug
```
  170203ae
15 Mar, 2023 1 commit

[Config] Enable libxsmm by default for AVX cpu (#5165) · 87fb7ed0

Daniil Sizov authored Mar 15, 2023

* Enable AVX by default

* Fix linting errors

* Fix win64 build (libxsmm not linked)

Libxsmm on Win64 is not linked, should be disabled by default

* Fix clang format issues

* Change lower supported cpu version to LIBXSMM_X86_AVX2

Change lower supported cpu version to LIBXSMM_X86_AVX2 to address https://github.com/dmlc/dgl/issues/3459 issue

* Fix unit test

Remove assumption that libxsmm is enabled in the config by default (only true for intel CPUs with AVX2 instructions)

---------
Co-authored-by: Ubuntu <ubuntu@ip-172-31-15-137.us-west-2.compute.internal>
Co-authored-by: Quan (Andy) Gan <coin2028@hotmail.com>

87fb7ed0

08 Mar, 2023 1 commit
- Fix compile error on ubuntu22.04_g++11.3.0 (#5434) · b1ec112e
  Rhett Ying authored Mar 08, 2023
  
  b1ec112e
01 Mar, 2023 1 commit
- removed pragma omp for (#5334) · 308bd6f5
  Kacper Pietkun authored Mar 01, 2023
  
  308bd6f5
23 Feb, 2023 1 commit
- [Bugfix] fixed leak in SpMMCreateBlocks (#5210) · 99937422
  Kacper Pietkun authored Feb 23, 2023
```
* fixed leak in SpMMCreateBlocks

* clang format
```
  99937422
21 Feb, 2023 1 commit
- [Enhancement] Change id hash map (#5304) · ed2e5409
  peizhou001 authored Feb 21, 2023
```
* change concurrent id hash map
```
  ed2e5409
16 Feb, 2023 1 commit
- [Misc] Fix build warnings (#5303) · 1329be96
  Songqing Zhang authored Feb 16, 2023
```
Co-authored-by: songqing.zhang <songqing.zhang@shopee.com>
```
  1329be96
13 Feb, 2023 1 commit

enable sparse on windows and mac (#5277) · f62669b0

Quan (Andy) Gan authored Feb 13, 2023

* enable sparse on windows and mac

* that was stupid

* let's see what's going on..

* [Sparse] Fix the import error on Mac OS.

When using template functions that are defined in source files from DGL,
the loader of MacOS somehow cannot find their definitions. This fix simply
avoids depending on template functions from DGL headers.

With this fix, the sparse tests all pass on the MAC environment.

* ok this is the problem

* make errors clearer

* uh

* test

* Update __init__.py

* disabling ddp on windows

---------
Co-authored-by: czkkkkkk <zekucai@gmail.com>

f62669b0

09 Feb, 2023 1 commit
- [Performance]Add concurrent cpu id hashmap (#5241) · f0b7cc96
  peizhou001 authored Feb 09, 2023
```
Add Id hash map
```
  f0b7cc96
12 Jan, 2023 1 commit
- [Bugfix] Replace global cudaStream in Filter with runtime calls (fix #5153) (#5157) · 751b4c26
  nv-dlasalle authored Jan 12, 2023
```
* Add failing unit test

* Add fix

* Remove extra newline

* skip cpu test
Co-authored-by: Xin Yao <yaox12@outlook.com>
```
  751b4c26
06 Jan, 2023 1 commit
- [Performance] Fix for number of threads in COOToCSR (#5017) · 6069f34c
  Andrzej Kotłowski authored Jan 06, 2023
```
Co-authored-by: Hongzhi (Steve), Chen <chenhongzhi.nkcs@gmail.com>
```
  6069f34c
15 Dec, 2022 1 commit
- [Sparse] Add SpMM and SDDMM on CSR and COO in dgl include headers (#5016) · 08b60eb1
  czkkkkkk authored Dec 15, 2022
  
  08b60eb1
12 Dec, 2022 2 commits
- Revert "[Sparse] Add SpMM and SDDMM." (#5014) · d02e560e
  czkkkkkk authored Dec 12, 2022
```
* Revert "[Sparse] Add SpMM and SDDMM. (#4999)"

This reverts commit 15365d78.

* lint
```
  d02e560e
- [Sparse] Add SpMM and SDDMM. (#4999) · 15365d78
  czkkkkkk authored Dec 12, 2022
```
* [Sparse] Add SpMM and SDDMM

* Update

* Add CSR and CSC SpMM tests
```
  15365d78
09 Dec, 2022 1 commit

[Bugfix] Fix empty tensors may being treated as pinned (#5005) · aad3bd04

Xin Yao authored Dec 09, 2022

* fix empty tensor is treated as pinned

* avoid calling cudaHostGetDevicePointer on nullptr

* update empty array

* add a comment

aad3bd04

06 Dec, 2022 1 commit

Add support for next cusparse release (#4974) · fb223d47

Chang Liu authored Dec 05, 2022

* Add support for next cusparse release

* Fix lint

* Add switch and tune the performance

* Fix lint issue

* Fine tune the heuristics

* Fix lint issue

* Address comments

* Minor fix

* Address comments

fb223d47

01 Dec, 2022 1 commit

[Feature] replace dgl PRNG with pcg32 (#4807) · b1e2695f

Muhammed Fatih BALIN authored Nov 30, 2022

* replace dgl PRNG with pcg32

* remove pcg submodule, add a simple implementation

* replace pcg32 with std::mt19937_64

* fix include order

* change RandomEngine to pcg32

* Remove custom pcg32 implementation, use the submodule provided by the original author.

* minor bug

* move include for linting

* include pcg for tests too

Co-authored-by: Hongzhi (Steve), Chen <chenhongzhi.nkcs@gmail.com>

b1e2695f

24 Nov, 2022 1 commit
- [Cleanup] Remove duplicated _IndexSelect (#4874) · c59000ac
  Xin Yao authored Nov 24, 2022
  
  c59000ac
22 Nov, 2022 2 commits

[Performance] Leverage hashmap to accelerate CSRSliceMatrix<kDGLCUDA, IdType> (#4924) · aa419895

Ping Gong authored Nov 22, 2022

* Leverage hashmap to accelerate CSRSliceMatrix

* fix lint check

* use `min` in cuda_runtime.ch

* fix hash func

* add some comments and adjust the <grid,block> of the _SegmentMaskColKernel kernel

* set device and stream for thrust::for_each

* use thrust::cuda::par_nosync

Co-authored-by: Xin Yao <xiny@nvidia.com>

aa419895

[Feature] (La)yer-Neigh(bor) sampling implementation (#4668) · bf264d00

Muhammed Fatih BALIN authored Nov 21, 2022

* adding LABOR sampling

* add ladies and pladies samplers

* fix compile error after rebase

* add reference for ladies sampler

* Improve ladies implementation.

* weighted labor sampling initial implementation draft
fix indentation and small bug in ladies script

* importance_sampling currently doesn't work with weights

* fix weighted importance sampling

* move labor example into its own folder

* lint fixes

* Improve documentation

* remove examples from the main PR

* fix linting by not using c++17 features

* fix documentation of labor_sampler.py

* update documentation for labor.py

* reformat the labor.py file with black

* fix linting errors

* replace exception use with if

* fix typo in error comment

* fixing win64 build for ci

* fixing weighted implementation, works now.

* fix bug in the weighted case and importance_sampling==0

* address part of the reviews

* remove unused code paths from cuda

* remove unused code path from cpu side

* remove extra features of labor making use of random seed.

* fix exclude_edges bug

* remove pcg and seed logic from cpu implementation, seed logic should still work for cuda.

* minor style change

* refactor CPU implementation, take out the importance_sampling probability computation into a function.

* improve CUDAWorkspaceAllocator

* refactor importance_sampling part out to a function

* minor optimization

* fix linting issue

* Revert "remove pcg and seed logic from cpu implementation, seed logic should still work for cuda."

This reverts commit c250e07ac6d7e13f57e79e8a2c2f098d777378c2.

* Revert "remove extra features of labor making use of random seed."

This reverts commit 7f99034353080308f4783f27d9a08bea343fb796.

* fix the documentation

* disable NIDs

* improve the documentation in the code

* use the stream argument in pcg32 instead of skipping ahead t times, can discard the use of hashmap now since it is faster this way.

* fix linting issue

* address another round of reviews

* further optimize CPU LABOR sampling implementation

* fix linting error

* update the comment

* reformat

* rename and rephrase comment

* fix formatting according to new linting specs

* fix compile error due to renaming, fix linting.

* lint

* rename DGLHeteroGraph to DGLGraph to match master

* replace other occurrences of DGLHeteroGraph to DGLGraph

Co-authored-by: Muhammed Fatih BALIN <m.f.balin@gmail.com>
Co-authored-by: Kaan Sancak <kaansnck@gmail.com>
Co-authored-by: Quan Gan <coin2028@hotmail.com>

bf264d00

15 Nov, 2022 4 commits
- Revert "[Kernel] Parallel find edges (#4878)" (#4899) · ca144886
  Quan (Andy) Gan authored Nov 15, 2022
```
This reverts commit 00c27cb2.
```
  ca144886
- Revert "[Performance] Make IdHashMap parallel (#4881)" (#4898) · 5b193f9b
  Quan (Andy) Gan authored Nov 15, 2022
```
This reverts commit 56962858.
```
  5b193f9b
- [Performance] Make IdHashMap parallel (#4881) · 56962858
  Quan (Andy) Gan authored Nov 15, 2022
```
* make IdHashMap parallel

* fix

* Update array_utils.h
```
  56962858
- [Kernel] Parallel find edges (#4878) · 00c27cb2
  Quan (Andy) Gan authored Nov 15, 2022
```
* use runtime parallel_for

* grain size

* Update array_index_select.cc
```
  00c27cb2
10 Nov, 2022 1 commit

[Bugfix] Fix that half-precision SpMM produce incorrect results (#4842) · a8f9d5ef

Xin Yao authored Nov 10, 2022

* update accumulator

* rename half to __half

* add bfloat16

* simplify code

* fix another case

* add unit test

* disable half-precision SpMMCoo

* fix lint

a8f9d5ef

08 Nov, 2022 2 commits

[Misc] Minor code style fix. (#4843) · cb5e3489

Hongzhi (Steve), Chen authored Nov 08, 2022

* [Misc] Change the max line length for cpp to 80 in lint.

* blabla

* blabla

* blabla

* ablabla

Co-authored-by: Steve <ubuntu@ip-172-31-34-29.ap-northeast-1.compute.internal>

cb5e3489

[Misc] Add // NOLINT for the very long code. (#4834) · 0d687968

Hongzhi (Steve), Chen authored Nov 08, 2022

* alternative

* fix

* remove_todo

* blabl

* ablabl

Co-authored-by: Steve <ubuntu@ip-172-31-34-29.ap-northeast-1.compute.internal>

0d687968

07 Nov, 2022 4 commits

[Misc] clang-format auto fix. (#4831) · 889798fe

Hongzhi (Steve), Chen authored Nov 07, 2022

* [Misc] clang-format auto fix.

* blabla

* nolint

* blabla

Co-authored-by: Steve <ubuntu@ip-172-31-34-29.ap-northeast-1.compute.internal>

889798fe

[Misc] Minor code style fix. (#4825) · df089424

Hongzhi (Steve), Chen authored Nov 07, 2022

* blabla

* more

* blabla

* blabla

* ablabla

* blabla

Co-authored-by: Steve <ubuntu@ip-172-31-34-29.ap-northeast-1.compute.internal>

df089424

[Misc] clang-format auto fix. (#4824) · 8ac27dad

Hongzhi (Steve), Chen authored Nov 07, 2022

* [Misc] clang-format auto fix.

* blabla

* ablabla

* blabla

Co-authored-by: Steve <ubuntu@ip-172-31-34-29.ap-northeast-1.compute.internal>

8ac27dad

[Misc] Replace /*! with /**. (#4823) · bcd37684

Hongzhi (Steve), Chen authored Nov 07, 2022

* replace

* blabla

* balbla

* blabla

Co-authored-by: Steve <ubuntu@ip-172-31-34-29.ap-northeast-1.compute.internal>

bcd37684

06 Nov, 2022 1 commit

[Misc] Replace \xxx with @XXX in structured comment. (#4822) · 619d735d

Hongzhi (Steve), Chen authored Nov 07, 2022

* param

* brief

* note

* return

* tparam

* brief2

* file

* return2

* return

* blabla

* all

Co-authored-by: Steve <ubuntu@ip-172-31-34-29.ap-northeast-1.compute.internal>

619d735d