- 24 May, 2023 1 commit
-
-
Andrzej Kotłowski authored
Co-authored-by:Rhett Ying <85214957+Rhett-Ying@users.noreply.github.com>
-
- 17 May, 2023 1 commit
-
-
nv-dlasalle authored
[Performance Improvement] Make GPU sampling and to_block use pinned memory to decrease required synchronization (#5685)
-
- 10 May, 2023 2 commits
-
-
Ilia Taraban authored
-
Andrzej Kotłowski authored
Co-authored-by:Hongzhi (Steve), Chen <chenhongzhi.nkcs@gmail.com>
-
- 28 Apr, 2023 1 commit
-
-
Ilia Taraban authored
-
- 26 Apr, 2023 1 commit
-
-
Ilia Taraban authored
-
- 10 Apr, 2023 1 commit
-
-
peizhou001 authored
Co-authored-by:Ubuntu <ubuntu@ip-172-31-16-19.ap-northeast-1.compute.internal>
-
- 06 Apr, 2023 1 commit
-
-
Ilia Taraban authored
Co-authored-by:Hongzhi (Steve), Chen <chenhongzhi.nkcs@gmail.com>
-
- 23 Mar, 2023 1 commit
-
-
Xin Yao authored
* update for segmentMM * update for sddmm * fix a bug
-
- 15 Mar, 2023 1 commit
-
-
Daniil Sizov authored
* Enable AVX by default * Fix linting errors * Fix win64 build (libxsmm not linked) Libxsmm on Win64 is not linked, should be disabled by default * Fix clang format issues * Change lower supported cpu version to LIBXSMM_X86_AVX2 Change lower supported cpu version to LIBXSMM_X86_AVX2 to address https://github.com/dmlc/dgl/issues/3459 issue * Fix unit test Remove assumption that libxsmm is enabled in the config by default (only true for intel CPUs with AVX2 instructions) --------- Co-authored-by:
Ubuntu <ubuntu@ip-172-31-15-137.us-west-2.compute.internal> Co-authored-by:
Quan (Andy) Gan <coin2028@hotmail.com>
-
- 08 Mar, 2023 1 commit
-
-
Rhett Ying authored
-
- 01 Mar, 2023 1 commit
-
-
Kacper Pietkun authored
-
- 23 Feb, 2023 1 commit
-
-
Kacper Pietkun authored
* fixed leak in SpMMCreateBlocks * clang format
-
- 21 Feb, 2023 1 commit
-
-
peizhou001 authored
* change concurrent id hash map
-
- 16 Feb, 2023 1 commit
-
-
Songqing Zhang authored
Co-authored-by:songqing.zhang <songqing.zhang@shopee.com>
-
- 13 Feb, 2023 1 commit
-
-
Quan (Andy) Gan authored
* enable sparse on windows and mac * that was stupid * let's see what's going on.. * [Sparse] Fix the import error on Mac OS. When using template functions that are defined in source files from DGL, the loader of MacOS somehow cannot find their definitions. This fix simply avoids depending on template functions from DGL headers. With this fix, the sparse tests all pass on the MAC environment. * ok this is the problem * make errors clearer * uh * test * Update __init__.py * disabling ddp on windows --------- Co-authored-by:czkkkkkk <zekucai@gmail.com>
-
- 09 Feb, 2023 1 commit
-
-
peizhou001 authored
Add Id hash map
-
- 12 Jan, 2023 1 commit
-
-
nv-dlasalle authored
* Add failing unit test * Add fix * Remove extra newline * skip cpu test Co-authored-by:Xin Yao <yaox12@outlook.com>
-
- 06 Jan, 2023 1 commit
-
-
Andrzej Kotłowski authored
Co-authored-by:Hongzhi (Steve), Chen <chenhongzhi.nkcs@gmail.com>
-
- 15 Dec, 2022 1 commit
-
-
czkkkkkk authored
-
- 12 Dec, 2022 2 commits
-
-
czkkkkkk authored
* [Sparse] Add SpMM and SDDMM * Update * Add CSR and CSC SpMM tests
- 09 Dec, 2022 1 commit
-
-
Xin Yao authored
* fix empty tensor is treated as pinned * avoid calling cudaHostGetDevicePointer on nullptr * update empty array * add a comment
-
- 06 Dec, 2022 1 commit
-
-
Chang Liu authored
* Add support for next cusparse release * Fix lint * Add switch and tune the performance * Fix lint issue * Fine tune the heuristics * Fix lint issue * Address comments * Minor fix * Address comments
-
- 01 Dec, 2022 1 commit
-
-
Muhammed Fatih BALIN authored
* replace dgl PRNG with pcg32 * remove pcg submodule, add a simple implementation * replace pcg32 with std::mt19937_64 * fix include order * change RandomEngine to pcg32 * Remove custom pcg32 implementation, use the submodule provided by the original author. * minor bug * move include for linting * include pcg for tests too Co-authored-by:Hongzhi (Steve), Chen <chenhongzhi.nkcs@gmail.com>
-
- 24 Nov, 2022 1 commit
-
-
Xin Yao authored
-
- 22 Nov, 2022 2 commits
-
-
Ping Gong authored
* Leverage hashmap to accelerate CSRSliceMatrix * fix lint check * use `min` in cuda_runtime.ch * fix hash func * add some comments and adjust the <grid,block> of the _SegmentMaskColKernel kernel * set device and stream for thrust::for_each * use thrust::cuda::par_nosync Co-authored-by:Xin Yao <xiny@nvidia.com>
-
Muhammed Fatih BALIN authored
* adding LABOR sampling * add ladies and pladies samplers * fix compile error after rebase * add reference for ladies sampler * Improve ladies implementation. * weighted labor sampling initial implementation draft fix indentation and small bug in ladies script * importance_sampling currently doesn't work with weights * fix weighted importance sampling * move labor example into its own folder * lint fixes * Improve documentation * remove examples from the main PR * fix linting by not using c++17 features * fix documentation of labor_sampler.py * update documentation for labor.py * reformat the labor.py file with black * fix linting errors * replace exception use with if * fix typo in error comment * fixing win64 build for ci * fixing weighted implementation, works now. * fix bug in the weighted case and importance_sampling==0 * address part of the reviews * remove unused code paths from cuda * remove unused code path from cpu side * remove extra features of labor making use of random seed. * fix exclude_edges bug * remove pcg and seed logic from cpu implementation, seed logic should still work for cuda. * minor style change * refactor CPU implementation, take out the importance_sampling probability computation into a function. * improve CUDAWorkspaceAllocator * refactor importance_sampling part out to a function * minor optimization * fix linting issue * Revert "remove pcg and seed logic from cpu implementation, seed logic should still work for cuda." This reverts commit c250e07ac6d7e13f57e79e8a2c2f098d777378c2. * Revert "remove extra features of labor making use of random seed." This reverts commit 7f99034353080308f4783f27d9a08bea343fb796. * fix the documentation * disable NIDs * improve the documentation in the code * use the stream argument in pcg32 instead of skipping ahead t times, can discard the use of hashmap now since it is faster this way. * fix linting issue * address another round of reviews * further optimize CPU LABOR sampling implementation * fix linting error * update the comment * reformat * rename and rephrase comment * fix formatting according to new linting specs * fix compile error due to renaming, fix linting. * lint * rename DGLHeteroGraph to DGLGraph to match master * replace other occurrences of DGLHeteroGraph to DGLGraph Co-authored-by:
Muhammed Fatih BALIN <m.f.balin@gmail.com> Co-authored-by:
Kaan Sancak <kaansnck@gmail.com> Co-authored-by:
Quan Gan <coin2028@hotmail.com>
-
- 15 Nov, 2022 4 commits
-
-
Quan (Andy) Gan authored
This reverts commit 00c27cb2.
-
Quan (Andy) Gan authored
This reverts commit 56962858.
-
Quan (Andy) Gan authored
* make IdHashMap parallel * fix * Update array_utils.h
-
Quan (Andy) Gan authored
* use runtime parallel_for * grain size * Update array_index_select.cc
-
- 10 Nov, 2022 1 commit
-
-
Xin Yao authored
* update accumulator * rename half to __half * add bfloat16 * simplify code * fix another case * add unit test * disable half-precision SpMMCoo * fix lint
-
- 08 Nov, 2022 2 commits
-
-
Hongzhi (Steve), Chen authored
* [Misc] Change the max line length for cpp to 80 in lint. * blabla * blabla * blabla * ablabla Co-authored-by:Steve <ubuntu@ip-172-31-34-29.ap-northeast-1.compute.internal>
-
Hongzhi (Steve), Chen authored
* alternative * fix * remove_todo * blabl * ablabl Co-authored-by:Steve <ubuntu@ip-172-31-34-29.ap-northeast-1.compute.internal>
-
- 07 Nov, 2022 4 commits
-
-
Hongzhi (Steve), Chen authored
* [Misc] clang-format auto fix. * blabla * nolint * blabla Co-authored-by:Steve <ubuntu@ip-172-31-34-29.ap-northeast-1.compute.internal>
-
Hongzhi (Steve), Chen authored
* blabla * more * blabla * blabla * ablabla * blabla Co-authored-by:Steve <ubuntu@ip-172-31-34-29.ap-northeast-1.compute.internal>
-
Hongzhi (Steve), Chen authored
* [Misc] clang-format auto fix. * blabla * ablabla * blabla Co-authored-by:Steve <ubuntu@ip-172-31-34-29.ap-northeast-1.compute.internal>
-
Hongzhi (Steve), Chen authored
* replace * blabla * balbla * blabla Co-authored-by:Steve <ubuntu@ip-172-31-34-29.ap-northeast-1.compute.internal>
-
- 06 Nov, 2022 1 commit
-
-
Hongzhi (Steve), Chen authored
* param * brief * note * return * tparam * brief2 * file * return2 * return * blabla * all Co-authored-by:Steve <ubuntu@ip-172-31-34-29.ap-northeast-1.compute.internal>
-