You need to sign in or sign up before continuing.
- 11 Apr, 2023 1 commit
-
-
Chang Liu authored
-
- 10 Apr, 2023 2 commits
-
-
Tianqi Zhang (张天启) authored
-
peizhou001 authored
Co-authored-by:Ubuntu <ubuntu@ip-172-31-16-19.ap-northeast-1.compute.internal>
-
- 06 Apr, 2023 1 commit
-
-
Ilia Taraban authored
Co-authored-by:Hongzhi (Steve), Chen <chenhongzhi.nkcs@gmail.com>
-
- 23 Mar, 2023 1 commit
-
-
Xin Yao authored
* update for segmentMM * update for sddmm * fix a bug
-
- 20 Mar, 2023 1 commit
-
-
Hongzhi (Steve), Chen authored
Co-authored-by:Ubuntu <ubuntu@ip-172-31-28-63.ap-northeast-1.compute.internal>
-
- 15 Mar, 2023 1 commit
-
-
Daniil Sizov authored
* Enable AVX by default * Fix linting errors * Fix win64 build (libxsmm not linked) Libxsmm on Win64 is not linked, should be disabled by default * Fix clang format issues * Change lower supported cpu version to LIBXSMM_X86_AVX2 Change lower supported cpu version to LIBXSMM_X86_AVX2 to address https://github.com/dmlc/dgl/issues/3459 issue * Fix unit test Remove assumption that libxsmm is enabled in the config by default (only true for intel CPUs with AVX2 instructions) --------- Co-authored-by:
Ubuntu <ubuntu@ip-172-31-15-137.us-west-2.compute.internal> Co-authored-by:
Quan (Andy) Gan <coin2028@hotmail.com>
-
- 08 Mar, 2023 2 commits
-
-
Xin Yao authored
* expose GeneratePermutation * add sparse_all_to_all_push * add sparse_all_to_all_pull * add unit test * handle world_size=1 * remove python nccl wrapper * remove the nccl dependency * use pinned memory to speedup D2H copy * fix lint * resolve comments * fix lint * fix ut * resolve comments
-
Rhett Ying authored
-
- 07 Mar, 2023 1 commit
-
-
peizhou001 authored
-
- 01 Mar, 2023 1 commit
-
-
Kacper Pietkun authored
-
- 27 Feb, 2023 1 commit
-
-
peizhou001 authored
-
- 23 Feb, 2023 1 commit
-
-
Kacper Pietkun authored
* fixed leak in SpMMCreateBlocks * clang format
-
- 22 Feb, 2023 1 commit
-
-
peizhou001 authored
-
- 21 Feb, 2023 1 commit
-
-
peizhou001 authored
* change concurrent id hash map
-
- 16 Feb, 2023 2 commits
-
-
Songqing Zhang authored
Co-authored-by:songqing.zhang <songqing.zhang@shopee.com>
-
nv-dlasalle authored
[bugfix] Fix assertions in /src/runtime/workspace.h and expand unit tests for sparse optimizer (#5299) * Fix assertions for size 0 workspaces * Expand unit test to cover case of communication * Fixes * Format * Fix c++ formatting
-
- 13 Feb, 2023 1 commit
-
-
Quan (Andy) Gan authored
* enable sparse on windows and mac * that was stupid * let's see what's going on.. * [Sparse] Fix the import error on Mac OS. When using template functions that are defined in source files from DGL, the loader of MacOS somehow cannot find their definitions. This fix simply avoids depending on template functions from DGL headers. With this fix, the sparse tests all pass on the MAC environment. * ok this is the problem * make errors clearer * uh * test * Update __init__.py * disabling ddp on windows --------- Co-authored-by:czkkkkkk <zekucai@gmail.com>
-
- 09 Feb, 2023 1 commit
-
-
peizhou001 authored
Add Id hash map
-
- 12 Jan, 2023 1 commit
-
-
nv-dlasalle authored
* Add failing unit test * Add fix * Remove extra newline * skip cpu test Co-authored-by:Xin Yao <yaox12@outlook.com>
-
- 06 Jan, 2023 1 commit
-
-
Andrzej Kotłowski authored
Co-authored-by:Hongzhi (Steve), Chen <chenhongzhi.nkcs@gmail.com>
-
- 15 Dec, 2022 1 commit
-
-
czkkkkkk authored
-
- 12 Dec, 2022 2 commits
-
-
czkkkkkk authored
* [Sparse] Add SpMM and SDDMM * Update * Add CSR and CSC SpMM tests
- 09 Dec, 2022 1 commit
-
-
Xin Yao authored
* fix empty tensor is treated as pinned * avoid calling cudaHostGetDevicePointer on nullptr * update empty array * add a comment
-
- 06 Dec, 2022 1 commit
-
-
Chang Liu authored
* Add support for next cusparse release * Fix lint * Add switch and tune the performance * Fix lint issue * Fine tune the heuristics * Fix lint issue * Address comments * Minor fix * Address comments
-
- 01 Dec, 2022 1 commit
-
-
Muhammed Fatih BALIN authored
* replace dgl PRNG with pcg32 * remove pcg submodule, add a simple implementation * replace pcg32 with std::mt19937_64 * fix include order * change RandomEngine to pcg32 * Remove custom pcg32 implementation, use the submodule provided by the original author. * minor bug * move include for linting * include pcg for tests too Co-authored-by:Hongzhi (Steve), Chen <chenhongzhi.nkcs@gmail.com>
-
- 24 Nov, 2022 1 commit
-
-
Xin Yao authored
-
- 22 Nov, 2022 2 commits
-
-
Ping Gong authored
* Leverage hashmap to accelerate CSRSliceMatrix * fix lint check * use `min` in cuda_runtime.ch * fix hash func * add some comments and adjust the <grid,block> of the _SegmentMaskColKernel kernel * set device and stream for thrust::for_each * use thrust::cuda::par_nosync Co-authored-by:Xin Yao <xiny@nvidia.com>
-
Muhammed Fatih BALIN authored
* adding LABOR sampling * add ladies and pladies samplers * fix compile error after rebase * add reference for ladies sampler * Improve ladies implementation. * weighted labor sampling initial implementation draft fix indentation and small bug in ladies script * importance_sampling currently doesn't work with weights * fix weighted importance sampling * move labor example into its own folder * lint fixes * Improve documentation * remove examples from the main PR * fix linting by not using c++17 features * fix documentation of labor_sampler.py * update documentation for labor.py * reformat the labor.py file with black * fix linting errors * replace exception use with if * fix typo in error comment * fixing win64 build for ci * fixing weighted implementation, works now. * fix bug in the weighted case and importance_sampling==0 * address part of the reviews * remove unused code paths from cuda * remove unused code path from cpu side * remove extra features of labor making use of random seed. * fix exclude_edges bug * remove pcg and seed logic from cpu implementation, seed logic should still work for cuda. * minor style change * refactor CPU implementation, take out the importance_sampling probability computation into a function. * improve CUDAWorkspaceAllocator * refactor importance_sampling part out to a function * minor optimization * fix linting issue * Revert "remove pcg and seed logic from cpu implementation, seed logic should still work for cuda." This reverts commit c250e07ac6d7e13f57e79e8a2c2f098d777378c2. * Revert "remove extra features of labor making use of random seed." This reverts commit 7f99034353080308f4783f27d9a08bea343fb796. * fix the documentation * disable NIDs * improve the documentation in the code * use the stream argument in pcg32 instead of skipping ahead t times, can discard the use of hashmap now since it is faster this way. * fix linting issue * address another round of reviews * further optimize CPU LABOR sampling implementation * fix linting error * update the comment * reformat * rename and rephrase comment * fix formatting according to new linting specs * fix compile error due to renaming, fix linting. * lint * rename DGLHeteroGraph to DGLGraph to match master * replace other occurrences of DGLHeteroGraph to DGLGraph Co-authored-by:
Muhammed Fatih BALIN <m.f.balin@gmail.com> Co-authored-by:
Kaan Sancak <kaansnck@gmail.com> Co-authored-by:
Quan Gan <coin2028@hotmail.com>
-
- 15 Nov, 2022 4 commits
-
-
Quan (Andy) Gan authored
This reverts commit 00c27cb2.
-
Quan (Andy) Gan authored
This reverts commit 56962858.
-
Quan (Andy) Gan authored
* make IdHashMap parallel * fix * Update array_utils.h
-
Quan (Andy) Gan authored
* use runtime parallel_for * grain size * Update array_index_select.cc
-
- 10 Nov, 2022 1 commit
-
-
Xin Yao authored
* update accumulator * rename half to __half * add bfloat16 * simplify code * fix another case * add unit test * disable half-precision SpMMCoo * fix lint
-
- 08 Nov, 2022 2 commits
-
-
Hongzhi (Steve), Chen authored
* [Misc] Change the max line length for cpp to 80 in lint. * blabla * blabla * blabla * ablabla Co-authored-by:Steve <ubuntu@ip-172-31-34-29.ap-northeast-1.compute.internal>
-
Hongzhi (Steve), Chen authored
* alternative * fix * remove_todo * blabl * ablabl Co-authored-by:Steve <ubuntu@ip-172-31-34-29.ap-northeast-1.compute.internal>
-
- 07 Nov, 2022 3 commits
-
-
Hongzhi (Steve), Chen authored
* [Misc] clang-format auto fix. * blabla * nolint * blabla Co-authored-by:Steve <ubuntu@ip-172-31-34-29.ap-northeast-1.compute.internal>
-
Hongzhi (Steve), Chen authored
* blabla * more * blabla * blabla * ablabla * blabla Co-authored-by:Steve <ubuntu@ip-172-31-34-29.ap-northeast-1.compute.internal>
-
Hongzhi (Steve), Chen authored
* [Misc] clang-format auto fix. * blabla * ablabla * blabla Co-authored-by:Steve <ubuntu@ip-172-31-34-29.ap-northeast-1.compute.internal>
-