"examples/pytorch/git@developer.sourcefind.cn:OpenDAS/dgl.git" did not exist on "20e1bb455b0304d6d39125b9d4f528fe17947c27"
- 15 Feb, 2022 1 commit
-
-
Israt Nisa authored
* init * init * working cublasGemm * benchmark high-mem/low-mem, err gather_mm output * cuda kernel for bmm like kernel * removed cpu copy for E_per_Rel * benchmark code from Minjie * fixed cublas results in gathermm sorted * use GPU shared mem in unsorted gather mm * minor * Added an optimal version of gather_mm_unsorted * lint * init gather_mm_scatter * cublas transpose added * fixed h_offset for multiple rel * backward unittest * cublas support to transpose W * adding missed file * forgot to add header file * lint * lint * cleanup * lint * docstring * lint * added unittest * lint * lint * unittest * changed err type * skip cpu test * skip CPU code * move in-len loop inside * lint * added check different dim length for B * w_per_len is optional now * moved gather_mm to pytorch/backend with backward support * removed a_/b_trans support * transpose op inside GEMM call * removed out alloc from API, changed W 2D to 3D * Added se_gather_mm, Separate API for sortedE * Fixed gather_mm (unsorted) user interface * unsorted gmm backward + separate CAPI for un/sorted A * typecast to float to support atomicAdd * lint typecast * lint * added gather_mm_scatter * minor * const * design changes * Added idx_a, idx_b support gmm_scatter * dgl doc * lint * adding gather_mm in ops * lint * lint * minor * removed benchmark files * minor * empty commit Co-authored-by:Israt Nisa <nisisrat@amazon.com>
-
- 11 Feb, 2022 1 commit
-
-
ranzhejiang authored
* [feature] edge softmax refact. * delete file * fix backward and cmake version * fix backward * format function * fix setting * refix * refix * refix * refix * refix * refix * refix * refix * refix * refix * refix * refix * add cuda kernel for backward and rename some function * add benchmark for edge_softmax * fix format * remove cuda_backwrd * fix code format and add comment for op on CPU * fix lint Co-authored-by:Jinjing Zhou <VoVAllen@users.noreply.github.com>
-
- 03 Dec, 2021 1 commit
-
-
Israt Nisa authored
* min/max support for forward CPU heterograph * Added etype with each argU values * scatter_add needs fix * added scatter_add_hetero. Grads dont match for max reducer * storing ntype in argX * fixing scatter_add_hetero * hetero matches with torch's scatter add * works copy_e forward+cpu * added backward for copy_rhs * Computes gradient for all node types in one kernel * bug fix * unnitest for max/min on CPU * renamed scatter_add_hetero to update_grad_minmax_hetero * lint check and comment out cuda call for max. Code is for CPU only * lint check * replace inf with zero * minor * lint check * removed LIBXSMM code from hetro code * fixing backward operator of UpdateGradMinMaxHetero * removed backward from update_grad_minmax_hetero * docstring * improved docstring and coding style * Added pass by pointer for output * typos and pass by references * Support for copy_rhs * Added header <string> * fix bug in copy_u_max * Added comments and dimension check of all etypes * skip mxnet check * pass by pointer output arrays * updated docstring Co-authored-by:
Israt Nisa <nisisrat@amazon.com> Co-authored-by:
Quan (Andy) Gan <coin2028@hotmail.com>
-
- 17 Nov, 2021 1 commit
-
-
Israt Nisa authored
* Added SDDMMCOO_hetero support * removed redundant CUDA kernels * added benchmark for regression test * fix * fixed bug for single src node type Co-authored-by:
Israt Nisa <nisisrat@amazon.com> Co-authored-by:
Quan (Andy) Gan <coin2028@hotmail.com>
-
- 22 Jun, 2021 1 commit
-
-
Israt Nisa authored
* Added heterograph support SpMM, SDDMM * bug fix cuda stream * add cudaStrm destroy and fix whitespace * Added heterograph support SpMM, SDDMM * bug fix cuda stream * add cudaStrm destroy and fix whitespace * changed max stream = 1 * Fixed ctx * using default stream * Added heterograph support SpMM, SDDMM * bug fix cuda stream * add cudaStrm destroy and fix whitespace * changed max stream = 1 * Fixed ctx * using default stream * fix bug in copy_rhs * changed by mistake * minor datatype change * added datatype check Co-authored-by:Israt Nisa <nisisrat@amazon.com>
-
- 03 Jun, 2021 1 commit
-
-
Israt Nisa authored
* SpMM for heterograph * C APIs SDDMM heterograph * passes initial result * renamed eid with nid * aggregation on same ntype for multiple etypes * fix link check failure * lint check part 2 * lint check part 3 * Fixed SpMMCmpCsr Min op * added mem references * fixed fill(Max/Min), added const * removed newline * brought back docstring Co-authored-by:
Israt Nisa <nisisrat@amazon.com> Co-authored-by:
Da Zheng <zhengda1936@gmail.com>
-
- 27 Apr, 2021 1 commit
-
-
Israt Nisa authored
* init cuda support * cuSPARSE err * passed unittest for csr_mm/SpGEMM. int64 not supported * Debugging cuSPARSE error 3 * csrgeam only supports int32? * disabling int64 for cuda * refactor and add CSRMask * lint * oops * remove todo * rewrite CSRMask with CSRGetData * lint * fix test * address comments * lint * fix * addresses comments and rename BUG_ON Co-authored-by:
Israt Nisa <nisisrat@amazon.com> Co-authored-by:
Ubuntu <ubuntu@ip-172-31-30-71.ec2.internal> Co-authored-by:
Quan Gan <coin2028@hotmail.com> Co-authored-by:
Jinjing Zhou <VoVAllen@users.noreply.github.com> Co-authored-by:
Minjie Wang <wmjlyjemaine@gmail.com>
-
- 24 Mar, 2021 1 commit
-
-
Quan (Andy) Gan authored
* test * more stuff * add test * fixes * optimize algo * replace unordered_map with arrays * lint * lint x2 * oops * disable gpu csrmm tests * remove gpu invocation * optimize with openmp * remove python functions * add back with docstrings * lint * lint * update python interface * functionize * functionize * lint * lint
-
- 21 Feb, 2021 1 commit
-
-
Zihao Ye authored
* upd * fix * upd * upd * upd * upd * upd * upd * upd * upd * upd * fix * refactor * upd test * large feat_len or n in segment reduce * lint
-
- 28 Jan, 2021 1 commit
-
-
Zihao Ye authored
* add tvm as submodule * compilation is ok but calling fails * can call now * pack multiple modules, change names * upd * upd * upd * fix cmake * upd * upd * upd * upd * fix * relative path * upd * upd * upd * singleton * upd * trigger * fix * upd * count reducible * upd * upd * upd * upd * upd * upd * upd * upd * upd * only keep related files * upd * upd * upd * upd * lint * lint * lint * lint * pylint * upd * upd * compilation * fix * upd * upd * upd * upd * upd * upd * upd doc * refactor * fix * upd number Co-authored-by:
Zhi Lin <linzhilynn@gmail.com> Co-authored-by:
Ubuntu <ubuntu@ip-172-31-42-78.us-east-2.compute.internal> Co-authored-by:
Ubuntu <ubuntu@ip-172-31-21-156.us-east-2.compute.internal> Co-authored-by:
Jinjing Zhou <VoVAllen@users.noreply.github.com>
-
- 22 Nov, 2020 1 commit
-
-
Zihao Ye authored
* upd * upd * update * upd * upd * upd * fix * lint * lint * pylint * doc
-
- 13 Jul, 2020 1 commit
-
-
Zihao Ye authored
* init * reverse(by minjie * upd * upd * upd * upd * upd * upd * upd * gpu * upd * upd * upd * upd * udp * upd * upd * imidiot * fix * upd * upd * upd * udp * upd * upd * fix * udp * upd * upd * upd * upd * upd * fix * remove redundency * upd * upd * upd * cache * upd * upd * upd * upd * upd * udp * upd * trigger * upd * fix * upd * unused import * upd * upd
-
- 28 Jun, 2020 1 commit
-
-
Minjie Wang authored
* add cub; array cumsum * CSRSliceRows * fix warning * operator << for ndarray; CSRSliceRows * add CSRIsSorted * add csr_sort * inplace coosort and outplace csrsort * WIP: coo is sorted * mv cuda_utils * add AllTrue utility * csr sort * coo sort * coo2csr for sorted coo arrays * CSRToCOO from sorted * pass tests for the new kernel changes * cannot use inplace sort * lint * try fix msvc error * Fix g.copy_to and g.asnumbits; ToBlock no longer uses CSC * stash * revert some hack * revert some changes * address comments * fix * fix to_block unittest * add todo note
-
- 22 Jun, 2020 1 commit
-
-
Zihao Ye authored
* udp * simplify * sddmm dot cpu * upd * format * upd * compatible with MJ's PR * lint * upd * upd * upd * python end * upd * upd * lint * lint * upd * upd * upd * upd * upd * lint * fix mxnet * upd * lint * use minjie's ptr * macro * upd * reorg * lint * fix corner cases * upd * enrich cpu docs * upd * upd * lint * lint * pylint * sx review * improve docstring * python doc * upd * restructure * lint * upd test * upd * pylint * fix corner cases and test
-