"...text-generation-inference.git" did not exist on "deb440b3a2179b1eccce9cf5dc1d4ff0e8a03135"
  1. 17 Jan, 2022 1 commit
  2. 11 Jan, 2022 1 commit
  3. 07 Jan, 2022 1 commit
    • Quan (Andy) Gan's avatar
      [Feature] Negative sampling (#3599) · 90f10b31
      Quan (Andy) Gan authored
      * first commit
      
      * a bunch of fixes
      
      * add unique
      
      * lint
      
      * lint
      
      * lint
      
      * address comments
      
      * Update negative_sampler.py
      
      * fix
      
      * description
      
      * address comments and fix
      
      * fix
      
      * replace unique with replace
      
      * test pylint
      
      * Update negative_sampler.py
      90f10b31
  4. 16 Dec, 2021 1 commit
  5. 06 Dec, 2021 1 commit
  6. 03 Dec, 2021 1 commit
    • Israt Nisa's avatar
      [Feature] Add Min/max reducer in heterogeneous API for unary message functions (#3514) · cb0e1103
      Israt Nisa authored
      
      
      * min/max support for forward CPU heterograph
      
      * Added etype with each argU values
      
      * scatter_add needs fix
      
      * added scatter_add_hetero. Grads dont match for max reducer
      
      * storing ntype in argX
      
      * fixing scatter_add_hetero
      
      * hetero matches with torch's scatter add
      
      * works copy_e forward+cpu
      
      * added backward for copy_rhs
      
      * Computes gradient for all node types in one kernel
      
      * bug fix
      
      * unnitest for max/min on CPU
      
      * renamed scatter_add_hetero to update_grad_minmax_hetero
      
      * lint check and comment out cuda call for max. Code is for CPU only
      
      * lint check
      
      * replace inf with zero
      
      * minor
      
      * lint check
      
      * removed LIBXSMM code from hetro code
      
      * fixing backward operator of UpdateGradMinMaxHetero
      
      * removed backward from update_grad_minmax_hetero
      
      * docstring
      
      * improved docstring and coding style
      
      * Added pass by pointer for output
      
      * typos and pass by references
      
      * Support for copy_rhs
      
      * Added header <string>
      
      * fix bug in copy_u_max
      
      * Added comments and dimension check of all etypes
      
      * skip mxnet check
      
      * pass by pointer output arrays
      
      * updated docstring
      Co-authored-by: default avatarIsrat Nisa <nisisrat@amazon.com>
      Co-authored-by: default avatarQuan (Andy) Gan <coin2028@hotmail.com>
      cb0e1103
  7. 17 Nov, 2021 1 commit
  8. 18 Oct, 2021 1 commit
  9. 17 Sep, 2021 1 commit
  10. 14 Sep, 2021 1 commit
  11. 13 Sep, 2021 2 commits
  12. 02 Sep, 2021 1 commit
  13. 01 Sep, 2021 1 commit
  14. 31 Aug, 2021 1 commit
  15. 28 Jul, 2021 1 commit
    • xiang song(charlie.song)'s avatar
      [New Feature] Per edge type sampler for to_homogeneous graphs. (#3131) · ba7e7cf9
      xiang song(charlie.song) authored
      
      
      * fix.
      
      * fix.
      
      * fix.
      
      * fix.
      
      * Fix test
      
      * Deprecate old DistEmbedding impl, use synchronized embedding impl
      
      * Basic imple of heterogeneous on homogenenous sampling
      
      * make pass
      
      * Pass C++ test
      
      * Add python test code
      
      * lint
      
      * lint
      
      * Add MultiLayerEtypeNeighborSampler
      
      * Add unitest for single machine dataloader
      
      * Add dist dataloader test for edge type sampler
      
      * Fix lint
      
      * fix
      
      * support for per etype sample
      
      * Fix some bug and enable distributed training with per edge sample
      
      * fix
      
      * Now distributed training works
      
      * turn off some mxnet
      
      * turn off mxnet for some dist test
      
      * fix
      
      * upd
      
      * upd according to the comments
      
      * Fix
      
      * Fix test and now distributed works.
      
      * upd
      
      * upd
      
      * Fix
      
      * Fix bug
      
      * remove dead code.
      
      * upd
      
      * Fix
      
      * upd
      
      * Fix
      Co-authored-by: default avatarUbuntu <ubuntu@ip-172-31-71-112.ec2.internal>
      Co-authored-by: default avatarUbuntu <ubuntu@ip-172-31-2-66.ec2.internal>
      Co-authored-by: default avatarDa Zheng <zhengda1936@gmail.com>
      ba7e7cf9
  16. 21 Jul, 2021 1 commit
  17. 13 Jul, 2021 1 commit
    • sanchit-misra's avatar
      [CPU][Kernel] Single socket spmm (#3024) · fac75e16
      sanchit-misra authored
      
      
      * optimizations of spmm for CPU
      
      * Added names of contributors
      
      * Minor code cleanup
      
      * Moved the spmm optimization code to a new header file
      
      * Moved to DGL's logging method
      
      * removed duplicate code between SpMMSumCsr and SpMMCmpCsr
      
      * Changes made to follow Google coding style
      
      * Fixed lint errors in spmm.h
      
      * Fixed some lint errors from spmm_blocking_libxsmm.h
      
      * Fixed lint errors from spmm_blocking_libxsmm.h
      
      * Added comments to SpMMCreateLibxsmmKernel
      
      * to enable building of tests, and other cosmetic changes
      
      * disabling libxsmm on windows
      
      * Put a condition to avoid opt impl for FP64 as libxsmm does not have FP64 support yet
      
      * cosmetic changes and documentation
      
      * cosmetic changes
      
      * to pass lint tests
      
      * replaced multiple allocations for buffers of indices and edges with a single allocation
      Co-authored-by: default avatarMinjie Wang <wmjlyjemaine@gmail.com>
      fac75e16
  18. 23 Jun, 2021 1 commit
    • Qidong Su's avatar
      [Feature] Biased Neighbor Sampling (#2987) · e56bbafd
      Qidong Su authored
      
      
      * update
      
      * update
      
      * update
      
      * update
      
      * lint
      
      * lint
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * lint
      
      * update
      
      * clone
      
      * update
      
      * update
      
      * update
      
      * update
      
      * replace idarray with ndarray
      
      * refactor cpp part
      
      * refactor python part
      
      * debug
      
      * refactor interface
      
      * test and doc
      
      * lint and test
      
      * lint
      
      * fix
      
      * fix
      
      * fix
      
      * const
      
      * doc
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix & doc
      
      * fix
      
      * fix
      
      * update
      
      * update
      
      * update
      
      * merge
      
      * doc
      
      * doc
      
      * lint
      
      * fix
      
      * more tests
      
      * doc
      
      * fix
      
      * fix
      
      * update
      
      * update
      
      * update
      
      * fix
      
      * fix
      Co-authored-by: default avatarMinjie Wang <wmjlyjemaine@gmail.com>
      e56bbafd
  19. 03 Jun, 2021 1 commit
  20. 01 Jun, 2021 1 commit
    • Qidong Su's avatar
      [Feature][Sampler] Sort CSR by tag (#1664) · b8fe2b48
      Qidong Su authored
      
      
      * update
      
      * update
      
      * update
      
      * update
      
      * lint
      
      * lint
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * lint
      
      * update
      
      * clone
      
      * update
      
      * update
      
      * update
      
      * update
      
      * replace idarray with ndarray
      
      * refactor cpp part
      
      * refactor python part
      
      * debug
      
      * refactor interface
      
      * test and doc
      
      * lint and test
      
      * lint
      
      * fix
      
      * fix
      
      * fix
      
      * const
      
      * doc
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix & doc
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * update
      Co-authored-by: default avatarMinjie Wang <wmjlyjemaine@gmail.com>
      b8fe2b48
  21. 20 May, 2021 1 commit
    • nv-dlasalle's avatar
      [Feature][Performance] Implement NCCL wrapper for communicating NodeEmbeddings... · ae8dbe6d
      nv-dlasalle authored
      
      [Feature][Performance] Implement NCCL wrapper for communicating NodeEmbeddings and sparse gradients. (#2825)
      
      * Split NCCL wrapper from sparse optimizer and sparse embedding
      
      * Add more unit tests for single node nccl
      
      * Fix unit test for tf
      
      * Switch to device histogram
      
      * Fix histgram issues
      
      * Finish migration to histogram
      
      * Handle cases with zero send/recieve data
      
      * Start on partition object
      
      * Get compiling
      
      * Updates
      
      * Add unit tests
      
      * Switch to partition object
      
      * Fix linting issues
      
      * Rename partition file
      
      * Add python doc
      
      * Fix python assert and finish doxygen comments
      
      * Remove stubs for range based partition to satisfy pylint
      
      * Wrap unit test in GPU only
      
      * Wrap explicit cuda call in ifdef
      
      * Merge with partition.py
      
      * update docstrings
      
      * Cleanup partition_op
      
      * Add Workspace object
      
      * Switch to using workspace object
      
      * Move last remainder based function out of nccl_api
      
      * Add error messages
      
      * Update docs with examples
      
      * Fix linting erros
      Co-authored-by: default avatarxiang song(charlie.song) <classicxsong@gmail.com>
      ae8dbe6d
  22. 17 May, 2021 1 commit
  23. 27 Apr, 2021 1 commit
  24. 16 Apr, 2021 1 commit
    • nv-dlasalle's avatar
      [Performance] Track sorted status of COO from creation (#2645) · bbebde46
      nv-dlasalle authored
      
      
      * Add row/col sorted flags
      
      * improve sorting paths
      
      * Remove print statement
      
      * Keep track of sorted matrices
      
      * Remove sort check in to_block
      
      * Improve CPU sorted COO->CSR
      
      * Handle the zero edge case
      
      * Remove omp default clause to work with MSVC
      
      * Update comments on sorted COO->CSR cpu implementatoin
      
      * Expose sorted to python interface
      
      * Make check_sorted default to false for dgl.graph()
      
      * remove check sorted; add utests
      
      * remove check_sorted flag
      Co-authored-by: default avatarMinjie Wang <wmjlyjemaine@gmail.com>
      bbebde46
  25. 25 Mar, 2021 1 commit
  26. 24 Mar, 2021 1 commit
    • Quan (Andy) Gan's avatar
      [Feature] Sparse-sparse matrix multiplication, addition, and masking (#2753) · 929d8634
      Quan (Andy) Gan authored
      * test
      
      * more stuff
      
      * add test
      
      * fixes
      
      * optimize algo
      
      * replace unordered_map with arrays
      
      * lint
      
      * lint x2
      
      * oops
      
      * disable gpu csrmm tests
      
      * remove gpu invocation
      
      * optimize with openmp
      
      * remove python functions
      
      * add back with docstrings
      
      * lint
      
      * lint
      
      * update python interface
      
      * functionize
      
      * functionize
      
      * lint
      
      * lint
      929d8634
  27. 21 Feb, 2021 1 commit
  28. 29 Jan, 2021 1 commit
  29. 28 Jan, 2021 1 commit
  30. 27 Jan, 2021 1 commit
    • nv-dlasalle's avatar
      [Performance] Improve COO to CSR, and sort columns of CSR only when necessary. (#2391) · 2576647c
      nv-dlasalle authored
      * Remove double-checking sorted
      
      * Remove sorting of CSR by default
      
      * Update unit test to use unsorted matix
      
      * delete whitespace
      
      * Expand unit tests
      
      * Replace cusparse sort
      
      * Fix row column sorting
      
      * Explicitly don't sort columns
      
      * Fix linting errors
      
      * Fix bit-width calculation
      
      * Fix sorting assertion and unit test
      
      * Fix linting
      
      * Improve CPU COO2CSR
      
      * Remove references
      
      * Rename and add documentation to edge encoding/decoding funcionts
      
      * Fix sorting keys as 64 bit
      
      * Revert cosmetic changes to unit tests
      
      * Update documentation
      
      * Update complexity documentation for coo to csr conversion
      
      * Remove COOIsSorted check in CPU implementation too
      2576647c
  31. 25 Dec, 2020 1 commit
    • Quan (Andy) Gan's avatar
      [Performance] Use allocator from PyTorch if possible (#2328) · 9a7235fa
      Quan (Andy) Gan authored
      * first commit
      
      * some thoughts
      
      * move around
      
      * more commit
      
      * more fixes
      
      * now it uses torch allocator
      
      * fix symbol export error
      
      * fix
      
      * fixes
      
      * test fix
      
      * add script
      
      * building separate library per version
      
      * fix for vs2019
      
      * more fixes
      
      * fix on windows build
      
      * update jenkinsfile
      
      * auto copy built dlls for windows
      
      * lint and installation guide update
      
      * fix
      
      * specify conda environment
      
      * set environment for ci
      
      * fix
      
      * fix
      
      * fix
      
      * fix again
      
      * revert
      
      * fix cmake
      
      * fix
      
      * switch to using python interpreter path
      
      * remove scripts
      
      * debug
      
      * oops sorry
      
      * Update index.rst
      
      * Update index.rst
      
      * copies automatically, no need for this
      
      * do not print message if library not found
      
      * tiny fixes
      
      * debug on nightly
      
      * replace add_compile_definitions to make CMake 3.5 happy
      
      * fix linking to wrong lib for multiple pytorch envs
      
      * changed building strategy
      
      * fix nightly
      
      * fix windows
      
      * fix windows again
      
      * setup bugfix
      
      * address comments
      
      * change README
      9a7235fa
  32. 17 Dec, 2020 1 commit
  33. 10 Dec, 2020 1 commit
  34. 27 Nov, 2020 1 commit
  35. 26 Nov, 2020 1 commit
  36. 22 Nov, 2020 1 commit
  37. 17 Nov, 2020 1 commit
  38. 13 Nov, 2020 1 commit
  39. 15 Sep, 2020 1 commit