1. 06 Sep, 2022 1 commit
    • Chang Liu's avatar
      [Feature] Unify the cuda stream used in core library (#4480) · 1c9d2a03
      Chang Liu authored
      
      
      * Use an internal cuda stream for CopyDataFromTo
      
      * small fix white space
      
      * Fix to compile
      
      * Make stream optional in copydata for compile
      
      * fix lint issue
      
      * Update cub functions to use internal stream
      
      * Lint check
      
      * Update CopyTo/CopyFrom/CopyFromTo to use internal stream
      
      * Address comments
      
      * Fix backward CUDA stream
      
      * Avoid overloading CopyFromTo()
      
      * Minor comment update
      
      * Overload copydatafromto in cuda device api
      Co-authored-by: default avatarxiny <xiny@nvidia.com>
      1c9d2a03
  2. 11 Jun, 2022 1 commit
  3. 22 Mar, 2021 1 commit
  4. 27 Jan, 2021 1 commit
    • nv-dlasalle's avatar
      [Performance] Improve COO to CSR, and sort columns of CSR only when necessary. (#2391) · 2576647c
      nv-dlasalle authored
      * Remove double-checking sorted
      
      * Remove sorting of CSR by default
      
      * Update unit test to use unsorted matix
      
      * delete whitespace
      
      * Expand unit tests
      
      * Replace cusparse sort
      
      * Fix row column sorting
      
      * Explicitly don't sort columns
      
      * Fix linting errors
      
      * Fix bit-width calculation
      
      * Fix sorting assertion and unit test
      
      * Fix linting
      
      * Improve CPU COO2CSR
      
      * Remove references
      
      * Rename and add documentation to edge encoding/decoding funcionts
      
      * Fix sorting keys as 64 bit
      
      * Revert cosmetic changes to unit tests
      
      * Update documentation
      
      * Update complexity documentation for coo to csr conversion
      
      * Remove COOIsSorted check in CPU implementation too
      2576647c
  5. 30 Jul, 2020 1 commit