1. 23 Sep, 2024 1 commit
  2. 11 Apr, 2023 1 commit
  3. 07 Nov, 2022 1 commit
  4. 06 Nov, 2022 1 commit
  5. 04 Nov, 2022 1 commit
  6. 15 Sep, 2022 1 commit
    • Xin Yao's avatar
      [Feature] Import PyTorch's CUDA stream management (#4503) · 9a00cf19
      Xin Yao authored
      * add set_stream
      
      * add .record_stream for NDArray and HeteroGraph
      
      * refactor dgl stream Python APIs
      
      * test record_stream
      
      * add unit test for record stream
      
      * use pytorch's stream
      
      * fix lint
      
      * fix cpu build
      
      * address comments
      
      * address comments
      
      * add record stream tests for dgl.graph
      
      * record frames and update dataloder
      
      * add docstring
      
      * update frame
      
      * add backend check for record_stream
      
      * remove CUDAThreadEntry::stream
      
      * record stream for newly created formats
      
      * fix bug
      
      * fix cpp test
      
      * fix None c_void_p to c_handle
      9a00cf19
  7. 06 Sep, 2022 1 commit
    • Chang Liu's avatar
      [Feature] Unify the cuda stream used in core library (#4480) · 1c9d2a03
      Chang Liu authored
      
      
      * Use an internal cuda stream for CopyDataFromTo
      
      * small fix white space
      
      * Fix to compile
      
      * Make stream optional in copydata for compile
      
      * fix lint issue
      
      * Update cub functions to use internal stream
      
      * Lint check
      
      * Update CopyTo/CopyFrom/CopyFromTo to use internal stream
      
      * Address comments
      
      * Fix backward CUDA stream
      
      * Avoid overloading CopyFromTo()
      
      * Minor comment update
      
      * Overload copydatafromto in cuda device api
      Co-authored-by: default avatarxiny <xiny@nvidia.com>
      1c9d2a03
  8. 31 Aug, 2022 1 commit
    • Xin Yao's avatar
      [Feature] Make TensorAdapter Stream Aware (#4472) · 2b766740
      Xin Yao authored
      * Allocate tensors in DGL's current stream
      
      * make tensoradaptor stream-aware
      
      * replace TAemtpy with cpu allocator
      
      * fix typo
      
      * try fix cpu allocation
      
      * clean header
      
      * redirect AllocDataSpace as well
      
      * resolve comments
      2b766740
  9. 07 Jul, 2022 1 commit
  10. 10 Mar, 2022 1 commit
  11. 25 Dec, 2020 1 commit
    • Quan (Andy) Gan's avatar
      [Performance] Use allocator from PyTorch if possible (#2328) · 9a7235fa
      Quan (Andy) Gan authored
      * first commit
      
      * some thoughts
      
      * move around
      
      * more commit
      
      * more fixes
      
      * now it uses torch allocator
      
      * fix symbol export error
      
      * fix
      
      * fixes
      
      * test fix
      
      * add script
      
      * building separate library per version
      
      * fix for vs2019
      
      * more fixes
      
      * fix on windows build
      
      * update jenkinsfile
      
      * auto copy built dlls for windows
      
      * lint and installation guide update
      
      * fix
      
      * specify conda environment
      
      * set environment for ci
      
      * fix
      
      * fix
      
      * fix
      
      * fix again
      
      * revert
      
      * fix cmake
      
      * fix
      
      * switch to using python interpreter path
      
      * remove scripts
      
      * debug
      
      * oops sorry
      
      * Update index.rst
      
      * Update index.rst
      
      * copies automatically, no need for this
      
      * do not print message if library not found
      
      * tiny fixes
      
      * debug on nightly
      
      * replace add_compile_definitions to make CMake 3.5 happy
      
      * fix linking to wrong lib for multiple pytorch envs
      
      * changed building strategy
      
      * fix nightly
      
      * fix windows
      
      * fix windows again
      
      * setup bugfix
      
      * address comments
      
      * change README
      9a7235fa