"...en/git@developer.sourcefind.cn:renzhc/diffusers_dcu.git" did not exist on "b3e5cd6b4d7fd5d03d75e78688dce52be02217b3"
  1. 25 Sep, 2024 1 commit
  2. 23 Sep, 2024 1 commit
  3. 13 Sep, 2024 1 commit
  4. 29 Feb, 2024 1 commit
  5. 23 Nov, 2023 1 commit
  6. 22 Nov, 2023 1 commit
  7. 14 Aug, 2023 1 commit
  8. 10 Aug, 2023 1 commit
  9. 19 Jul, 2023 1 commit
  10. 14 Jul, 2023 2 commits
  11. 13 Jul, 2023 1 commit
  12. 17 May, 2023 1 commit
  13. 23 Mar, 2023 1 commit
  14. 08 Mar, 2023 1 commit
  15. 12 Jan, 2023 1 commit
  16. 09 Dec, 2022 1 commit
  17. 06 Dec, 2022 1 commit
      Add support for next cusparse release (#4974) · fb223d47
      Chang Liu authored
      * Add support for next cusparse release
      
      * Fix lint
      
      * Add switch and tune the performance
      
      * Fix lint issue
      
      * Fine tune the heuristics
      
      * Fix lint issue
      
      * Address comments
      
      * Minor fix
      
      * Address comments
  18. 24 Nov, 2022 1 commit
  19. 22 Nov, 2022 2 commits
      [Performance] Leverage hashmap to accelerate CSRSliceMatrix<kDGLCUDA, IdType> (#4924) · aa419895
      Ping Gong authored
      * Leverage hashmap to accelerate CSRSliceMatrix
      
      * fix lint check
      
      * use `min` in cuda_runtime.ch
      
      * fix hash func
      
      * add some comments and adjust the <grid,block> of the _SegmentMaskColKernel kernel
      
      * set device and stream for thrust::for_each
      
      * use thrust::cuda::par_nosync
      Co-authored-by: Xin Yao <xiny@nvidia.com>
      [Feature] (La)yer-Neigh(bor) sampling implementation (#4668) · bf264d00
      Muhammed Fatih BALIN authored
      * adding LABOR sampling
      
      * add ladies and pladies samplers
      
      * fix compile error after rebase
      
      * add reference for ladies sampler
      
      * Improve ladies implementation.
      
      * weighted labor sampling initial implementation draft
      fix indentation and small bug in ladies script
      
      * importance_sampling currently doesn't work with weights
      
      * fix weighted importance sampling
      
      * move labor example into its own folder
      
      * lint fixes
      
      * Improve documentation
      
      * remove examples from the main PR
      
      * fix linting by not using c++17 features
      
      * fix documentation of labor_sampler.py
      
      * update documentation for labor.py
      
      * reformat the labor.py file with black
      
      * fix linting errors
      
      * replace exception use with if
      
      * fix typo in error comment
      
      * fixing win64 build for ci
      
      * fixing weighted implementation, works now.
      
      * fix bug in the weighted case and importance_sampling==0
      
      * address part of the reviews
      
      * remove unused code paths from cuda
      
      * remove unused code path from cpu side
      
      * remove extra features of labor making use of random seed.
      
      * fix exclude_edges bug
      
      * remove pcg and seed logic from cpu implementation, seed logic should still work for cuda.
      
      * minor style change
      
      * refactor CPU implementation, take out the importance_sampling probability computation into a function.
      
      * improve CUDAWorkspaceAllocator
      
      * refactor importance_sampling part out to a function
      
      * minor optimization
      
      * fix linting issue
      
      * Revert "remove pcg and seed logic from cpu implementation, seed logic should still work for cuda."
      
      This reverts commit c250e07ac6d7e13f57e79e8a2c2f098d777378c2.
      
      * Revert "remove extra features of labor making use of random seed."
      
      This reverts commit 7f99034353080308f4783f27d9a08bea343fb796.
      
      * fix the documentation
      
      * disable NIDs
      
      * improve the documentation in the code
      
      * use the stream argument in pcg32 instead of skipping ahead t times, can discard the use of hashmap now since it is faster this way.
      
      * fix linting issue
      
      * address another round of reviews
      
      * further optimize CPU LABOR sampling implementation
      
      * fix linting error
      
      * update the comment
      
      * reformat
      
      * rename and rephrase comment
      
      * fix formatting according to new linting specs
      
      * fix compile error due to renaming, fix linting.
      
      * lint
      
      * rename DGLHeteroGraph to DGLGraph to match master
      
      * replace other occurrences of DGLHeteroGraph to DGLGraph
      Co-authored-by: Muhammed Fatih BALIN <m.f.balin@gmail.com>
      Co-authored-by: Kaan Sancak <kaansnck@gmail.com>
      Co-authored-by: Quan Gan <coin2028@hotmail.com>
  20. 10 Nov, 2022 1 commit
  21. 08 Nov, 2022 1 commit
  22. 07 Nov, 2022 4 commits
  23. 06 Nov, 2022 2 commits
  24. 03 Nov, 2022 2 commits
  25. 28 Oct, 2022 1 commit
  26. 13 Oct, 2022 1 commit
  27. 11 Oct, 2022 1 commit
  28. 21 Sep, 2022 1 commit
  29. 19 Sep, 2022 1 commit
      [Feature] Bump DLPack to v0.7 and decouple DLPack from the core library (#4454) · cded5b80
      Xin Yao authored
      * rename `DLContext` to `DGLContext`
      
      * rename `kDLGPU` to `kDLCUDA`
      
      * replace DLTensor with DGLArray
      
      * fix linting
      
      * Unify DGLType and DLDataType to DGLDataType
      
      * Fix FFI
      
      * rename DLDeviceType to DGLDeviceType
      
      * decouple dlpack from the core library
      
      * fix bug
      
      * fix lint
      
      * fix merge
      
      * fix build
      
      * address comments
      
      * rename dl_converter to dlpack_convert
      
      * remove redundant comments
  30. 15 Sep, 2022 1 commit
      [Feature] Import PyTorch's CUDA stream management (#4503) · 9a00cf19
      Xin Yao authored
      * add set_stream
      
      * add .record_stream for NDArray and HeteroGraph
      
      * refactor dgl stream Python APIs
      
      * test record_stream
      
      * add unit test for record stream
      
      * use pytorch's stream
      
      * fix lint
      
      * fix cpu build
      
      * address comments
      
      * address comments
      
      * add record stream tests for dgl.graph
      
      * record frames and update dataloader
      
      * add docstring
      
      * update frame
      
      * add backend check for record_stream
      
      * remove CUDAThreadEntry::stream
      
      * record stream for newly created formats
      
      * fix bug
      
      * fix cpp test
      
      * fix None c_void_p to c_handle
  31. 06 Sep, 2022 1 commit
      [Feature] Unify the cuda stream used in core library (#4480) · 1c9d2a03
      Chang Liu authored
      * Use an internal cuda stream for CopyDataFromTo
      
      * small fix white space
      
      * Fix to compile
      
      * Make stream optional in copydata for compile
      
      * fix lint issue
      
      * Update cub functions to use internal stream
      
      * Lint check
      
      * Update CopyTo/CopyFrom/CopyFromTo to use internal stream
      
      * Address comments
      
      * Fix backward CUDA stream
      
      * Avoid overloading CopyFromTo()
      
      * Minor comment update
      
      * Overload copydatafromto in cuda device api
      Co-authored-by: xiny <xiny@nvidia.com>
  32. 12 Aug, 2022 1 commit
  33. 09 Aug, 2022 1 commit