  1. 15 Dec, 2023 1 commit
  2. 12 Dec, 2023 1 commit
  3. 04 Dec, 2023 1 commit
  4. 01 Dec, 2023 5 commits
  5. 30 Nov, 2023 2 commits
  6. 22 Aug, 2023 1 commit
  7. 31 Jul, 2023 1 commit
  8. 09 Jun, 2023 1 commit
  9. 29 Mar, 2023 1 commit
  10. 02 Mar, 2023 1 commit
  11. 29 Dec, 2022 1 commit
  12. 26 Dec, 2022 1 commit
  13. 10 Nov, 2022 1 commit
  14. 06 Nov, 2022 1 commit
    • [Feature] Add bfloat16 (bf16) support (#4648) · 96297fb8
      Xin Yao authored
      * add bf16 specializations
      
      * remove SWITCH_BITS
      
      * enable amp for bf16
      
      * remove SWITCH_BITS for cpu kernels
      
      * enable bf16 based on CUDART
      
      * fix compiling for sm<80
      
      * fix cpu build
      
      * enable unit tests
      
      * update doc
      
      * disable test for CUDA < 11.0
      
      * address comments
      
      * address comments
  15. 27 Oct, 2022 1 commit
  16. 08 Oct, 2022 1 commit
  17. 25 Aug, 2022 1 commit
  18. 22 Aug, 2022 1 commit
  19. 18 Aug, 2022 1 commit
  20. 17 Aug, 2022 1 commit
  21. 29 Jul, 2022 1 commit
    • [Feature] Add CUDA Weighted Neighborhood Sampling (#4064) · 86c81b4e
      Xin Yao authored
      * add weighted sampling without replacement (A-Chao)
      
      * improve Algorithm A-Chao with block-wise prefix sum
      
      * correctly fill out_idxs
      
      * implement weighted sampling with replacement
      
      * small fix
      
      * merge host-side code of weighted/uniform sampling
      
      * enable unit tests for cuda weighted sampling
      
      * move thrust/cub wrapper to the cmake file
      
      * update docs accordingly
      
      * fix linting
      
      * fix linting
      
      * fix unit test
      
      * Bump external CUB/Thrust versions
      
      * Fix code style and update description of algorithm design
      
      * [Feature] GPU support weighted graph neighbor sampling
      commit by pengqirong(OPPO)
      
      * merge pengqirong's implementation
      
      * revert the change to cub and thrust
      
      * fix linting
      
      * use DeviceSegmentedSort for better performance
      
      * add more comments
      
      * add necessary notes
      
      * add necessary notes
      
      * resolve some comments
      
      * define THRUST_CUB_WRAPPED_NAMESPACE
      
      * fix doc
      Co-authored-by: 彭齐荣 <657017034@qq.com>
  22. 26 Jul, 2022 1 commit
  23. 21 Jul, 2022 1 commit
  24. 02 Jun, 2022 2 commits
  25. 26 Mar, 2022 1 commit
  26. 25 Mar, 2022 2 commits
  27. 28 Feb, 2022 1 commit
  28. 27 Feb, 2022 1 commit
  29. 24 Feb, 2022 1 commit
  30. 23 Feb, 2022 1 commit
  31. 17 Feb, 2022 3 commits