1. 26 Dec, 2023 1 commit
  2. 16 Nov, 2023 1 commit
  3. 15 Nov, 2023 1 commit
  4. 12 Sep, 2023 1 commit
  5. 01 Sep, 2023 1 commit
  6. 15 Aug, 2023 1 commit
  7. 14 Aug, 2023 1 commit
  8. 07 Aug, 2023 2 commits
  9. 03 Aug, 2023 1 commit
  10. 02 Aug, 2023 2 commits
  11. 01 Aug, 2023 1 commit
  12. 24 Jul, 2023 1 commit
  13. 02 Jun, 2023 1 commit
  14. 17 Apr, 2023 1 commit
  15. 22 Mar, 2023 1 commit
  16. 08 Mar, 2023 1 commit
    • Xin Yao's avatar
      [Refactor] Replace third_party/nccl with PyTorch's NCCL backend (#4989) · 8d5d8962
      Xin Yao authored
      * expose GeneratePermutation
      
      * add sparse_all_to_all_push
      
      * add sparse_all_to_all_pull
      
      * add unit test
      
      * handle world_size=1
      
      * remove python nccl wrapper
      
      * remove the nccl dependency
      
      * use pinned memory to speedup D2H copy
      
      * fix lint
      
      * resolve comments
      
      * fix lint
      
      * fix ut
      
      * resolve comments
      8d5d8962
  17. 05 Jan, 2023 1 commit
  18. 19 Nov, 2022 1 commit
  19. 17 Nov, 2022 1 commit
  20. 15 Dec, 2021 1 commit
    • lixiaobai's avatar
      [PinSAGESampler] support PinSAGE sampler on GPU (#3567) · dd762a1e
      lixiaobai authored
      
      
      * Feat: support API "randomwalk_topk" in library
      
      * Feat: use the new API "randomwalk_topk" for PinSAGESampler
      
      * Minor
      
      * Minor
      
      * Refactor: modified codes as checker required
      
      * Minor
      
      * Minor
      
      * Minor
      
      * Minor
      
      * Fix: checking errors in RandomWalkTopk
      
      * Refactor: modified the docstring for randomwalk_topk
      
      * change randomwalk_topk to internal
      
      * fix
      
      * rename
      
      * Minor for pinsage.py
      
      * Feat: support randomwalk and SelectPinSageNeighbors on GPU
      
      Port RandomWalk algorithm on GPU,
      and port SelectPinSageNeighbors on GPU.
      
      * Feat: support GPU on python APIs
      
      * Feat: remove perf print information in FrequenchHashmap
      
      * Fix: modified the code format
      
      Modified the code format as task_lint.sh suggested
      
      * Feat: let test script support PinSAGESampler on GPU
      
      Let test script support PinSAGESampler on GPU,
      minor of "restart_prob".
      
      * Minor
      
      * Minor
      
      * Minor
      
      * Refactor: use the atomic operations from the array module
      
      * Minor: change the long lines
      
      * Refactor: modified the get_node_types for gpu
      
      * Feat: update the contributor date
      
      * Perf: remove unnecessary stream sync
      
      * Feat: support other random walk
      
      But the non-uniform choice is still not supported.
      
      * Fix: add CUDA switch for random walk
      Co-authored-by: default avatarQuan Gan <coin2028@hotmail.com>
      dd762a1e
  21. 08 Nov, 2021 1 commit
  22. 16 Jul, 2021 1 commit
  23. 16 Jun, 2021 1 commit
  24. 25 May, 2021 1 commit
  25. 20 May, 2021 1 commit
    • nv-dlasalle's avatar
      [Feature][Performance] Implement NCCL wrapper for communicating NodeEmbeddings... · ae8dbe6d
      nv-dlasalle authored
      
      [Feature][Performance] Implement NCCL wrapper for communicating NodeEmbeddings and sparse gradients. (#2825)
      
      * Split NCCL wrapper from sparse optimizer and sparse embedding
      
      * Add more unit tests for single node nccl
      
      * Fix unit test for tf
      
      * Switch to device histogram
      
      * Fix histgram issues
      
      * Finish migration to histogram
      
      * Handle cases with zero send/recieve data
      
      * Start on partition object
      
      * Get compiling
      
      * Updates
      
      * Add unit tests
      
      * Switch to partition object
      
      * Fix linting issues
      
      * Rename partition file
      
      * Add python doc
      
      * Fix python assert and finish doxygen comments
      
      * Remove stubs for range based partition to satisfy pylint
      
      * Wrap unit test in GPU only
      
      * Wrap explicit cuda call in ifdef
      
      * Merge with partition.py
      
      * update docstrings
      
      * Cleanup partition_op
      
      * Add Workspace object
      
      * Switch to using workspace object
      
      * Move last remainder based function out of nccl_api
      
      * Add error messages
      
      * Update docs with examples
      
      * Fix linting erros
      Co-authored-by: default avatarxiang song(charlie.song) <classicxsong@gmail.com>
      ae8dbe6d
  26. 09 Mar, 2021 1 commit
  27. 08 Feb, 2021 1 commit
    • nv-dlasalle's avatar
      [Sampling] Implement `dgl.to_block()` for the GPU (#2339) · bc3a532f
      nv-dlasalle authored
      
      
      * Add start of to_block gpu implementation
      
      * Pull in more changes from 0.4.2 cuda_to_block
      
      * Move more code to IdArray
      
      * Refactor DeviceNodeMapMaker
      
      * Updates
      
      * get compiling
      
      * Integrate to_block
      
      * Fix ID allocation
      
      * Minor fixes
      
      * Cleanup cuda calls to use cuda_common
      
      * Reduce kernel calls
      
      * Lint cleanup
      
      * Expand documentation
      
      * Remove unused function
      
      * Rename variables for consistency
      
      * Add doxygen comments
      
      * Fix file extension
      
      * Remove raw asynccopy for deviceapi
      
      * Remove unused function
      
      * Fix block/tile configuration
      
      * Add cuda_device_common.cuh
      
      * Add basic hashtable
      
      * Migrate part of hashtable
      
      * Refactor to use external hashtable
      
      * Make functions members
      
      * Format hash table functions
      
      * Migrate duplicate filling
      
      * Move last function over
      
      * Refactor with cu file
      
      * lint c++ code
      
      * Move context check to C++ code
      
      * Use macro switch
      
      * Add missing files
      
      * Update docstring
      
      * update docs
      
      * Move atomic functions
      
      * Refactor hashtable
      
      * Fix linting
      
      * Expand docs
      
      * Fix mismatched argument names
      
      * Switch doxygen comments from using @param to \param
      Co-authored-by: default avatarJinjing Zhou <VoVAllen@users.noreply.github.com>
      Co-authored-by: default avatarMinjie Wang <wmjlyjemaine@gmail.com>
      bc3a532f
  28. 28 Jan, 2021 1 commit
  29. 31 Dec, 2020 1 commit
  30. 25 Dec, 2020 1 commit
    • Quan (Andy) Gan's avatar
      [Performance] Use allocator from PyTorch if possible (#2328) · 9a7235fa
      Quan (Andy) Gan authored
      * first commit
      
      * some thoughts
      
      * move around
      
      * more commit
      
      * more fixes
      
      * now it uses torch allocator
      
      * fix symbol export error
      
      * fix
      
      * fixes
      
      * test fix
      
      * add script
      
      * building separate library per version
      
      * fix for vs2019
      
      * more fixes
      
      * fix on windows build
      
      * update jenkinsfile
      
      * auto copy built dlls for windows
      
      * lint and installation guide update
      
      * fix
      
      * specify conda environment
      
      * set environment for ci
      
      * fix
      
      * fix
      
      * fix
      
      * fix again
      
      * revert
      
      * fix cmake
      
      * fix
      
      * switch to using python interpreter path
      
      * remove scripts
      
      * debug
      
      * oops sorry
      
      * Update index.rst
      
      * Update index.rst
      
      * copies automatically, no need for this
      
      * do not print message if library not found
      
      * tiny fixes
      
      * debug on nightly
      
      * replace add_compile_definitions to make CMake 3.5 happy
      
      * fix linking to wrong lib for multiple pytorch envs
      
      * changed building strategy
      
      * fix nightly
      
      * fix windows
      
      * fix windows again
      
      * setup bugfix
      
      * address comments
      
      * change README
      9a7235fa
  31. 21 Dec, 2020 1 commit
  32. 17 Dec, 2020 1 commit
  33. 14 Nov, 2020 1 commit
  34. 13 Nov, 2020 1 commit
  35. 07 Nov, 2020 1 commit
  36. 27 Aug, 2020 1 commit
  37. 21 Jun, 2020 1 commit
    • Tong He's avatar
      [Op] Farthest Point Sampler in Cpp and CUDA (#1630) · 3d47693b
      Tong He authored
      * working framework without actual algorithm logic
      
      * rename
      
      * fix
      
      * fps passes compilation
      
      * correct algorithm
      
      * add cuda implementation
      
      * update random start
      
      * before refactor
      
      * pass compilation but cuda not working
      
      * working
      
      * code working, will add docstring
      
      * add mxnet support
      
      * update docstring
      
      * update doc and test
      
      * cpplint
      
      * cpcplint
      
      * pylint
      
      * temporary fix
      
      * fix for win64
      
      * fix unitetest
      
      * fix
      
      * fix
      
      * remove comment
      
      * move to geometry package
      
      * remove redundant include
      
      * add docstrings and comments
      
      * add proof
      
      * add validity check
      3d47693b
  38. 17 Jul, 2019 1 commit
    • Minjie Wang's avatar
      [Refactor] Separating graph and sparse matrix operations (#699) · b0d9e7aa
      Minjie Wang authored
      * WIP: array refactoring
      
      * WIP: implementation
      
      * wip
      
      * most csr part
      
      * WIP: on coo
      
      * WIP: coo
      
      * finish refactoring immutable graph
      
      * compiled
      
      * fix undefined ndarray copy bug; add COOToCSR when coo has no data array
      
      * fix bug in COOToCSR
      
      * fix bug in CSR constructor
      
      * fix bug in in_edges(vid)
      
      * fix OutEdges bug
      
      * pass test_graph
      
      * pass test_graph
      
      * fix bug in CSR constructor
      
      * fix bug in CSR constructor
      
      * fix bug in CSR constructor
      
      * fix stupid bug
      
      * pass gpu test
      
      * remove debug printout
      
      * fix lint
      
      * rm biparate grpah
      
      * fix lint
      
      * address comments
      
      * fix bug in Clone
      
      * cpp utests
      b0d9e7aa