1. 08 Jun, 2022 1 commit
  2. 11 May, 2022 1 commit
    • Vikram Sharma's avatar
      Make USE_AVX flag default value OFF (#3983) · 1a6806e2
      Vikram Sharma authored
      
      
      With the emergence of new ISA (like ARM and RISCV) retaining USE_AVX ON default makes the default build instructions fail. Fundamentally DGL does not require the use of AVX for functional working. AVX is mainly needed when to enable optimization. So proposal is to default turn it off and then later during build instructions, folks with AVX capabilities can enable with 
      `cmake .. -DUSE_AVX=ON`
      Co-authored-by: default avatarZihao Ye <expye@outlook.com>
      1a6806e2
  3. 07 Feb, 2022 1 commit
  4. 11 Jan, 2022 1 commit
  5. 06 Dec, 2021 1 commit
    • Jinjing Zhou's avatar
      [RPC] Use tensorpipe for rpc communication (#3335) · a3ce780d
      Jinjing Zhou authored
      * doesn't know whether works
      
      * add change
      
      * fix
      
      * fix
      
      * fix
      
      * remove
      
      * revert
      
      * lint
      
      * lint
      
      * fix
      
      * revert
      
      * lint
      
      * fix
      
      * only build rpc on linux
      
      * lint
      
      * lint
      
      * fix build on windows
      
      * fix windows
      
      * remove old test
      
      * fix cmake
      
      * Revert "remove old test"
      
      This reverts commit f1ea75c777c34cdc1f08c0589676ba6aee1feb29.
      
      * fix windows
      
      * fix
      
      * fix
      
      * fix indent
      
      * fix indent
      
      * address comment
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * lint
      
      * fix indent
      
      * fix lint
      
      * add introduction
      
      * fix
      
      * lint
      
      * lint
      
      * add more logs
      
      * fix
      
      * update xbyak for C++14 with gcc5
      
      * Remove channels
      
      * fix
      
      * add test script
      
      * fix
      
      * remove unused file
      
      * fix lint
      
      * add timeout
      a3ce780d
  6. 03 Dec, 2021 1 commit
  7. 02 Dec, 2021 1 commit
  8. 29 Nov, 2021 1 commit
  9. 14 Oct, 2021 1 commit
    • zexi yuan's avatar
      [Bugfix] three bugs related to using DGL as a subdirectory(third_party) of another project. (#3379) · 18863069
      zexi yuan authored
      * [Bugfix] fix a compile error for Debug-BuildType on Windows Platform
      
      When using CMakeLists.txt to build the "Debug" BuildType on the Windows Platform, it has three compile errors (C4716) in the file "dgl\src\runtime\shared_mem.cc":
      
      'dgl::runtime::SharedMemory::CreateNew': must return a value
      'dgl::runtime::SharedMemory::Open': must return a value
      'dgl::runtime::SharedMemory::Exist': must return a value
      
      * [Bugfix] cmake error "cannot find load file" when DGL as a sub_directory on Linux
      
      When using DGL as a subdirectory in a CMake Project, the "CMAKE_SOURCE_DIR" here will return the parent cmake scope dir, which is not a expected dir.
      Maybe it is better to use "CMAKE_CURRENT_SOURCE_DIR" to set "GKLIB_PATH".
      
      * [Bugfix] cmd cmake error when DGL as a subdirectory
      
      When DGL as a subdirectory of another project, the WORKING_DIRECTORY of "add_custom_command" will be incorrect at the line 255 of "CMakeLists.txt", such that making a cmake "setlocal" error.
      18863069
  10. 28 Sep, 2021 1 commit
  11. 06 Sep, 2021 1 commit
  12. 13 Jul, 2021 2 commits
    • Quan (Andy) Gan's avatar
      Remove march=native flag (#3134) · 7c3e1f94
      Quan (Andy) Gan authored
      7c3e1f94
    • sanchit-misra's avatar
      [CPU][Kernel] Single socket spmm (#3024) · fac75e16
      sanchit-misra authored
      
      
      * optimizations of spmm for CPU
      
      * Added names of contributors
      
      * Minor code cleanup
      
      * Moved the spmm optimization code to a new header file
      
      * Moved to DGL's logging method
      
      * removed duplicate code between SpMMSumCsr and SpMMCmpCsr
      
      * Changes made to follow Google coding style
      
      * Fixed lint errors in spmm.h
      
      * Fixed some lint errors from spmm_blocking_libxsmm.h
      
      * Fixed lint errors from spmm_blocking_libxsmm.h
      
      * Added comments to SpMMCreateLibxsmmKernel
      
      * to enable building of tests, and other cosmetic changes
      
      * disabling libxsmm on windows
      
      * Put a condition to avoid opt impl for FP64 as libxsmm does not have FP64 support yet
      
      * cosmetic changes and documentation
      
      * cosmetic changes
      
      * to pass lint tests
      
      * replaced multiple allocations for buffers of indices and edges with a single allocation
      Co-authored-by: default avatarMinjie Wang <wmjlyjemaine@gmail.com>
      fac75e16
  13. 27 Jun, 2021 1 commit
    • Jinjing Zhou's avatar
      [Build] Make nccl optional (#3056) · 9664cdff
      Jinjing Zhou authored
      * fix
      
      * remove nvidiasmi
      
      * fix
      
      * fix docs
      
      * fix
      
      * fix
      
      * 1
      
      * fix
      
      * remove
      
      * skip deprecated kernel
      
      * fix
      
      * Revert "skip deprecated kernel"
      
      This reverts commit c5ceb7f60dbbaf065b81cc3680757fd611d90ad3.
      
      * fix
      9664cdff
  14. 03 Jun, 2021 1 commit
  15. 25 May, 2021 1 commit
  16. 20 May, 2021 1 commit
    • nv-dlasalle's avatar
      [Feature][Performance] Implement NCCL wrapper for communicating NodeEmbeddings... · ae8dbe6d
      nv-dlasalle authored
      
      [Feature][Performance] Implement NCCL wrapper for communicating NodeEmbeddings and sparse gradients. (#2825)
      
      * Split NCCL wrapper from sparse optimizer and sparse embedding
      
      * Add more unit tests for single node nccl
      
      * Fix unit test for tf
      
      * Switch to device histogram
      
      * Fix histgram issues
      
      * Finish migration to histogram
      
      * Handle cases with zero send/recieve data
      
      * Start on partition object
      
      * Get compiling
      
      * Updates
      
      * Add unit tests
      
      * Switch to partition object
      
      * Fix linting issues
      
      * Rename partition file
      
      * Add python doc
      
      * Fix python assert and finish doxygen comments
      
      * Remove stubs for range based partition to satisfy pylint
      
      * Wrap unit test in GPU only
      
      * Wrap explicit cuda call in ifdef
      
      * Merge with partition.py
      
      * update docstrings
      
      * Cleanup partition_op
      
      * Add Workspace object
      
      * Switch to using workspace object
      
      * Move last remainder based function out of nccl_api
      
      * Add error messages
      
      * Update docs with examples
      
      * Fix linting erros
      Co-authored-by: default avatarxiang song(charlie.song) <classicxsong@gmail.com>
      ae8dbe6d
  17. 09 Apr, 2021 1 commit
  18. 24 Mar, 2021 1 commit
    • Quan (Andy) Gan's avatar
      [Feature] Sparse-sparse matrix multiplication, addition, and masking (#2753) · 929d8634
      Quan (Andy) Gan authored
      * test
      
      * more stuff
      
      * add test
      
      * fixes
      
      * optimize algo
      
      * replace unordered_map with arrays
      
      * lint
      
      * lint x2
      
      * oops
      
      * disable gpu csrmm tests
      
      * remove gpu invocation
      
      * optimize with openmp
      
      * remove python functions
      
      * add back with docstrings
      
      * lint
      
      * lint
      
      * update python interface
      
      * functionize
      
      * functionize
      
      * lint
      
      * lint
      929d8634
  19. 28 Jan, 2021 1 commit
  20. 31 Dec, 2020 1 commit
  21. 25 Dec, 2020 1 commit
    • Quan (Andy) Gan's avatar
      [Performance] Use allocator from PyTorch if possible (#2328) · 9a7235fa
      Quan (Andy) Gan authored
      * first commit
      
      * some thoughts
      
      * move around
      
      * more commit
      
      * more fixes
      
      * now it uses torch allocator
      
      * fix symbol export error
      
      * fix
      
      * fixes
      
      * test fix
      
      * add script
      
      * building separate library per version
      
      * fix for vs2019
      
      * more fixes
      
      * fix on windows build
      
      * update jenkinsfile
      
      * auto copy built dlls for windows
      
      * lint and installation guide update
      
      * fix
      
      * specify conda environment
      
      * set environment for ci
      
      * fix
      
      * fix
      
      * fix
      
      * fix again
      
      * revert
      
      * fix cmake
      
      * fix
      
      * switch to using python interpreter path
      
      * remove scripts
      
      * debug
      
      * oops sorry
      
      * Update index.rst
      
      * Update index.rst
      
      * copies automatically, no need for this
      
      * do not print message if library not found
      
      * tiny fixes
      
      * debug on nightly
      
      * replace add_compile_definitions to make CMake 3.5 happy
      
      * fix linking to wrong lib for multiple pytorch envs
      
      * changed building strategy
      
      * fix nightly
      
      * fix windows
      
      * fix windows again
      
      * setup bugfix
      
      * address comments
      
      * change README
      9a7235fa
  22. 21 Dec, 2020 1 commit
  23. 17 Dec, 2020 1 commit
  24. 27 Nov, 2020 1 commit
  25. 17 Nov, 2020 1 commit
  26. 14 Nov, 2020 1 commit
  27. 13 Nov, 2020 1 commit
  28. 07 Nov, 2020 1 commit
  29. 30 Oct, 2020 1 commit
    • nv-dlasalle's avatar
      [Dataloading] Add class for copying tensors to/from the GPU on a non-default stream (#2284) · f673fc25
      nv-dlasalle authored
      * Add async transferer class
      
      * Add async ndarray copy interface
      
      * Add python bindings
      
      * Fix comment
      
      * Add python class
      
      * Fix linting issues
      
      * Add python unit test
      
      * Update python interface
      
      * move async_transferer to cuda only directory
      
      * Fix linting issue
      
      * Move out of contrib
      
      * Add doc strings
      
      * Move test compute from backend
      
      * Update comment
      
      * Fix test naming
      
      * Fix argument usage
      
      * Wrap/unwrap backend parameters
      
      * Move to dataloading
      
      * Move to 'dataloading'
      
      * Make GPU/CPU compatible
      
      * Fix unit tests
      
      * Add docs
      
      * Use only backend interface for datamovement in unit test
      f673fc25
  30. 26 Aug, 2020 1 commit
  31. 10 Aug, 2020 1 commit
    • Da Zheng's avatar
      Fix the performance issue of graph partitioning in new DGLGraph (#1934) · 729ff2ef
      Da Zheng authored
      
      
      * fix perf.
      
      * fix.
      
      * accelerate metis.
      
      * fix lint.
      
      * use gklib.
      
      * fix perf.
      
      * fix.
      
      * update metis.
      
      * update launch script
      
      * handle synchronized API.
      
      * fix.
      
      * fix example.
      
      * fix dataloader.
      
      * temp fix.
      
      * temp fix omp.
      
      * distinguish roles.
      
      * initialize iterator of DistDataloader correctly.
      
      * check the correctness of launch script.
      
      * move feature copy to sampler.
      
      * measure mem/network copy time.
      
      * remove
      
      * Revert "measure mem/network copy time."
      
      This reverts commit 86cefdc14b7815fcf5aad6496af912dba48e4aa6.
      
      * fix.
      
      * fix
      
      * fix.
      
      * fix cmake.
      
      * disable metis in windows.
      
      * disable metis tests in windows.
      
      * remove test for multigraph.
      
      * fix test.
      
      * fix.
      
      * fix cmake.
      
      * fix.
      
      * revert.
      Co-authored-by: default avatarUbuntu <ubuntu@ip-172-31-19-115.us-west-2.compute.internal>
      Co-authored-by: default avatarUbuntu <ubuntu@ip-172-31-19-1.us-west-2.compute.internal>
      729ff2ef
  32. 09 Jul, 2020 1 commit
  33. 30 Jun, 2020 1 commit
  34. 28 Jun, 2020 1 commit
    • Minjie Wang's avatar
      [CUDA][Kernel] More CUDA kernels; Standardize the behavior for sorted COO/CSR (#1704) · 870da747
      Minjie Wang authored
      * add cub; array cumsum
      
      * CSRSliceRows
      
      * fix warning
      
      * operator << for ndarray; CSRSliceRows
      
      * add CSRIsSorted
      
      * add csr_sort
      
      * inplace coosort and outplace csrsort
      
      * WIP: coo is sorted
      
      * mv cuda_utils
      
      * add AllTrue utility
      
      * csr sort
      
      * coo sort
      
      * coo2csr for sorted coo arrays
      
      * CSRToCOO from sorted
      
      * pass tests for the new kernel changes
      
      * cannot use inplace sort
      
      * lint
      
      * try fix msvc error
      
      * Fix g.copy_to and g.asnumbits; ToBlock no longer uses CSC
      
      * stash
      
      * revert some hack
      
      * revert some changes
      
      * address comments
      
      * fix
      
      * fix to_block unittest
      
      * add todo note
      870da747
  35. 21 Jun, 2020 1 commit
    • Tong He's avatar
      [Op] Farthest Point Sampler in Cpp and CUDA (#1630) · 3d47693b
      Tong He authored
      * working framework without actual algorithm logic
      
      * rename
      
      * fix
      
      * fps passes compilation
      
      * correct algorithm
      
      * add cuda implementation
      
      * update random start
      
      * before refactor
      
      * pass compilation but cuda not working
      
      * working
      
      * code working, will add docstring
      
      * add mxnet support
      
      * update docstring
      
      * update doc and test
      
      * cpplint
      
      * cpcplint
      
      * pylint
      
      * temporary fix
      
      * fix for win64
      
      * fix unitetest
      
      * fix
      
      * fix
      
      * remove comment
      
      * move to geometry package
      
      * remove redundant include
      
      * add docstrings and comments
      
      * add proof
      
      * add validity check
      3d47693b
  36. 22 May, 2020 1 commit
    • Chao Ma's avatar
      [RPC] New RPC infrastructure. (#1549) · 3e696922
      Chao Ma authored
      
      
      * WIP: rpc components
      
      * client & server
      
      * move network package to rpc
      
      * fix include
      
      * fix compile
      
      * c api
      
      * wip: test
      
      * add basic tests
      
      * missing file
      
      * [RPC] Zero copy serializer (#1517)
      
      * zerocopy serialization
      
      * add test for HeteroGraph
      
      * fix lint
      
      * remove unnecessary codes
      
      * add comment
      
      * lint
      
      * lint
      
      * disable pylint for now
      
      * add include for win
      
      * windows guard
      
      * lint
      
      * lint
      
      * skip test on windows
      
      * refactor
      
      * add comment
      
      * fix
      
      * comment
      
      * 1111
      
      * fix
      
      * Update Jenkinsfile
      
      * [RPC] Implementation of RPC infra (#1544)
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * remove client.cc and server.cc
      
      * fix lint
      
      * update
      
      * update
      
      * fix linr
      
      * update
      
      * fix lint
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update test
      
      * update
      
      * update test
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update comment
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * fix lint
      
      * fix lint
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      Co-authored-by: default avatarMinjie Wang <wmjlyjemaine@gmail.com>
      Co-authored-by: default avatarJinjing Zhou <VoVAllen@users.noreply.github.com>
      3e696922
  37. 13 Mar, 2020 1 commit
  38. 08 Mar, 2020 1 commit
    • Da Zheng's avatar
      [Feature] add metis partitioning to DGL (#1308) · 0e153c4b
      Da Zheng authored
      
      
      * add metis.
      
      * add test.
      
      * construct partition id.
      
      * link to METIS github repo.
      
      * update metis.
      
      * add a tool for partitioning a graph.
      
      * update metis.
      
      * update.
      
      * update.
      
      * fix metis.
      
      * fix lint
      
      * fix indent.
      
      * another way of building metis.
      
      * disable metis in windows.
      
      * test windows
      
      * fix.
      
      * disable metis for windows properly.
      
      * fix for tensorflow.
      
      * skip test for gpu.
      
      * make graph symmetric
      
      * address comments.
      
      * more comments.
      
      * fix compile
      
      * fix a bug.
      
      * add test.
      
      * change the default #hops of HALO nodes.
      Co-authored-by: default avatarUbuntu <ubuntu@ip-172-31-26-167.us-east-2.compute.internal>
      0e153c4b
  39. 31 Jan, 2020 1 commit
    • Quan (Andy) Gan's avatar
      [Feature] Random Walk for 0.5 (#1209) · a9dabcc7
      Quan (Andy) Gan authored
      * trying to refactor IndexSelect
      
      * partial implementation
      
      * add index select and assign for floats as well
      
      * move to random choice source
      
      * more updates
      
      * fixes
      
      * fixes
      
      * more fixes
      
      * adding python impl
      
      * fixes
      
      * unit test
      
      * lint
      
      * lint x2
      
      * lint x3
      
      * update metapath2vec
      
      * debugging performance
      
      * still debugging for performance
      
      * tuning
      
      * switching to succvec
      
      * redo
      
      * revert non-uniform sampler to use vector
      
      * still not fast
      
      * why does this crash with OpenMP???
      
      * because there was a data race!!!
      
      * add documentations and remove assign op
      
      * lint
      
      * lint x2
      
      * lol what have i done
      
      * lint x3
      
      * fix and disable gpu testing
      
      * bugfix
      
      * generic random walk
      
      * reorg the random walk source code
      
      * Update randomwalks.h
      
      * Update randomwalks_cpu.cc
      
      * rename file
      
      * move internal function to anonymous ns
      
      * reorg & docstrings
      
      * constant restart probability
      
      * docstring fix
      
      * more commit
      
      * random walk with restart, tested
      
      * some fixes
      
      * switch to using NDArray for choice
      
      * massive fix & docstring
      
      * lint x?
      
      * lint x??
      
      * fix
      
      * export symbols
      
      * skip gpu test
      
      * addresses comments
      
      * replaces another VecToIdArray
      
      * add randomwalks.h to include
      
      * replace void * with template
      a9dabcc7