1. 10 Aug, 2020 1 commit
    • Fix the performance issue of graph partitioning in new DGLGraph (#1934) · 729ff2ef
      Da Zheng authored
      
      
      * fix perf.
      
      * fix.
      
      * accelerate metis.
      
      * fix lint.
      
      * use gklib.
      
      * fix perf.
      
      * fix.
      
      * update metis.
      
      * update launch script
      
      * handle synchronized API.
      
      * fix.
      
      * fix example.
      
      * fix dataloader.
      
      * temp fix.
      
      * temp fix omp.
      
      * distinguish roles.
      
      * initialize iterator of DistDataloader correctly.
      
      * check the correctness of launch script.
      
      * move feature copy to sampler.
      
      * measure mem/network copy time.
      
      * remove
      
      * Revert "measure mem/network copy time."
      
      This reverts commit 86cefdc14b7815fcf5aad6496af912dba48e4aa6.
      
      * fix.
      
      * fix
      
      * fix.
      
      * fix cmake.
      
      * disable metis in windows.
      
      * disable metis tests in windows.
      
      * remove test for multigraph.
      
      * fix test.
      
      * fix.
      
      * fix cmake.
      
      * fix.
      
      * revert.
      Co-authored-by: Ubuntu <ubuntu@ip-172-31-19-115.us-west-2.compute.internal>
      Co-authored-by: Ubuntu <ubuntu@ip-172-31-19-1.us-west-2.compute.internal>
  2. 09 Jul, 2020 1 commit
  3. 30 Jun, 2020 1 commit
  4. 28 Jun, 2020 1 commit
    • [CUDA][Kernel] More CUDA kernels; Standardize the behavior for sorted COO/CSR (#1704) · 870da747
      Minjie Wang authored
      * add cub; array cumsum
      
      * CSRSliceRows
      
      * fix warning
      
      * operator << for ndarray; CSRSliceRows
      
      * add CSRIsSorted
      
      * add csr_sort
      
      * inplace coosort and outplace csrsort
      
      * WIP: coo is sorted
      
      * mv cuda_utils
      
      * add AllTrue utility
      
      * csr sort
      
      * coo sort
      
      * coo2csr for sorted coo arrays
      
      * CSRToCOO from sorted
      
      * pass tests for the new kernel changes
      
      * cannot use inplace sort
      
      * lint
      
      * try fix msvc error
      
      * Fix g.copy_to and g.asnumbits; ToBlock no longer uses CSC
      
      * stash
      
      * revert some hack
      
      * revert some changes
      
      * address comments
      
      * fix
      
      * fix to_block unittest
      
      * add todo note
  5. 21 Jun, 2020 1 commit
    • [Op] Farthest Point Sampler in Cpp and CUDA (#1630) · 3d47693b
      Tong He authored
      * working framework without actual algorithm logic
      
      * rename
      
      * fix
      
      * fps passes compilation
      
      * correct algorithm
      
      * add cuda implementation
      
      * update random start
      
      * before refactor
      
      * pass compilation but cuda not working
      
      * working
      
      * code working, will add docstring
      
      * add mxnet support
      
      * update docstring
      
      * update doc and test
      
      * cpplint
      
      * cpplint
      
      * pylint
      
      * temporary fix
      
      * fix for win64
      
      * fix unittest
      
      * fix
      
      * fix
      
      * remove comment
      
      * move to geometry package
      
      * remove redundant include
      
      * add docstrings and comments
      
      * add proof
      
      * add validity check
  6. 22 May, 2020 1 commit
    • [RPC] New RPC infrastructure. (#1549) · 3e696922
      Chao Ma authored
      
      
      * WIP: rpc components
      
      * client & server
      
      * move network package to rpc
      
      * fix include
      
      * fix compile
      
      * c api
      
      * wip: test
      
      * add basic tests
      
      * missing file
      
      * [RPC] Zero copy serializer (#1517)
      
      * zerocopy serialization
      
      * add test for HeteroGraph
      
      * fix lint
      
      * remove unnecessary codes
      
      * add comment
      
      * lint
      
      * lint
      
      * disable pylint for now
      
      * add include for win
      
      * windows guard
      
      * lint
      
      * lint
      
      * skip test on windows
      
      * refactor
      
      * add comment
      
      * fix
      
      * comment
      
      * 1111
      
      * fix
      
      * Update Jenkinsfile
      
      * [RPC] Implementation of RPC infra (#1544)
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * remove client.cc and server.cc
      
      * fix lint
      
      * update
      
      * update
      
      * fix lint
      
      * update
      
      * fix lint
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update test
      
      * update
      
      * update test
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update comment
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * fix lint
      
      * fix lint
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      Co-authored-by: Minjie Wang <wmjlyjemaine@gmail.com>
      Co-authored-by: Jinjing Zhou <VoVAllen@users.noreply.github.com>
  7. 13 Mar, 2020 1 commit
  8. 08 Mar, 2020 1 commit
    • [Feature] add metis partitioning to DGL (#1308) · 0e153c4b
      Da Zheng authored
      
      
      * add metis.
      
      * add test.
      
      * construct partition id.
      
      * link to METIS github repo.
      
      * update metis.
      
      * add a tool for partitioning a graph.
      
      * update metis.
      
      * update.
      
      * update.
      
      * fix metis.
      
      * fix lint
      
      * fix indent.
      
      * another way of building metis.
      
      * disable metis in windows.
      
      * test windows
      
      * fix.
      
      * disable metis for windows properly.
      
      * fix for tensorflow.
      
      * skip test for gpu.
      
      * make graph symmetric
      
      * address comments.
      
      * more comments.
      
      * fix compile
      
      * fix a bug.
      
      * add test.
      
      * change the default #hops of HALO nodes.
      Co-authored-by: Ubuntu <ubuntu@ip-172-31-26-167.us-east-2.compute.internal>
  9. 31 Jan, 2020 1 commit
    • [Feature] Random Walk for 0.5 (#1209) · a9dabcc7
      Quan (Andy) Gan authored
      * trying to refactor IndexSelect
      
      * partial implementation
      
      * add index select and assign for floats as well
      
      * move to random choice source
      
      * more updates
      
      * fixes
      
      * fixes
      
      * more fixes
      
      * adding python impl
      
      * fixes
      
      * unit test
      
      * lint
      
      * lint x2
      
      * lint x3
      
      * update metapath2vec
      
      * debugging performance
      
      * still debugging for performance
      
      * tuning
      
      * switching to succvec
      
      * redo
      
      * revert non-uniform sampler to use vector
      
      * still not fast
      
      * why does this crash with OpenMP???
      
      * because there was a data race!!!
      
      * add documentations and remove assign op
      
      * lint
      
      * lint x2
      
      * lol what have i done
      
      * lint x3
      
      * fix and disable gpu testing
      
      * bugfix
      
      * generic random walk
      
      * reorg the random walk source code
      
      * Update randomwalks.h
      
      * Update randomwalks_cpu.cc
      
      * rename file
      
      * move internal function to anonymous ns
      
      * reorg & docstrings
      
      * constant restart probability
      
      * docstring fix
      
      * more commit
      
      * random walk with restart, tested
      
      * some fixes
      
      * switch to using NDArray for choice
      
      * massive fix & docstring
      
      * lint x?
      
      * lint x??
      
      * fix
      
      * export symbols
      
      * skip gpu test
      
      * addresses comments
      
      * replaces another VecToIdArray
      
      * add randomwalks.h to include
      
      * replace void * with template
  10. 20 Dec, 2019 1 commit
    • [Backend] TF backend (#978) · d30a69bf
      VoVAllen authored
      * tf
      
      * add builtin support
      
      * fix
      
      * pytest
      
      * fix
      
      * fix
      
      * fix some bugs
      
      * fix selecting
      
      * fix todo
      
      * fix test
      
      * fix test fail in tf
      
      * fix
      
      * fix
      
      * fix gather row
      
      * fix gather row
      
      * log backend
      
      * fix gather row
      
      * fix gather row
      
      * fix for pytorch
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix tests
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix convert
      
      * fix
      
      * fix
      
      * fix
      
      * fix inplace
      
      * add alignment setting
      
      * add debug option
      
      * Revert "add alignment setting"
      
      This reverts commit ec63fb3506ea84fff7d447a1fbdfd1d5d1fb6110.
      
      * tf ci
      
      * fix lint
      
      * fix lint
      
      * add tfdlpack
      
      * fix type
      
      * add env
      
      * fix backend
      
      * fix
      
      * fix tests
      
      * remove one_hot
      
      * remove comment
      
      * remove comment
      
      * fix
      
      * use pip to install all
      
      * fix test
      
      * fix base
      
      * fix
      
      * fix
      
      * add skip
      
      * upgrade cmake
      
      * change version
      
      * change ci
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix seg fault
      
      * fix
      
      * fix python version
      
      * fix
      
      * try fix
      
      * fix
      
      * fix
      
      * tf takes longer time in ci
      
      * change py version
      
      * fix
      
      * fix
      
      * fix oom
      
      * change kg env
      
      * change kg env
      
      * Aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaah

      * I am never messing with all these messy environments again...
      
      * use pytest
      
      * Change image
  11. 09 Sep, 2019 1 commit
    • [Feature] Data format (#728) · 0fb13f7b
      VoVAllen authored
      * Add serialization
      
      * add serialization
      
      * add serialization
      
      * lalalalalalalala
      
      * lalalalalalalala
      
      * serialize
      
      * serialize
      
      * nnn
      
      * WIP: import tvm runtime node system
      
      * WIP: object system
      
      * containers
      
      * tested basic container composition
      
      * tested custom object
      
      * tmp
      
      * fix setattr bug
      
      * tested object container return
      
      * fix lint
      
      * some comments about get/set state
      
      * fix lint
      
      * fix lint
      
      * update cython
      
      * fix cython
      
      * ffi doc
      
      * fix doc
      
      * WIP: using object system for graph
      
      * c++ side refactoring done; compiled
      
      * remove stale apis
      
      * fix bug in DGLGraphCreate; passed test_graph.py
      
      * fix bug in python modify; passed utest for pytorch/cpu
      
      * fix lint
      
      * Add serialization
      
      * Add serialization
      
      * fix
      
      * fix typo
      
      * serialize with new ffi
      
      * commit
      
      * commit
      
      * commit
      
      * save
      
      * save
      
      * save
      
      * save
      
      * commit
      
      * clean
      
      * Delete tt2.py
      
      * fix lint
      
      * Add serialization
      
      * fix lint 2
      
      * fix lint
      
      * fix lint
      
      * fix lint
      
      * fix lint
      
      * Fix Lint
      
      * Add serialization
      
      * Change to Macro
      
      * fix
      
      * fix
      
      * fix bugs
      
      * refactor
      
      * refactor
      
      * updating dmlc-core to include force flag
      
      * trying tempfile
      
      * delete leaked pointer
      
      * Fix assert
      
      * fix assert
      
      * add comment and test case
      
      * add graph labels
      
      * add load labels
      
      * lint
      
      * lint
      
      * add graph labels
      
      * lint
      
      * fix windows
      
      * fix
      
      * update dmlc-core to latest
      
      * fix
      
      * fix camel naming
  12. 17 Jul, 2019 1 commit
    • [Refactor] Separating graph and sparse matrix operations (#699) · b0d9e7aa
      Minjie Wang authored
      * WIP: array refactoring
      
      * WIP: implementation
      
      * wip
      
      * most csr part
      
      * WIP: on coo
      
      * WIP: coo
      
      * finish refactoring immutable graph
      
      * compiled
      
      * fix undefined ndarray copy bug; add COOToCSR when coo has no data array
      
      * fix bug in COOToCSR
      
      * fix bug in CSR constructor
      
      * fix bug in in_edges(vid)
      
      * fix OutEdges bug
      
      * pass test_graph
      
      * pass test_graph
      
      * fix bug in CSR constructor
      
      * fix bug in CSR constructor
      
      * fix bug in CSR constructor
      
      * fix stupid bug
      
      * pass gpu test
      
      * remove debug printout
      
      * fix lint
      
      * rm bipartite graph
      
      * fix lint
      
      * address comments
      
      * fix bug in Clone
      
      * cpp utests
  13. 01 Jul, 2019 1 commit
    • [FFI] FFI container support, custom structure extension via Object (#693) · 90e78c58
      Minjie Wang authored
      * WIP: import tvm runtime node system
      
      * WIP: object system
      
      * containers
      
      * tested basic container composition
      
      * tested custom object
      
      * fix setattr bug
      
      * tested object container return
      
      * fix lint
      
      * some comments about get/set state
      
      * fix lint
      
      * fix lint
      
      * update cython
      
      * fix cython
      
      * ffi doc
      
      * fix doc
  14. 09 Jun, 2019 1 commit
    • [Examples] Run all the examples (#623) · dec8b49b
      Minjie Wang authored
      * all pytorch examples
      
      * scan through mxnet examples
      
      * change reddit data
      
      * tweak numerical range for unittest
      
      * fix ci
      
      * fix ci
      
      * fix
      
      * add seed to workaround
  15. 07 Jun, 2019 1 commit
  16. 06 Jun, 2019 1 commit
    • [Feature][Kernel] DGL kernel support (#596) · 653428bd
      Lingfan Yu authored
      * [Kernel] Minigun integration and fused kernel support (#519)
      
      * kernel interface
      
      * add minigun
      
      * Add cuda build
      
      * functors
      
      * working on binary elewise
      
      * binary reduce
      
      * change kernel interface
      
      * WIP
      
      * wip
      
      * fix minigun
      
      * compile
      
      * binary reduce kernels
      
      * compile
      
      * simple test passed
      
      * more reducers
      
      * fix thrust problem
      
      * fix cmake
      
      * fix cmake; add proper guard for atomic
      
      * WIP: bcast
      
      * WIP
      
      * bcast kernels
      
      * update to new minigun pass-by-value practice
      
      * broadcasting dim
      
      * add copy src and copy edge
      
      * fix linking
      
      * fix none array problem
      
      * fix copy edge
      
      * add device_type and device_id to backend operator
      
      * cache csr adj, remove cache for adjmat and incmat
      
      * custom ops in backend and pytorch impl
      
      * change dgl-mg kernel python interface
      
      * add id_mapping var
      
      * clean up plus v2e spmv schedule
      
      * spmv schedule & clean up fall back
      
      * symbolic message and reduce func, remove bundle func
      
      * new executors
      
      * new backend interface for dgl kernels and pytorch impl
      
      * minor fix
      
      * fix
      
      * fix docstring, comments, func names
      
      * nodeflow
      
      * fix message id mapping and bugs...
      
      * pytorch test case & fix
      
      * backward binary reduce
      
      * fix bug
      
      * WIP: cusparse
      
      * change to int32 csr for cusparse workaround
      
      * disable cusparse
      
      * change back to int64
      
      * broadcasting backward
      
      * cusparse; WIP: add rev_csr
      
      * unit test for kernels
      
      * pytorch backward with dgl kernel
      
      * edge softmax
      
      * fix backward
      
      * improve softmax
      
      * cache edge on device
      
      * cache mappings on device
      
      * fix partial forward code
      
      * cusparse done
      
      * copy_src_sum with cusparse
      
      * rm id getter
      
      * reduce grad for broadcast
      
      * copy edge reduce backward
      
      * kernel unit test for broadcasting
      
      * full kernel unit test
      
      * add cpu kernels
      
      * edge softmax unit test
      
      * missing ref
      
      * fix compile and small bugs
      
      * fix bug in bcast
      
      * Add backward both
      
      * fix torch utests
      
      * expose infershape
      
      * create out tensor in python
      
      * fix c++ lint
      
      * [Kernel] Add GPU utest and kernel utest (#524)
      
      * fix gpu utest
      
      * cuda utest runnable
      
      * temp disable test nodeflow; unified test for kernel
      
      * cuda test kernel done
      
      * [Kernel] Update kernel branch (#550)
      
      * [Model] add multiprocessing training with sampling. (#484)
      
      * reorganize sampling code.
      
      * add multi-process training.
      
      * speed up gcn_cv
      
      * fix graphsage_cv.
      
      * add new API in graph store.
      
      * update barrier impl.
      
      * support both local and distributed training.
      
      * fix multiprocess train.
      
      * fix.
      
      * fix barrier.
      
      * add script for loading data.
      
      * multiprocessing sampling.
      
      * accel training.
      
      * replace pull with spmv for speedup.
      
      * nodeflow copy from parent with context.
      
      * enable GPU.
      
      * fix a bug in graph store.
      
      * enable multi-GPU training.
      
      * fix lint.
      
      * add comments.
      
      * rename to run_store_server.py
      
      * fix gcn_cv.
      
      * fix a minor bug in sampler.
      
      * handle error better in graph store.
      
      * improve graphsage_cv for distributed mode.
      
      * update README.
      
      * fix.
      
      * update.
      
      * [Tutorial] add sampling tutorial. (#522)
      
      * add sampling tutorial.
      
      * add readme
      
      * update author list.
      
      * fix indent in the code.
      
      * rename the file.
      
      * update tutorial.
      
      * fix the last API.
      
      * update image.
      
      * [BUGFIX] fix the problems in the sampling tutorial. (#523)
      
      * add index.
      
      * update.
      
      * update tutorial.
      
      * fix gpu utest
      
      * cuda utest runnable
      
      * temp disable test nodeflow; unified test for kernel
      
      * cuda test kernel done
      
      * Fixing typo in JTNN after interface change (#536)
      
      * [BugFix] Fix getting src and dst id of ALL edges in NodeFlow.apply_block (#515)
      
      * [Bug Fix] Fix inplace op at backend (#546)
      
      * Fix inplace operation
      
      * fix line separator
      
      * [Feature] Add batch and unbatch for immutable graph (#539)
      
      * Add batch and unbatch for immutable graph
      
      * fix line separator
      
      * fix lint
      
      * remove unnecessary include
      
      * fix code review
      
      * [BUGFix] Improve multi-processing training (#526)
      
      * fix.
      
      * add comment.
      
      * remove.
      
      * temp fix.
      
      * initialize for shared memory.
      
      * fix graphsage.
      
      * fix gcn.
      
      * add more unit tests.
      
      * add more tests.
      
      * avoid creating shared-memory exclusively.
      
      * redefine remote initializer.
      
      * improve initializer.
      
      * fix unit test.
      
      * fix lint.
      
      * fix lint.
      
      * initialize data in the graph store server properly.
      
      * fix test.
      
      * fix test.
      
      * fix test.
      
      * small fix.
      
      * add comments.
      
      * cleanup server.
      
      * test graph store with a random port.
      
      * print.
      
      * print to stderr.
      
      * test1
      
      * test2
      
      * remove comment.
      
      * adjust the initializer signature.
      
      * [API] update graph store API. (#549)
      
      * add init_ndata and init_edata in DGLGraph.
      
      * adjust SharedMemoryGraph API.
      
      * print warning.
      
      * fix comment.
      
      * update example
      
      * fix.
      
      * fix examples.
      
      * add unit tests.
      
      * add comments.
      
      * [Refactor] Immutable graph index (#543)
      
      * WIP
      
      * header
      
      * WIP .cc
      
      * WIP
      
      * transpose
      
      * wip
      
      * immutable graph .h and .cc
      
      * WIP: nodeflow.cc
      
      * compile
      
      * remove all tmp dl managed ctx; they caused refcount issue
      
      * one simple test
      
      * WIP: testing
      
      * test_graph
      
      * fix graph index
      
      * fix bug in sampler; pass pytorch utest
      
      * WIP on mxnet
      
      * fix lint
      
      * fix mxnet unittest w/ unfortunate workaround
      
      * fix msvc
      
      * fix lint
      
      * SliceRows and test_nodeflow
      
      * resolve reviews
      
      * resolve reviews
      
      * try fix win ci
      
      * try fix win ci
      
      * poke win ci again
      
      * poke
      
      * lazy multigraph flag; stackoverflow error
      
      * revert node subgraph test
      
      * lazy object
      
      * try fix win build
      
      * try fix win build
      
      * poke ci
      
      * fix build script
      
      * fix compile
      
      * add a todo
      
      * fix reviews
      
      * fix compile
      
      * [Kernel] Update kernel branch (#576)
      
      * [Model] add multiprocessing training with sampling. (#484)
      
      * reorganize sampling code.
      
      * add multi-process training.
      
      * speed up gcn_cv
      
      * fix graphsage_cv.
      
      * add new API in graph store.
      
      * update barrier impl.
      
      * support both local and distributed training.
      
      * fix multiprocess train.
      
      * fix.
      
      * fix barrier.
      
      * add script for loading data.
      
      * multiprocessing sampling.
      
      * accel training.
      
      * replace pull with spmv for speedup.
      
      * nodeflow copy from parent with context.
      
      * enable GPU.
      
      * fix a bug in graph store.
      
      * enable multi-GPU training.
      
      * fix lint.
      
      * add comments.
      
      * rename to run_store_server.py
      
      * fix gcn_cv.
      
      * fix a minor bug in sampler.
      
      * handle error better in graph store.
      
      * improve graphsage_cv for distributed mode.
      
      * update README.
      
      * fix.
      
      * update.
      
      * [Tutorial] add sampling tutorial. (#522)
      
      * add sampling tutorial.
      
      * add readme
      
      * update author list.
      
      * fix indent in the code.
      
      * rename the file.
      
      * update tutorial.
      
      * fix the last API.
      
      * update image.
      
      * [BUGFIX] fix the problems in the sampling tutorial. (#523)
      
      * add index.
      
      * update.
      
      * update tutorial.
      
      * fix gpu utest
      
      * cuda utest runnable
      
      * temp disable test nodeflow; unified test for kernel
      
      * cuda test kernel done
      
      * Fixing typo in JTNN after interface change (#536)
      
      * [BugFix] Fix getting src and dst id of ALL edges in NodeFlow.apply_block (#515)
      
      * [Bug Fix] Fix inplace op at backend (#546)
      
      * Fix inplace operation
      
      * fix line separator
      
      * [Feature] Add batch and unbatch for immutable graph (#539)
      
      * Add batch and unbatch for immutable graph
      
      * fix line separator
      
      * fix lint
      
      * remove unnecessary include
      
      * fix code review
      
      * [BUGFix] Improve multi-processing training (#526)
      
      * fix.
      
      * add comment.
      
      * remove.
      
      * temp fix.
      
      * initialize for shared memory.
      
      * fix graphsage.
      
      * fix gcn.
      
      * add more unit tests.
      
      * add more tests.
      
      * avoid creating shared-memory exclusively.
      
      * redefine remote initializer.
      
      * improve initializer.
      
      * fix unit test.
      
      * fix lint.
      
      * fix lint.
      
      * initialize data in the graph store server properly.
      
      * fix test.
      
      * fix test.
      
      * fix test.
      
      * small fix.
      
      * add comments.
      
      * cleanup server.
      
      * test graph store with a random port.
      
      * print.
      
      * print to stderr.
      
      * test1
      
      * test2
      
      * remove comment.
      
      * adjust the initializer signature.
      
      * [API] update graph store API. (#549)
      
      * add init_ndata and init_edata in DGLGraph.
      
      * adjust SharedMemoryGraph API.
      
      * print warning.
      
      * fix comment.
      
      * update example
      
      * fix.
      
      * fix examples.
      
      * add unit tests.
      
      * add comments.
      
      * [Refactor] Immutable graph index (#543)
      
      * WIP
      
      * header
      
      * WIP .cc
      
      * WIP
      
      * transpose
      
      * wip
      
      * immutable graph .h and .cc
      
      * WIP: nodeflow.cc
      
      * compile
      
      * remove all tmp dl managed ctx; they caused refcount issue
      
      * one simple test
      
      * WIP: testing
      
      * test_graph
      
      * fix graph index
      
      * fix bug in sampler; pass pytorch utest
      
      * WIP on mxnet
      
      * fix lint
      
      * fix mxnet unittest w/ unfortunate workaround
      
      * fix msvc
      
      * fix lint
      
      * SliceRows and test_nodeflow
      
      * resolve reviews
      
      * resolve reviews
      
      * try fix win ci
      
      * try fix win ci
      
      * poke win ci again
      
      * poke
      
      * lazy multigraph flag; stackoverflow error
      
      * revert node subgraph test
      
      * lazy object
      
      * try fix win build
      
      * try fix win build
      
      * poke ci
      
      * fix build script
      
      * fix compile
      
      * add a todo
      
      * fix reviews
      
      * fix compile
      
      * all demo use python-3 (#555)
      
      * [DEMO] Reproduce numbers of distributed training in AMLC giant graph paper (#556)
      
      * update
      
      * update
      
      * update
      
      * update num_hops
      
      * fix bug
      
      * update
      
      * report numbers of distributed training in AMLC giant graph paper
      
      * [DEMO] Remove duplicate code for sampling (#557)
      
      * update
      
      * update
      
      * re-use single-machine code
      
      * update
      
      * use relative path
      
      * update
      
      * update
      
      * update
      
      * add __init__.py
      
      * add __init__.py
      
      * import sys, os
      
      * fix typo
      
      * update
      
      * [Perf] Improve performance of graph store. (#554)
      
      * fix.
      
      * use inplace.
      
      * move to shared memory graph store.
      
      * fix.
      
      * add more unit tests.
      
      * fix.
      
      * fix test.
      
      * fix test.
      
      * disable test.
      
      * fix.
      
      * [BUGFIX] fix a bug in edge_ids (#560)
      
      * add test.
      
      * fix compute.
      
      * fix test.
      
      * turn on test.
      
      * fix a bug.
      
      * add test.
      
      * fix.
      
      * disable test.
      
      * [DEMO] Add Pytorch demo for distributed sampler (#562)
      
      * update
      
      * update
      
      * update
      
      * add sender
      
      * update
      
      * remove duplicate code
      
      * [Test] Add gtest to project (#547)
      
      * add gtest module
      
      * add gtest
      
      * fix
      
      * Update CMakeLists.txt
      
      * Update README.md
      
      * [Perf] lazily create msg_index. (#563)
      
      * lazily create msg_index.
      
      * update test.
      
      * [BUGFIX] fix bugs for running GCN on giant graphs. (#561)
      
      * load mxnet csr.
      
      * enable load large csr.
      
      * fix
      
      * fix.
      
      * fix int overflow.
      
      * fix test.
      
      * [BugFix] Fix error when bfs_level = 0 in Entity Classification with RGCN (#559)
      
      * [DEMO] Update demo of distributed sampler (#564)
      
      * update
      
      * update
      
      * update demo
      
      * add network cpp test (#565)
      
      * Add unittest for C++ RPC (#566)
      
      * [CI] Fix CI for cpp test (#570)
      
      * fix CI for cpp test
      
      * update port number
      
      * [Docker] update docker image (#575)
      
      * update docker image
      
      * specify lint version
      
      * rm torch import from unified tests
      
      * [Kernel][Scheduler][MXNet] Scheduler for DGL kernels and MXNet backend support (#541)
      
      * [Model] add multiprocessing training with sampling. (#484)
      
      * reorganize sampling code.
      
      * add multi-process training.
      
      * speed up gcn_cv
      
      * fix graphsage_cv.
      
      * add new API in graph store.
      
      * update barrier impl.
      
      * support both local and distributed training.
      
      * fix multiprocess train.
      
      * fix.
      
      * fix barrier.
      
      * add script for loading data.
      
      * multiprocessing sampling.
      
      * accel training.
      
      * replace pull with spmv for speedup.
      
      * nodeflow copy from parent with context.
      
      * enable GPU.
      
      * fix a bug in graph store.
      
      * enable multi-GPU training.
      
      * fix lint.
      
      * add comments.
      
      * rename to run_store_server.py
      
      * fix gcn_cv.
      
      * fix a minor bug in sampler.
      
      * handle error better in graph store.
      
      * improve graphsage_cv for distributed mode.
      
      * update README.
      
      * fix.
      
      * update.
      
      * [Tutorial] add sampling tutorial. (#522)
      
      * add sampling tutorial.
      
      * add readme
      
      * update author list.
      
      * fix indent in the code.
      
      * rename the file.
      
      * update tutorial.
      
      * fix the last API.
      
      * update image.
      
      * [BUGFIX] fix the problems in the sampling tutorial. (#523)
      
      * add index.
      
      * update.
      
      * update tutorial.
      
      * fix gpu utest
      
      * cuda utest runnable
      
      * temp disable test nodeflow; unified test for kernel
      
      * cuda test kernel done
      
      * edge softmax module
      
      * WIP
      
      * Fixing typo in JTNN after interface change (#536)
      
      * mxnet backend support
      
      * improve reduce grad
      
      * add max to unittest backend
      
      * fix kernel unittest
      
      * [BugFix] Fix getting src and dst id of ALL edges in NodeFlow.apply_block (#515)
      
      * lint
      
      * lint
      
      * win build
      
      * [Bug Fix] Fix inplace op at backend (#546)
      
      * Fix inplace operation
      
      * fix line separator
      
      * [Feature] Add batch and unbatch for immutable graph (#539)
      
      * Add batch and unbatch for immutable graph
      
      * fix line separator
      
      * fix lint
      
      * remove unnecessary include
      
      * fix code review
      
      * [BUGFix] Improve multi-processing training (#526)
      
      * fix.
      
      * add comment.
      
      * remove.
      
      * temp fix.
      
      * initialize for shared memory.
      
      * fix graphsage.
      
      * fix gcn.
      
      * add more unit tests.
      
      * add more tests.
      
      * avoid creating shared-memory exclusively.
      
      * redefine remote initializer.
      
      * improve initializer.
      
      * fix unit test.
      
      * fix lint.
      
      * fix lint.
      
      * initialize data in the graph store server properly.
      
      * fix test.
      
      * fix test.
      
      * fix test.
      
      * small fix.
      
      * add comments.
      
      * cleanup server.
      
      * test graph store with a random port.
      
      * print.
      
      * print to stderr.
      
      * test1
      
      * test2
      
      * remove comment.
      
      * adjust the initializer signature.
      
      * try
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * try
      
      * test
      
      * test
      
      * test
      
      * try
      
      * try
      
      * try
      
      * test
      
      * fix
      
      * try gen_target
      
      * fix gen_target
      
      * fix msvc var_args expand issue
      
      * fix
      
      * [API] update graph store API. (#549)
      
      * add init_ndata and init_edata in DGLGraph.
      
      * adjust SharedMemoryGraph API.
      
      * print warning.
      
      * fix comment.
      
      * update example
      
      * fix.
      
      * fix examples.
      
      * add unit tests.
      
      * add comments.
      
      * [Refactor] Immutable graph index (#543)
      
      * WIP
      
      * header
      
      * WIP .cc
      
      * WIP
      
      * transpose
      
      * wip
      
      * immutable graph .h and .cc
      
      * WIP: nodeflow.cc
      
      * compile
      
      * remove all tmp dl managed ctx; they caused refcount issue
      
      * one simple test
      
      * WIP: testing
      
      * test_graph
      
      * fix graph index
      
      * fix bug in sampler; pass pytorch utest
      
      * WIP on mxnet
      
      * fix lint
      
      * fix mxnet unittest w/ unfortunate workaround
      
      * fix msvc
      
      * fix lint
      
      * SliceRows and test_nodeflow
      
      * resolve reviews
      
      * resolve reviews
      
      * try fix win ci
      
      * try fix win ci
      
      * poke win ci again
      
      * poke
      
      * lazy multigraph flag; stackoverflow error
      
      * revert node subgraph test
      
      * lazy object
      
      * try fix win build
      
      * try fix win build
      
      * poke ci
      
      * fix build script
      
      * fix compile
      
      * add a todo
      
      * fix reviews
      
      * fix compile
      
      * WIP
      
      * WIP
      
      * all demo use python-3 (#555)
      
      * ToImmutable and CopyTo
      
      * [DEMO] Reproduce numbers of distributed training in AMLC giant graph paper (#556)
      
      * update
      
      * update
      
      * update
      
      * update num_hops
      
      * fix bug
      
      * update
      
      * report numbers of distributed training in AMLC giant graph paper
      
      * [DEMO] Remove duplicate code for sampling (#557)
      
      * update
      
      * update
      
      * re-use single-machine code
      
      * update
      
      * use relative path
      
      * update
      
      * update
      
      * update
      
      * add __init__.py
      
      * add __init__.py
      
      * import sys, os
      
      * fix typo
      
      * update
      
      * [Perf] Improve performance of graph store. (#554)
      
      * fix.
      
      * use inplace.
      
      * move to shared memory graph store.
      
      * fix.
      
      * add more unit tests.
      
      * fix.
      
      * fix test.
      
      * fix test.
      
      * disable test.
      
      * fix.
      
      * [BUGFIX] fix a bug in edge_ids (#560)
      
      * add test.
      
      * fix compute.
      
      * fix test.
      
      * turn on test.
      
      * fix a bug.
      
      * add test.
      
      * fix.
      
      * disable test.
      
      * DGLRetValue DGLContext conversion
      
      * [DEMO] Add Pytorch demo for distributed sampler (#562)
      
      * update
      
      * update
      
      * update
      
      * add sender
      
      * update
      
      * remove duplicate code
      
      * [Test] Add gtest to project (#547)
      
      * add gtest module
      
      * add gtest
      
      * fix
      
      * Update CMakeLists.txt
      
      * Update README.md
      
      * Add support to convert immutable graph to 32 bits
      
      * [Perf] lazily create msg_index. (#563)
      
      * lazily create msg_index.
      
      * update test.
      
      * fix binary reduce following new minigun template
      
      * enable both int64 and int32 kernels
      
      * [BUGFIX] fix bugs for running GCN on giant graphs. (#561)
      
      * load mxnet csr.
      
      * enable load large csr.
      
      * fix
      
      * fix.
      
      * fix int overflow.
      
      * fix test.
      
      * new kernel interface done for CPU
      
      * docstring
      
      * rename & docstring
      
      * copy reduce and backward
      
      * [BugFix] Fix error when bfs_level = 0 in Entity Classification with RGCN (#559)
      
      * [DEMO] Update demo of distributed sampler (#564)
      
      * update
      
      * update
      
      * update demo
      
      * adapt cuda kernels to the new interface
      
      * add network cpp test (#565)
      
      * fix bug
      
      * Add unittest for C++ RPC (#566)
      
      * [CI] Fix CI for cpp test (#570)
      
      * fix CI for cpp test
      
      * update port number
      
      * [Docker] update docker image (#575)
      
      * update docker image
      
      * specify lint version
      
      * rm torch import from unified tests
      
      * remove pytorch-specific test_function
      
      * fix unittest
      
      * fix
      
      * fix unittest backend bug in converting tensor to numpy array
      
      * fix
      
      * mxnet version
      
      * [BUGFIX] fix for MXNet 1.5. (#552)
      
      * remove clone.
      
      * turn on numpy compatible.
      
      * Revert "remove clone."
      
      This reverts commit 17bbf76ed72ff178df6b3f35addc428048672457.
      
      * revert format changes
      
      * fix mxnet api name
      
      * revert mistakes in previous revert
      
      * roll back CI to 20190523 build
      
      * fix unittest
      
      * disable test_shared_mem_store.py for now
      
      * remove mxnet/test_specialization.py
      
      * sync win64 test script
      
      * fix lowercase
      
      * missing backend in gpu unit test
      
      * transpose to get forward graph
      
      * pass update all
      
      * add sanity check
      
      * passing test_specialization.py
      
      * fix and pass test_function
      
      * fix check
      
      * fix pytorch softmax
      
      * mxnet kernels
      
      * c++ lint
      
      * pylint
      
      * try
      
      * win build
      
      * fix
      
      * win
      
      * ci enable gpu build
      
      * init submodule recursively
      
      * backend docstring
      
      * try
      
      * test win dev
      
      * doc string
      
      * disable pytorch test_nn
      
      * try to fix windows issue
      
      * bug fixed, revert changes
      
      * [Test] fix CI. (#586)
      
      * disable unit test in mxnet tutorial.
      
      * retry socket connection.
      
      * roll back to set_np_compat
      
      * try to fix multi-processing test hangs when it fails.
      
      * fix test.
      
      * fix.
      
      * doc string
      
      * doc string and clean up
      
      * missing field in ctypes
      
      * fix node flow schedule and unit test
      
      * rename
      
      * pylint
      
      * copy from parent default context
      
      * fix unit test script
      
      * fix
      
      * demo bug in nodeflow gpu test
      
      * [Kernel][Bugfix] fix nodeflow bug (#604)
      
      * fix nodeflow bug
      
      * remove debug code
      
      * add build gtest option
      
      * fix cmake; fix graph index bug in spmv.py
      
      * remove clone
      
      * fix div rhs grad bug
      
      * [Kernel] Support full builtin method, edge softmax and unit tests (#605)
      
      * add full builtin support
      
      * unit test
      
      * unit test backend
      
      * edge softmax
      
      * apply edge with builtin
      
      * fix kernel unit test
      
      * disable mxnet test_shared_mem_store
      
      * gen builtin reduce
      
      * enable mxnet gpu unittest
      
      * revert some changes
      
      * docstring
      
      * add note for the hack
      
      * [Kernel][Unittest][CI] Fix MXNet GPU CI (#607)
      
      * update docker image for MXNet GPU CI
      
      * force all dgl graph input and output on CPU
      
      * fix gpu unittest
      
      * speedup compilation
      
      * add some comments
      
      * lint
      
      * add more comments
      
      * fix as requested
      
      * add some comments
      
      * comment
      
      * lint
      
      * lint
      
      * update pylint
      
      * fix as requested
      
      * lint
      
      * lint
      
      * lint
      
      * docstrings of python DGL kernel entries
      
      * disable lint warnings on arguments in kernel.py
      
      * fix docstring in scheduler
      
      * fix some bug in unittest; try again
      
      * Revert "Merge branch 'kernel' of github.com:zzhang-cn/dgl into kernel"
      
      This reverts commit 1d2299e68b004182ea6130b088de1f1122b18a49, reversing
      changes made to ddc97fbf1bec2b7815c0da7c74f7ecb2f428889b.
      
      * Revert "fix some bug in unittest; try again"
      
      This reverts commit ddc97fbf1bec2b7815c0da7c74f7ecb2f428889b.
      
      * more comprehensive kernel test
      
      * remove shape check in test_specialization
      653428bd
  17. 30 May, 2019 1 commit
    • [Test] Add cpp test to CI (#567) · 16af6ebd
      VoVAllen authored
      * add cpptest to ci
      
      * try windows
      
      * Update Jenkinsfile
      
      * Update Jenkinsfile
      
      * Update Jenkinsfile
      
      * Update task_cpp_unit_test.bat
      
      * Change localhost to 127.0.0.1
      
      * windows socket gtest
      
      * lint
      
      * Update task_cpp_unit_test.bat
      
      * fix batch script
      
      * export all symbols
      
      * Update Jenkinsfile
      
      * compile once
      
      * try mxnet compact
  18. 24 May, 2019 1 commit
  19. 08 Apr, 2019 1 commit
    • [Feature] Create shared memory graph store. (#468) · bfdd1eaa
      Da Zheng authored
      * accelerate gcn_ns.
      
      * add timing.
      
      * run infer with whole graph.
      
      * distributed gcn_ns.
      
      * reconstruct gcn_ns.
      
      * minor fix.
      
      * change graphsage_cv for numa.
      
      * fix #OMP threads.
      
      * accelerate graphsage_cv.
      
      * fix a weird bug.
      
      * add profiler in graphsage_cv.
      
      * accelerate graphsage_cv.
      
      manually aggregate neighbors' embeddings with pull.
      
      * load csr directly in gcn_ns_sc.
      
      * parallel sort for graph index.
      
      * Revert "parallel sort for graph index."
      
      This reverts commit 86fe2c7117fe5e56b0d481b39849c258b166945b.
      
      * run gcn_ns_sc on GPUs.
      
      * acc gcn_cv_sc.
      
      * change gcn_cv for numa.
      
      * fix gcn_cv to use numa and gpu.
      
      * improve graphsage_cv to use numa and gpu.
      
      * improve gcn_ns.
      
      * improve graphsage_cv.
      
      * init shared memory graph store.
      
      * fix.
      
      * enable init ndata.
      
      * improve tests.
      
      * add bidirectional communication.
      
      * link to rt.
      
      * fix compilation error.
      
      * fix shared memory init.
      
      * use MessageQueue for inter-process communication.
      
      * reconstruct immutable graph csr.
      
      * fix gcn.
      
      * load csr to shared memory.
      
      * fix minor bugs.
      
      * add comments.
      
      * refactor SharedMemory.
      
      * fix bugs in ImmutableGraph.
      
      * create CSR graph from shared memory.
      
      * add more test for loading a csr graph.
      
      * terminate graph store properly.
      
      * allow initializing ndata in the graph store server.
      
      * use RPC for inter-process communication.
      
      * a script for loading a graph.
      
      * allow customizing port.
      
      * list all ndata and edata.
      
      * support dtype.
      
      * reorganize SharedMemoryGraphStore.
      
      * fix ndata shape.
      
      * reconstruct gcn_ns.
      
      * print info.
      
      * set omp in gcn_ns.
      
      * reset sampling examples.
      
      * fix lint.
      
      * fix lint.
      
      * reset gcn.
      
      * disable shared memory in windows.
      
      * fix.
      
      * fix.
      
      * reset changes.
      
      * revert nodeflow changes.
      
      * fix cmake.
      
      * fix test.
      
      * fix test.
      
      * fix test.
      
      * fix test.
      
      * add comments.
      
      * fix test.
      
      * move vector out.
      
      * fix lint.
      
      * fix lint.
      
      * move SharedMemory.
      
      * update cmake.
      
      * update comment.
      
      * fix comments.
      
      * Revert "update cmake."
      
      This reverts commit 592445e37077f70a6e3f2e5245f9a3d086b04f3b.
      
      * update cmake.
      
      * add comments.
      
      * rename.
      
      * change the comment.
      
      * fix a bug.
      
      * rename.
      
      * add comments.
      
      * add comments.
      
      * add init_edata.
      
      * rewrite memory alloc.
      
      * move vector to CSR.
      
      * fix.
      
      * init data.
      
      * Revert "init data."
      
      This reverts commit 2b217b9553911b7dd84a9f1d9b68430b5aa18e23.
      
      * init data.
      
      * init new columns correctly.
  20. 29 Mar, 2019 1 commit
    • [RPC] add C++ RPC infrastructure and distributed sampler (#465) · 5d494c62
      Chao Ma authored
      * add C++ rpc infrastructure and distributed sampler
      
      * update
      
      * update lint
      
      * update lint
      
      * update lint
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update serialize and unittest
      
      * update serialize
      
      * lint
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update unittest
      
      * put Finalize() to __del__
      
      * update unittest
      
      * update
      
      * delete buffer in Finalize
      
      * update unittest
      
      * update unittest
      
      * update unittest
      
      * update unittest
      
      * update
      
      * update
      
      * fix small bug
      
      * windows socket impl
      
      * update API
      
      * fix bug in serialize
      
      * fix bug in serialize
      
      * set parent graph
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * fix lint
      
      * fix lint
      
      * fix
      
      * fix windows compilation error
      
      * fix windows error
      
      * change API to lower-case
      
      * update test
      
      * fix typo
      
      * update
      
      * add SamplerPool
      
      * add SamplerPool
      
      * update
      
      * update test
      
      * update
      
      * update
      
      * update
      
      * update
      
      * add example
      
      * update
      
      * update
  21. 20 Feb, 2019 1 commit
  22. 05 Dec, 2018 2 commits
  23. 17 Oct, 2018 1 commit
    • Efficient send_and_recv (#83) · 750e5037
      Lingfan Yu authored
      * python side send_and_recv shortcut
      
      * common util function for c apis
      
      * degree bucketing scheduler
      
      * scheduler c apis
      
      * cmake
      
      * python side logics
      
      * minor
      
      * fix multiple msg/red func cases
      
      * remove shell echo command in Jenkinsfile...
      
      * fuse apply for send_and_recv
      
      * fuse update func for update_all
      
      * typo...
      
      * handle anonymous
      
      * add degree bucketing for entire graph
      
      * degree bucketing executor
      
      * minor
      
      * style
      
      * rename graph_mapping to recv_nodes
      
      * fix some comments
  24. 25 Sep, 2018 1 commit
  25. 05 Sep, 2018 1 commit