1. 20 May, 2019 1 commit
    • Da Zheng's avatar
      [BUGFix] Improve multi-processing training (#526) · cdfca992
      Da Zheng authored
      * fix.
      
      * add comment.
      
      * remove.
      
      * temp fix.
      
      * initialize for shared memory.
      
      * fix graphsage.
      
      * fix gcn.
      
      * add more unit tests.
      
      * add more tests.
      
      * avoid creating shared-memory exclusively.
      
      * redefine remote initializer.
      
      * improve initializer.
      
      * fix unit test.
      
      * fix lint.
      
      * fix lint.
      
      * initialize data in the graph store server properly.
      
      * fix test.
      
      * fix test.
      
      * fix test.
      
      * small fix.
      
      * add comments.
      
      * cleanup server.
      
      * test graph store with a random port.
      
      * print.
      
      * print to stderr.
      
      * test1
      
      * test2
      
      * remove comment.
      
      * adjust the initializer signature.
      cdfca992
  2. 07 May, 2019 1 commit
    • Da Zheng's avatar
      [Model] add multiprocessing training with sampling. (#484) · 3a1392e6
      Da Zheng authored
      * reorganize sampling code.
      
      * add multi-process training.
      
      * speed up gcn_cv
      
      * fix graphsage_cv.
      
      * add new API in graph store.
      
      * update barrier impl.
      
      * support both local and distributed training.
      
      * fix multiprocess train.
      
      * fix.
      
      * fix barrier.
      
      * add script for loading data.
      
      * multiprocessing sampling.
      
      * accel training.
      
      * replace pull with spmv for speedup.
      
      * nodeflow copy from parent with context.
      
      * enable GPU.
      
      * fix a bug in graph store.
      
      * enable multi-GPU training.
      
      * fix lint.
      
      * add comments.
      
      * rename to run_store_server.py
      
      * fix gcn_cv.
      
      * fix a minor bug in sampler.
      
      * handle error better in graph store.
      
      * improve graphsage_cv for distributed mode.
      
      * update README.
      
      * fix.
      
      * update.
      3a1392e6
  3. 16 Apr, 2019 1 commit
  4. 08 Apr, 2019 1 commit
    • Da Zheng's avatar
      [Feature] Create shared memory graph store. (#468) · bfdd1eaa
      Da Zheng authored
      * accelerate gcn_ns.
      
      * add timing.
      
      * run infer with whole graph.
      
      * distributed gcn_ns.
      
      * reconstruct gcn_ns.
      
      * minor fix.
      
      * change graphsage_cv for numa.
      
      * fix #OMP threads.
      
      * accelerate graphsage_cv.
      
      * fix a weird bug.
      
      * add profiler in graphsage_cv.
      
      * accelerate graphsage_cv.
      
      manually aggregate neighbors' embeddings with pull.
      
      * load csr directly in gcn_ns_sc.
      
      * parallel sort for graph index.
      
      * Revert "parallel sort for graph index."
      
      This reverts commit 86fe2c7117fe5e56b0d481b39849c258b166945b.
      
      * run gcn_ns_sc on GPUs.
      
      * acc gcn_cv_sc.
      
      * change gcn_cv for numa.
      
      * fix gcn_cv to use numa and gpu.
      
      * improve graphsage_cv to use numa and gpu.
      
      * improve gcn_ns.
      
      * improve graphsage_cv.
      
      * init shared memory graph store.
      
      * fix.
      
      * enable init ndata.
      
      * improve tests.
      
      * add bidirectional communication.
      
      * link to rt.
      
      * fix compilation error.
      
      * fix shared memory init.
      
      * use MessageQueue for inter-process communication.
      
      * reconstruct immutable graph csr.
      
      * fix gcn.
      
      * load csr to shared memory.
      
      * fix minor bugs.
      
      * add comments.
      
      * refactor SharedMemory.
      
      * fix bugs in ImmutableGraph.
      
      * create CSR graph from shared memory.
      
      * add more test for loading a csr graph.
      
      * terminate graph store properly.
      
      * allow initializing ndata in the graph store server.
      
      * use RPC for inter-process communication.
      
      * a script for loading a graph.
      
      * allow customizing port.
      
      * list all ndata and edata.
      
      * support dtype.
      
      * reorganize SharedMemoryGraphStore.
      
      * fix ndata shape.
      
      * reconstruct gcn_ns.
      
      * print info.
      
      * set omp in gcn_ns.
      
      * reset sampling examples.
      
      * fix lint.
      
      * fix lint.
      
      * reset gcn.
      
      * disable shared memory in windows.
      
      * fix.
      
      * fix.
      
      * reset changes.
      
      * revert nodeflow changes.
      
      * fix cmake.
      
      * fix test.
      
      * fix test.
      
      * fix test.
      
      * fix test.
      
      * add comments.
      
      * fix test.
      
      * move vector out.
      
      * fix lint.
      
      * fix lint.
      
      * move SharedMemory.
      
      * update cmake.
      
      * update comment.
      
      * fix comments.
      
      * Revert "update cmake."
      
      This reverts commit 592445e37077f70a6e3f2e5245f9a3d086b04f3b.
      
      * update cmake.
      
      * add comments.
      
      * rename.
      
      * change the comment.
      
      * fix a bug.
      
      * rename.
      
      * add comments.
      
      * add comments.
      
      * add init_edata.
      
      * rewrite memory alloc.
      
      * move vector to CSR.
      
      * fix.
      
      * init data.
      
      * Revert "init data."
      
      This reverts commit 2b217b9553911b7dd84a9f1d9b68430b5aa18e23.
      
      * init data.
      
      * init new columns correctly.
      bfdd1eaa