"src/vscode:/vscode.git/clone" did not exist on "b09b152f77b02b4d72d88030593ac9ff7ffa3d81"
- 10 Aug, 2020 1 commit
-
-
Da Zheng authored
* fix perf. * fix. * accelerate metis. * fix lint. * use gklib. * fix perf. * fix. * update metis. * update launch script * handle synchronized API. * fix. * fix example. * fix dataloader. * temp fix. * temp fix omp. * distinguish roles. * initialize iterator of DistDataloader correctly. * check the correctness of launch script. * move feature copy to sampler. * measure mem/network copy time. * remove * Revert "measure mem/network copy time." This reverts commit 86cefdc14b7815fcf5aad6496af912dba48e4aa6. * fix. * fix * fix. * fix cmake. * disable metis in windows. * disable metis tests in windows. * remove test for multigraph. * fix test. * fix. * fix cmake. * fix. * revert. Co-authored-by:
Ubuntu <ubuntu@ip-172-31-19-115.us-west-2.compute.internal> Co-authored-by:
Ubuntu <ubuntu@ip-172-31-19-1.us-west-2.compute.internal>
-
- 08 Aug, 2020 1 commit
-
-
Da Zheng authored
* distinguish roles. * add comments. * fix lint. * move roles to server_state. * fix text. * fix tests. * fix tests. * Revert "fix tests." This reverts commit 5baa136b872a4550d4e612bfb1dfe363d7814adf.
-
- 05 Aug, 2020 1 commit
-
-
Jinjing Zhou authored
* 111 * 111 * fix * 111 * fix * 11 * fix * lint * Update __init__.py * lint * fix * lint * fix * fix * fix * fix * fix * try fix * try fix * fix * Revert "fix" This reverts commit a0b954fd4e99b7df92b53db8334dcb583d6e1551. * fixes. * fix. * fix test. * fix exit. * fix. * fix * fix * lint * lint * lint * fix * Update .gitignore * 111 * fix * 111 * 111 * fff * 1111 * 111 * 1325315 * ffff * f??? * fff * 1111 * 111 * fix * 111 * asda * 1111 * 11 * 123 * 啊啊啊啊啊啊啊啊啊啊啊啊啊啊啊啊 * spawn * 1231231 * up * 111 * fix * fix * Revert "fix" This reverts commit 7373f95312fdcaa36d2fc330bf242339e89c045d. * fix * fix * 1111 * fix * fix tests * start kvclient as early as possible. * lint * fix test * lint * 1111 * fix * fix * 111 * fix * fix * 1 * fix * fix * lint * fix * lint * lint * remove quit * fix * lint * fix * fix several * lint * fix minor * fix * lint Co-authored-by:
Da Zheng <zhengda1936@gmail.com> Co-authored-by:
xiang song(charlie.song) <classicxsong@gmail.com>
-
- 03 Aug, 2020 1 commit
-
-
Da Zheng authored
* client init graph on the backup servers. * fix. * test multi-server. * fix anonymous dist tensors. * check #parts. * fix init_data * add multi-server multi-client tests. * update tests in kvstore. * fix. * verify the loaded partition. * fix a bug. * fix lint. * fix. * fix example. * fix rpc. * fix pull/push handler for backup kvstore * fix example readme. * change ip. * update docstring. Co-authored-by:Ubuntu <ubuntu@ip-172-31-19-1.us-west-2.compute.internal>
-
- 31 Jul, 2020 1 commit
-
-
Da Zheng authored
* fix bugs. * eval on both vaidation and testing. * add script. * update. * update launch. * make train_dist.py independent. * update readme. * update readme. * update readme. * update readme. * generate undirected graph. * rename conf_file to part_config * use rsync * make train_dist independent. Co-authored-by:
Ubuntu <ubuntu@ip-172-31-19-1.us-west-2.compute.internal> Co-authored-by:
Ubuntu <ubuntu@ip-172-31-19-115.us-west-2.compute.internal> Co-authored-by:
xiang song(charlie.song) <classicxsong@gmail.com>
-
- 29 Jul, 2020 1 commit
-
-
Da Zheng authored
* fix tests in partition. * fix DistGraph. * fix without shared memory. * fix sampling. * enable distributed test. * fix tests. * fix a bug in shared-mem heterograph. * print better error messages. * fix. * don't specify formats. * fix. * fix * small fix.
-
- 27 Jul, 2020 2 commits
- 26 Jul, 2020 1 commit
-
-
Da Zheng authored
* update. * update Embedding. * add comments. * fix lint Co-authored-by:Chao Ma <mctt90@gmail.com>
-
- 22 Jul, 2020 1 commit
-
-
Da Zheng authored
* add eval. * extend DistTensor. * fix. * add barrier. * add more print. * add more checks in kvstore. * fix lint. * get all neighbors for eval. * reorganize. * fix. * fix. * fix. * fix test. * add reuse_if_exist. * add test for reuse_if_exist. * fix lint. * fix bugs. * fix. * print errors of tcp socket. * support delete tensors. * fix lint. * fix * fix example Co-authored-by:Ubuntu <ubuntu@ip-172-31-19-1.us-west-2.compute.internal>
-
- 20 Jul, 2020 1 commit
-
-
Chao Ma authored
* exit client * update * update * update * update * update * update * update * update test * update * update * update * update * update * update * update * update * update
-
- 15 Jul, 2020 1 commit
-
-
Da Zheng authored
* add standalone mode * add comments. * add tests for sampling. * fix. * make the code to run the standalone mode * fix * fix * fix readme. * fix. * fix test Co-authored-by:Chao Ma <mctt90@gmail.com>
-
- 14 Jul, 2020 1 commit
-
-
Da Zheng authored
* run dist server in dgl. * fix bugs. * fix example. * check environment variables and fix lint. * fix lint
-
- 03 Jul, 2020 2 commits
-
-
Da Zheng authored
* add sparse embedding. * fix * add test. * man fixes. * many fixes * fix sparse emb. * fix. * fix lint. * fix lint. * fix kvstore. * expose DistTensor. * test sparse embeddings. * add attach_grad to the backends. * remove part_id * fix. * move backward computation. * move more computation to backend. * fix a bug when applying learning rate. * fix a few things. * fix a few things. * add docstring * fix. * apply no_grad. * fix tests. * fix for other frameworks. * add examples in docstring.
-
- 02 Jul, 2020 1 commit
-
-
xiang song(charlie.song) authored
Co-authored-by:Ubuntu <ubuntu@ip-172-31-51-214.ec2.internal>
-
- 01 Jul, 2020 1 commit
-
-
Da Zheng authored
* fix. * fix tests. * fix * add tests. * fix. * have default rank. * add comment. * fix test. * remove check * simplify code. * add test. * split data evenly. * simplify the distributed training code. * add comments. * add comments.
-
- 18 Jun, 2020 1 commit
-
-
Da Zheng authored
* support no shared memory. * add test. * add CAPI to check existence of shared memory. * revert the change in src/runtime/ndarray.cc * update docstring. * fix compile.
-
- 15 Jun, 2020 1 commit
-
-
Da Zheng authored
* add reorder immutable graph. * add python API. * add reorder for csr. * remove gk version. * fix * add cpp test. * bug fixes * fix tests. * fix bugs and add check * fix test. * add omp. * add comments. * add coo reorder. * fix a bug. * handle reorder for different graph structues. * fix lint. * fix. * add original ids. * reshuffle nodes before metis partition. * inner nodes are in contiguous Id range. * reshuffle nodes/edges when partitioning. * load partition return graph partition book. * use inner_node/inner_edges * add and test range partition book. * count inner_edge correctly. * fix lint. * fix lint. * fix lint. * fix errors. * fix errors. * fix for TF. * fix. * fix. * change docstring. * support logical and. * add comments. * avoid copy. * fix * update docstring. * fix a bug. * add range search. * fix * fix a bug. * add more tests. * load graph partition book. * support shared memory for range partition book. * fix a bug. * fix. * fix lint. * remove check * fix test. * remove num_nodes and num_edges * fix lint. * fix graph partition book. * address comments. * use makedirs. * fix compile Co-authored-by:
xiang song(charlie.song) <classicxsong@gmail.com> Co-authored-by:
Chao Ma <mctt90@gmail.com>
-
- 11 Jun, 2020 3 commits
-
-
Jinjing Zhou authored
* update * update * update * update * update * update * fix ci using spawn Co-authored-by:aksnzhy <mctt90@gmail.com>
-
Chao Ma authored
* update * update * update * update * update * update * update * update
-
Da Zheng authored
* refactor graph store for new kvstore. * fix kvstore. * fix. * fix lint complains. * add docstring. * small fix in graph partition book. * fix. * remove tests * disable sampling. * Revert "disable sampling." This reverts commit 70451008f61ff1481d5dadbf8bd199470aee559d. * Revert "remove tests" This reverts commit 1394364243bdd73b669abde6193a74e2cda5521d.
-
- 14 May, 2020 1 commit
-
-
Da Zheng authored
* fix tests in graph partition book. * implement node_split and edge_split. * fix a bug. * add tests. * remove unnecessary code. * avoid storing graph structure in partition book. * add partition book in DistGraph. * Revert "avoid storing graph structure in partition book." This reverts commit 55b11fbf9293efcae5f8c97e93f1e121399dadae. * small fixes. * add a test for boolean mask vector. * fix test. * fix bugs. * make it work for all different cases. * fix tests. * fix boolean mask * fix for TF * fix for tensorflow. * fix test for TF * only support boolean mask for now. * fix tests. * make the code more readable. * fix test. Co-authored-by:
Chao Ma <mctt90@gmail.com> Co-authored-by:
Jinjing Zhou <VoVAllen@users.noreply.github.com>
-
- 03 May, 2020 1 commit
-
-
Da Zheng authored
* initial version from distributed training. This is copied from multiprocessing training. * modify for distributed training. * it's runnable now. * measure time in neighbor sampling. * simplify neighbor sampling. * fix a bug in distributed neighbor sampling. * allow single-machine training. * fix a bug. * fix a bug. * fix openmp. * make some improvement. * fix. * add prepare in the sampler. * prepare nodeflow async. * fix a bug. * get id. * simplify the code. * improve. * fix partition.py * fix the example. * add more features. * fix the example. * allow one partition * use distributed kvstore. * do g2l map manually. * fix commandline. * a temp script to save reddit. * fix pull_handler. * add pytorch version. * estimate the time for copying data. * delete unused code. * fix a bug. * print id. * fix a bug * fix a bug * fix a bug. * remove redundent code. * revert modify in sampler. * fix temp script. * remove pytorch version. * fix. * distributed training with pytorch. * add distributed graph store. * fix. * add metis_partition_assignment. * fix a few bugs in distributed graph store. * fix test. * fix bugs in distributed graph store. * fix tests. * remove code of defining DistGraphStore. * fix partition. * fix example. * update run.sh. * only read necessary node data. * batching data fetch of multiple NodeFlows. * simplify gcn. * remove unnecessary code. * use the new copy_from_kvstore. * update training script. * print time in graphsage. * make distributed training runnable. * use val_nid. * fix train_sampling. * add distributed training. * add run.sh * add more timing. * fix a bug. * save graph metadata when partition. * create ndata and edata in distributed graph store. * add timing in minibatch training of GraphSage. * use pytorch distributed. * add checks. * fix a bug in global vs. local ids. * remove fast pull * fix a compile error. * update and add new APIs. * implement more methods in DistGraphStore. * update more APIs. * rename it to DistGraph. * rename to DistTensor * remove some unnecessary API. * remove unnecessary files. * revert changes in sampler. * Revert "simplify gcn." This reverts commit 0ed3a34ca714203a5b45240af71555d4227ce452. * Revert "simplify neighbor sampling." This reverts commit 551c72d20f05a029360ba97f312c7a7a578aacec. * Revert "measure time in neighbor sampling." This reverts commit 63ae80c7b402bb626e24acbbc8fdfe9fffd0bc64. * Revert "add timing in minibatch training of GraphSage." This reverts commit e59dc8957a414c7df5c316f51d78bce822bdef5e. * Revert "fix train_sampling." This reverts commit ea6aea9a4aabb8ba0ff63070aa51e7ca81536ad9. * fix lint. * add comments and small update. * add more comments. * add more unit tests and fix bugs. * check the existence of shared-mem graph index. * use new partitioned graph storage. * fix bugs. * print error in fast pull. * fix lint * fix a compile error. * save absolute path after partitioning. * small fixes in the example * Revert "[kvstore] support any data type for init_data() (#1465)" This reverts commit 87b6997b . * fix a bug. * disable evaluation. * Revert "Revert "[kvstore] support any data type for init_data() (#1465)"" This reverts commit f5b8039c6326eb73bad8287db3d30d93175e5bee. * support set and init data. * support set and init data. * Revert "Revert "[kvstore] support any data type for init_data() (#1465)"" This reverts commit f5b8039c6326eb73bad8287db3d30d93175e5bee. * fix bugs. * fix unit test. * move to dgl.distributed. * fix lint. * fix lint. * remove local_nids. * fix lint. * fix test. * remove train_dist. * revert train_sampling. * rename funcs. * address comments. * address comments. Use NodeDataView/EdgeDataView to keep track of data. * address comments. * address comments. * revert. * save data with DGL serializer. * use the right way of getting shape. * fix lint. * address comments. * address comments. * fix an error in mxnet. * address comments. * add edge_map. * add more test and fix bugs. Co-authored-by:
Zheng <dzzhen@186590dc80ff.ant.amazon.com> Co-authored-by:
Ubuntu <ubuntu@ip-172-31-6-131.us-east-2.compute.internal> Co-authored-by:
Ubuntu <ubuntu@ip-172-31-26-167.us-east-2.compute.internal> Co-authored-by:
Ubuntu <ubuntu@ip-172-31-16-150.us-west-2.compute.internal> Co-authored-by:
Ubuntu <ubuntu@ip-172-31-16-250.us-west-2.compute.internal> Co-authored-by:
Ubuntu <ubuntu@ip-172-31-30-135.us-west-2.compute.internal>
-