1. 11 Aug, 2020 1 commit
    • Chao Ma's avatar
      [Distributed] Remove server_count from ip_config.txt (#1985) · d340ea3a
      Chao Ma authored
      
      
      * remove server_count from ip_config.txt
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * lint
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * Update dist_context.py
      
      * fix lint.
      
      * make it work for multiple spaces.
      
      * update ip_config.txt.
      
      * fix examples.
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * udpate
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      Co-authored-by: default avatarDa Zheng <zhengda1936@gmail.com>
      Co-authored-by: default avatarUbuntu <ubuntu@ip-172-31-19-1.us-west-2.compute.internal>
      d340ea3a
  2. 10 Aug, 2020 1 commit
  3. 08 Aug, 2020 1 commit
  4. 05 Aug, 2020 1 commit
    • Jinjing Zhou's avatar
      [Distributed] DistDataloader (#1901) · 4f499c7f
      Jinjing Zhou authored
      
      
      * 111
      
      * 111
      
      * fix
      
      * 111
      
      * fix
      
      * 11
      
      * fix
      
      * lint
      
      * Update __init__.py
      
      * lint
      
      * fix
      
      * lint
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * try fix
      
      * try fix
      
      * fix
      
      * Revert "fix"
      
      This reverts commit a0b954fd4e99b7df92b53db8334dcb583d6e1551.
      
      * fixes.
      
      * fix.
      
      * fix test.
      
      * fix exit.
      
      * fix.
      
      * fix
      
      * fix
      
      * lint
      
      * lint
      
      * lint
      
      * fix
      
      * Update .gitignore
      
      * 111
      
      * fix
      
      * 111
      
      * 111
      
      * fff
      
      * 1111
      
      * 111
      
      * 1325315
      
      * ffff
      
      * f???
      
      * fff
      
      * 1111
      
      * 111
      
      * fix
      
      * 111
      
      * asda
      
      * 1111
      
      * 11
      
      * 123
      
      * 啊啊啊啊啊啊啊啊啊啊啊啊啊啊啊啊
      
      * spawn
      
      * 1231231
      
      * up
      
      * 111
      
      * fix
      
      * fix
      
      * Revert "fix"
      
      This reverts commit 7373f95312fdcaa36d2fc330bf242339e89c045d.
      
      * fix
      
      * fix
      
      * 1111
      
      * fix
      
      * fix tests
      
      * start kvclient as early as possible.
      
      * lint
      
      * fix test
      
      * lint
      
      * 1111
      
      * fix
      
      * fix
      
      * 111
      
      * fix
      
      * fix
      
      * 1
      
      * fix
      
      * fix
      
      * lint
      
      * fix
      
      * lint
      
      * lint
      
      * remove quit
      
      * fix
      
      * lint
      
      * fix
      
      * fix several
      
      * lint
      
      * fix minor
      
      * fix
      
      * lint
      Co-authored-by: default avatarDa Zheng <zhengda1936@gmail.com>
      Co-authored-by: default avatarxiang song(charlie.song) <classicxsong@gmail.com>
      4f499c7f
  5. 03 Aug, 2020 1 commit
    • Da Zheng's avatar
      [Distributed] Support multiple servers (#1886) · a4c931a9
      Da Zheng authored
      
      
      * client init graph on the backup servers.
      
      * fix.
      
      * test multi-server.
      
      * fix anonymous dist tensors.
      
      * check #parts.
      
      * fix init_data
      
      * add multi-server multi-client tests.
      
      * update tests in kvstore.
      
      * fix.
      
      * verify the loaded partition.
      
      * fix a bug.
      
      * fix lint.
      
      * fix.
      
      * fix example.
      
      * fix rpc.
      
      * fix pull/push handler for backup kvstore
      
      * fix example readme.
      
      * change ip.
      
      * update docstring.
      Co-authored-by: default avatarUbuntu <ubuntu@ip-172-31-19-1.us-west-2.compute.internal>
      a4c931a9
  6. 01 Aug, 2020 1 commit
  7. 31 Jul, 2020 1 commit
  8. 27 Jul, 2020 1 commit
  9. 22 Jul, 2020 1 commit
  10. 20 Jul, 2020 1 commit
    • Chao Ma's avatar
      [RPC] Rpc exit with explicit invocation (#1825) · 5c92f6c2
      Chao Ma authored
      * exit client
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update test
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      5c92f6c2
  11. 16 Jul, 2020 1 commit
    • Chao Ma's avatar
      [Distributed] Distributed launching script (#1772) · ca9d3216
      Chao Ma authored
      
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * fix launch script.
      Co-authored-by: default avatarDa Zheng <zhengda1936@gmail.com>
      ca9d3216
  12. 15 Jul, 2020 1 commit
  13. 14 Jul, 2020 1 commit
  14. 02 Jul, 2020 1 commit
    • Quan (Andy) Gan's avatar
      [Sampling] NodeDataLoader for node classification (#1635) · 168a88e5
      Quan (Andy) Gan authored
      
      
      * neighbor sampler data loader first commit
      
      * more commit
      
      * nodedataloader
      
      * fix
      
      * update RGCN example
      
      * update OGB
      
      * fixes
      
      * fix minibatch RGCN crashing with self loop
      
      * reverting gatconv test code
      
      * fix
      
      * change to new solution that doesn't require tf dataloader
      
      * fix
      
      * lint
      
      * fix
      
      * fixes
      
      * change doc
      
      * fix docstring
      
      * docstring fixes
      
      * return seeds and input nodes from data loader
      
      * fixes
      
      * fix test
      
      * fix windows build problem
      
      * add pytorch wrapper
      
      * fixes
      
      * add pytorch wrapper
      
      * add unit test
      
      * add -1 support to sample_neighbors & fix docstrings
      
      * docstring fix
      
      * lint
      
      * add minibatch rgcn evaluations
      Co-authored-by: default avatarxiang song(charlie.song) <classicxsong@gmail.com>
      Co-authored-by: default avatarTong He <hetong007@gmail.com>
      168a88e5
  15. 01 Jul, 2020 1 commit
  16. 28 Jun, 2020 1 commit
    • Da Zheng's avatar
      [Distributed] Pytorch example of distributed GraphSage. (#1495) · 02d31974
      Da Zheng authored
      
      
      * add train_dist.
      
      * Fix sampling example.
      
      * use distributed sampler.
      
      * fix a bug in DistTensor.
      
      * fix distributed training example.
      
      * add graph partition.
      
      * add command
      
      * disable pytorch parallel.
      
      * shutdown correctly.
      
      * load diff graphs.
      
      * add ip_config.txt.
      
      * record timing for each step.
      
      * use ogb
      
      * add profiler.
      
      * fix a bug.
      
      * add train_dist.
      
      * Fix sampling example.
      
      * use distributed sampler.
      
      * fix a bug in DistTensor.
      
      * fix distributed training example.
      
      * add graph partition.
      
      * add command
      
      * disable pytorch parallel.
      
      * shutdown correctly.
      
      * load diff graphs.
      
      * add ip_config.txt.
      
      * record timing for each step.
      
      * use ogb
      
      * add profiler.
      
      * add Ips of the cluster.
      
      * fix exit.
      
      * support multiple clients.
      
      * balance node types and edges.
      
      * move code.
      
      * remove run.sh
      
      * Revert "support multiple clients."
      
      * fix.
      
      * update train_sampling.
      
      * fix.
      
      * fix
      
      * remove run.sh
      
      * update readme.
      
      * update readme.
      
      * use pytorch distributed.
      
      * ensure all trainers run the same number of steps.
      
      * Update README.md
      Co-authored-by: default avatarUbuntu <ubuntu@ip-172-31-16-250.us-west-2.compute.internal>
      02d31974