1. 02 Jul, 2020 1 commit
    • Quan (Andy) Gan's avatar
      [Sampling] NodeDataLoader for node classification (#1635) · 168a88e5
      Quan (Andy) Gan authored
      
      
      * neighbor sampler data loader first commit
      
      * more commit
      
      * nodedataloader
      
      * fix
      
      * update RGCN example
      
      * update OGB
      
      * fixes
      
      * fix minibatch RGCN crashing with self loop
      
      * reverting gatconv test code
      
      * fix
      
      * change to new solution that doesn't require tf dataloader
      
      * fix
      
      * lint
      
      * fix
      
      * fixes
      
      * change doc
      
      * fix docstring
      
      * docstring fixes
      
      * return seeds and input nodes from data loader
      
      * fixes
      
      * fix test
      
      * fix windows build problem
      
      * add pytorch wrapper
      
      * fixes
      
      * add pytorch wrapper
      
      * add unit test
      
      * add -1 support to sample_neighbors & fix docstrings
      
      * docstring fix
      
      * lint
      
      * add minibatch rgcn evaluations
      Co-authored-by: default avatarxiang song(charlie.song) <classicxsong@gmail.com>
      Co-authored-by: default avatarTong He <hetong007@gmail.com>
      168a88e5
  2. 01 Jul, 2020 2 commits
  3. 28 Jun, 2020 2 commits
    • Da Zheng's avatar
      [Distributed] Pytorch example of distributed GraphSage. (#1495) · 02d31974
      Da Zheng authored
      
      
      * add train_dist.
      
      * Fix sampling example.
      
      * use distributed sampler.
      
      * fix a bug in DistTensor.
      
      * fix distributed training example.
      
      * add graph partition.
      
      * add command
      
      * disable pytorch parallel.
      
      * shutdown correctly.
      
      * load diff graphs.
      
      * add ip_config.txt.
      
      * record timing for each step.
      
      * use ogb
      
      * add profiler.
      
      * fix a bug.
      
      * add train_dist.
      
      * Fix sampling example.
      
      * use distributed sampler.
      
      * fix a bug in DistTensor.
      
      * fix distributed training example.
      
      * add graph partition.
      
      * add command
      
      * disable pytorch parallel.
      
      * shutdown correctly.
      
      * load diff graphs.
      
      * add ip_config.txt.
      
      * record timing for each step.
      
      * use ogb
      
      * add profiler.
      
      * add Ips of the cluster.
      
      * fix exit.
      
      * support multiple clients.
      
      * balance node types and edges.
      
      * move code.
      
      * remove run.sh
      
      * Revert "support multiple clients."
      
      * fix.
      
      * update train_sampling.
      
      * fix.
      
      * fix
      
      * remove run.sh
      
      * update readme.
      
      * update readme.
      
      * use pytorch distributed.
      
      * ensure all trainers run the same number of steps.
      
      * Update README.md
      Co-authored-by: default avatarUbuntu <ubuntu@ip-172-31-16-250.us-west-2.compute.internal>
      02d31974
    • Hao Xiong's avatar
      [Example] Experimental results on ogbl-citation & ogbl-ppa (#1711) · 7ee72b66
      Hao Xiong authored
      
      
      * ogb-deepwalk
      
      * update readme
      
      * update readme
      
      * update readme
      
      * update readme
      
      * ogbl-ddi
      
      * readme
      
      * citation&ppa
      Co-authored-by: default avatarxiang song(charlie.song) <classicxsong@gmail.com>
      7ee72b66
  4. 27 Jun, 2020 1 commit
  5. 25 Jun, 2020 1 commit
  6. 22 Jun, 2020 1 commit
    • Tong He's avatar
      [Model] PointNet and PointNet++ for point cloud (#1510) · 558673e1
      Tong He authored
      
      
      * commit patch
      
      * commit patch
      
      * pointnet basic
      
      * fix data
      
      * reorg
      
      * reorg
      
      * temp status
      
      * remove validate set
      
      * add partseg data and model
      
      * partseg miou
      
      * clean up
      
      * fix loss
      
      * network definition match paper
      
      * fix
      
      * fix miou
      
      * update data format
      
      * fix
      
      * fix
      
      * working pointnet ssg cls
      
      * avoid some pytorch bug
      
      * fix script
      
      * update hyperparams
      
      * add msg module
      
      * try different dataset
      
      * update new dataset info
      
      * quick fix to subgraph
      
      * fix speed
      
      * update training
      
      * update
      
      * fix bs
      
      * update docstring
      
      * update
      
      * update
      
      * remove parallel reduction in fps
      
      * switch to kernel fps, training is 30% faster
      Co-authored-by: default avatarUbuntu <ubuntu@ip-172-31-20-181.us-west-2.compute.internal>
      558673e1
  7. 20 Jun, 2020 1 commit
  8. 15 Jun, 2020 1 commit
  9. 10 Jun, 2020 1 commit
  10. 07 Jun, 2020 1 commit
  11. 05 Jun, 2020 1 commit
  12. 04 Jun, 2020 1 commit
  13. 03 Jun, 2020 1 commit
  14. 31 May, 2020 1 commit
  15. 27 May, 2020 1 commit
  16. 17 May, 2020 2 commits
  17. 15 May, 2020 1 commit
  18. 14 May, 2020 1 commit
  19. 12 May, 2020 2 commits
  20. 08 May, 2020 1 commit
  21. 06 May, 2020 1 commit
    • Linfang He's avatar
      [Model] GATNE-T (#1470) · 6ae440db
      Linfang He authored
      
      
      * Add an example for GATNE-T
      
      * Update README.md
      
      * Add links for datasets
      
      * Update README.md with running results
      
      * Update README.md
      
      * Add comments
      
      * Update main.py
      
      * Remove node type name `user`
      Co-authored-by: default avatarDa Zheng <zhengda1936@gmail.com>
      6ae440db
  22. 03 May, 2020 1 commit
    • Da Zheng's avatar
      [Feature] Distributed graph store (#1383) · 2190c39d
      Da Zheng authored
      * initial version from distributed training.
      
      This is copied from multiprocessing training.
      
      * modify for distributed training.
      
      * it's runnable now.
      
      * measure time in neighbor sampling.
      
      * simplify neighbor sampling.
      
      * fix a bug in distributed neighbor sampling.
      
      * allow single-machine training.
      
      * fix a bug.
      
      * fix a bug.
      
      * fix openmp.
      
      * make some improvement.
      
      * fix.
      
      * add prepare in the sampler.
      
      * prepare nodeflow async.
      
      * fix a bug.
      
      * get id.
      
      * simplify the code.
      
      * improve.
      
      * fix partition.py
      
      * fix the example.
      
      * add more features.
      
      * fix the example.
      
      * allow one partition
      
      * use distributed kvstore.
      
      * do g2l map manually.
      
      * fix commandline.
      
      * a temp script to save reddit.
      
      * fix pull_handler.
      
      * add pytorch version.
      
      * estimate the time for copying data.
      
      * delete unused code.
      
      * fix a bug.
      
      * print id.
      
      * fix a bug
      
      * fix a bug
      
      * fix a bug.
      
      * remove redundent code.
      
      * revert modify in sampler.
      
      * fix temp script.
      
      * remove pytorch version.
      
      * fix.
      
      * distributed training with pytorch.
      
      * add distributed graph store.
      
      * fix.
      
      * add metis_partition_assignment.
      
      * fix a few bugs in distributed graph store.
      
      * fix test.
      
      * fix bugs in distributed graph store.
      
      * fix tests.
      
      * remove code of defining DistGraphStore.
      
      * fix partition.
      
      * fix example.
      
      * update run.sh.
      
      * only read necessary node data.
      
      * batching data fetch of multiple NodeFlows.
      
      * simplify gcn.
      
      * remove unnecessary code.
      
      * use the new copy_from_kvstore.
      
      * update training script.
      
      * print time in graphsage.
      
      * make distributed training runnable.
      
      * use val_nid.
      
      * fix train_sampling.
      
      * add distributed training.
      
      * add run.sh
      
      * add more timing.
      
      * fix a bug.
      
      * save graph metadata when partition.
      
      * create ndata and edata in distributed graph store.
      
      * add timing in minibatch training of GraphSage.
      
      * use pytorch distributed.
      
      * add checks.
      
      * fix a bug in global vs. local ids.
      
      * remove fast pull
      
      * fix a compile error.
      
      * update and add new APIs.
      
      * implement more methods in DistGraphStore.
      
      * update more APIs.
      
      * rename it to DistGraph.
      
      * rename to DistTensor
      
      * remove some unnecessary API.
      
      * remove unnecessary files.
      
      * revert changes in sampler.
      
      * Revert "simplify gcn."
      
      This reverts commit 0ed3a34ca714203a5b45240af71555d4227ce452.
      
      * Revert "simplify neighbor sampling."
      
      This reverts commit 551c72d20f05a029360ba97f312c7a7a578aacec.
      
      * Revert "measure time in neighbor sampling."
      
      This reverts commit 63ae80c7b402bb626e24acbbc8fdfe9fffd0bc64.
      
      * Revert "add timing in minibatch training of GraphSage."
      
      This reverts commit e59dc8957a414c7df5c316f51d78bce822bdef5e.
      
      * Revert "fix train_sampling."
      
      This reverts commit ea6aea9a4aabb8ba0ff63070aa51e7ca81536ad9.
      
      * fix lint.
      
      * add comments and small update.
      
      * add more comments.
      
      * add more unit tests and fix bugs.
      
      * check the existence of shared-mem graph index.
      
      * use new partitioned graph storage.
      
      * fix bugs.
      
      * print error in fast pull.
      
      * fix lint
      
      * fix a compile error.
      
      * save absolute path after partitioning.
      
      * small fixes in the example
      
      * Revert "[kvstore] support any data type for init_data() (#1465)"
      
      This reverts commit 87b6997b
      
      .
      
      * fix a bug.
      
      * disable evaluation.
      
      * Revert "Revert "[kvstore] support any data type for init_data() (#1465)""
      
      This reverts commit f5b8039c6326eb73bad8287db3d30d93175e5bee.
      
      * support set and init data.
      
      * support set and init data.
      
      * Revert "Revert "[kvstore] support any data type for init_data() (#1465)""
      
      This reverts commit f5b8039c6326eb73bad8287db3d30d93175e5bee.
      
      * fix bugs.
      
      * fix unit test.
      
      * move to dgl.distributed.
      
      * fix lint.
      
      * fix lint.
      
      * remove local_nids.
      
      * fix lint.
      
      * fix test.
      
      * remove train_dist.
      
      * revert train_sampling.
      
      * rename funcs.
      
      * address comments.
      
      * address comments.
      
      Use NodeDataView/EdgeDataView to keep track of data.
      
      * address comments.
      
      * address comments.
      
      * revert.
      
      * save data with DGL serializer.
      
      * use the right way of getting shape.
      
      * fix lint.
      
      * address comments.
      
      * address comments.
      
      * fix an error in mxnet.
      
      * address comments.
      
      * add edge_map.
      
      * add more test and fix bugs.
      Co-authored-by: default avatarZheng <dzzhen@186590dc80ff.ant.amazon.com>
      Co-authored-by: default avatarUbuntu <ubuntu@ip-172-31-6-131.us-east-2.compute.internal>
      Co-authored-by: default avatarUbuntu <ubuntu@ip-172-31-26-167.us-east-2.compute.internal>
      Co-authored-by: default avatarUbuntu <ubuntu@ip-172-31-16-150.us-west-2.compute.internal>
      Co-authored-by: default avatarUbuntu <ubuntu@ip-172-31-16-250.us-west-2.compute.internal>
      Co-authored-by: default avatarUbuntu <ubuntu@ip-172-31-30-135.us-west-2.compute.internal>
      2190c39d
  23. 28 Apr, 2020 2 commits
  24. 26 Apr, 2020 1 commit
  25. 24 Apr, 2020 1 commit
  26. 13 Apr, 2020 1 commit
  27. 09 Apr, 2020 1 commit
  28. 27 Mar, 2020 1 commit
    • Minjie Wang's avatar
      [NN] Add HeteroGraphConv module for cleaner module definition (#1385) · 3efb5d8e
      Minjie Wang authored
      * Add HeteroGraphConv
      
      * add custom aggregator; some docstring
      
      * debugging
      
      * rm print
      
      * fix some acc bugs
      
      * fix initialization problem in weight basis
      
      * passed tests
      
      * lint
      
      * fix graphconv flag; add error message
      
      * add mxnet heteroconv
      
      * more fix for mx
      
      * lint
      
      * fix torch cuda test
      
      * fix mx test_nn
      
      * add exhaust test for graphconv
      
      * add tf heteroconv
      
      * fix comment
      3efb5d8e
  29. 20 Mar, 2020 1 commit
  30. 18 Mar, 2020 1 commit
  31. 16 Mar, 2020 1 commit
    • Minjie Wang's avatar
      [Model] RGCN mini-batch training; Better bipartite graph support (#1337) · 6111ea46
      Minjie Wang authored
      * change the model to use node embedding only
      
      * minibatch training
      
      * add readme
      
      * small fix
      
      * backward-compatible
      
      * backward compatible
      
      * modify to_block; rgcn changed
      
      * fix
      
      * fix transform
      
      * fix bug in unittest script
      
      * docstring
      
      * fix lint
      
      * add tests
      
      * address comments; fix offline eval
      
      * gitignore
      6111ea46
  32. 15 Mar, 2020 1 commit
  33. 13 Mar, 2020 1 commit
  34. 10 Mar, 2020 1 commit
  35. 07 Mar, 2020 1 commit
    • Quan (Andy) Gan's avatar
      [Model][Sampler] GraphSAGE model, bipartite graph conversion & remove edges API (#1297) · a9520f71
      Quan (Andy) Gan authored
      * remove edge and to bipartite and graphsage with sampling
      
      * fixes
      
      * fixes
      
      * fixes
      
      * reenable multigpu training
      
      * fixes
      
      * compatibility in DGLGraph
      
      * rename to compact_as_bipartite
      
      * bugfix
      
      * lint
      
      * add offline inference
      
      * skip GPU tests
      
      * fix
      
      * addresses comments
      
      * fix
      
      * fix
      
      * fix
      
      * more tests
      
      * more docs and unit tests
      
      * workaround for empty slice on empty data
      a9520f71