- 10 Dec, 2021 1 commit
-
-
guoshzhao authored
**Description** Add ONNXRuntime inference benchmark based on ORT python API. **Major Revision** - Add `ORTInferenceBenchmark` class to export pytorch model to onnx model and do inference - Add tests and example for `ort-inference` benchmark - Update the introduction docs.
-
- 30 Oct, 2021 1 commit
-
-
Ziyue Yang authored
**Description** This commit does the following: 1) Adds CPU-initiated copy benchmark; 2) Adds dtod benchmark; 3) Support scanning NUMA nodes and GPUs inside the benchmark program; 4) Change the name of gpu-sm-copy to gpu-copy.
-
- 29 Oct, 2021 1 commit
-
-
Ziyue Yang authored
**Description** This commit fixes the URL of ROCm GPG file.
-
- 21 Oct, 2021 1 commit
-
-
Yuting Jiang authored
**Description** Add gpcnet as git submodule and building logic. **Major Revision** - add gpcnet as a submodule - add build logic in third_party/Makefile
-
- 02 Sep, 2021 1 commit
-
-
Yifan Xiong authored
__Description__ Resolve "too many open files" issue when runnning NCCL/RCCL on multiple nodes using Docker images, increase nofile number in limits.conf.
-
- 01 Sep, 2021 2 commits
- 31 Aug, 2021 1 commit
-
-
guoshzhao authored
**Description** Add dockerfile `rocm4.0-pytorch1.7.0.dockerfile` and `rocm4.2-pytorch1.7.0.dockerfile` for `rocm` platform.
-