- 29 Jul, 2021 3 commits
-
-
Yuting Jiang authored
**Description** Support rocm in third_party/Makefile and add rccl-tests as a submodule with build logic.

**Major Revision**
- Support rocm in third_party/Makefile
- Add rccl-tests as a submodule
- Add build logic in third_party/Makefile for rccl-tests
-
Yifan Xiong authored
__Description__ Cherry-pick bug fixes from v0.2.1 to main.

__Major Revisions__
* Fix bug where VGG models failed on A100 GPU with batch_size=128.
* Fix Ansible connection issue when running on localhost.
* Update version in packages and docs.
-
Yuting Jiang authored
**Description** Support rocm in third_party/makefile. **Major Revision** - Split rocm and cuda target in makefile - Add target in dockerfile
-
- 27 Jul, 2021 2 commits
-
-
Yuting Jiang authored
**Description** Add the source code of rocm kernel launch overhead benchmark. **Major Revision** - Revise cmake build logic to support both cuda and rocm
-
Yuting Jiang authored
**Description** Support rocm cmake build. **Major Revision** - Add some envs in rocm_common.cmake to support rocm cmake build.
-
- 26 Jul, 2021 1 commit
-
-
Yuting Jiang authored
**Description** Add NCCL performance microbenchmark. **Major Revision** - Add microbenchmark, example, test, config for NCCL
-
- 23 Jul, 2021 2 commits
-
-
Yuting Jiang authored
**Description** Add RDMA Loopback performance microbenchmark. **Major Revision** - Add microbenchmark, example, test, config for RDMA Loopback
-
Ziyue Yang authored
**Description** Add disk performance microbenchmark. **Major Revision** - Add microbenchmark, example, test, config for disk performance. **Minor Revision** - Fix bugs in executor unit test related to default enabled tests.
-
- 21 Jul, 2021 1 commit
-
-
TobeyQin authored
**Description** Add result contributing rules
-
- 20 Jul, 2021 2 commits
-
-
TobeyQin authored
**Description** Add release process document.
-
Ziyue Yang authored
**Description** Fix bug of tests/executor/test_executor.py. **Major Revision** - Test default enabled benchmarks only instead of all benchmarks.
-
- 19 Jul, 2021 1 commit
-
-
Ziyue Yang authored
**Description** Add FIO benchmark tool into third-party dependency. **Major Revision** - Add FIO submodule into third-party directory and modify Makefile to enable it.
-
- 16 Jul, 2021 2 commits
-
-
Yuting Jiang authored
Add perftest as a submodule and add build logic
-
Yuting Jiang authored
Benchmarks: Build Pipeline - Add nccl-tests as a submodule and add build logic.
-
- 15 Jul, 2021 1 commit
-
-
Yuting Jiang authored
Benchmarks: Fix bug - Fix the third_party/cuda-samples git checkout issue when building the Docker image (#126) * Fix bug in Docker build of third_party/cuda-samples
-
- 13 Jul, 2021 2 commits
-
-
Yuting Jiang authored
Add microbenchmark, example, test, and config for CUDA memory performance. Add cuda-samples (tagged with the CUDA version) as a git submodule and update the related Makefile.
-
Yuting Jiang authored
Update network common utils. Add get_ib_devices in network common utils and move get_free_port from test utils to network common utils
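
For readers unfamiliar with this kind of helper, a free-port lookup is commonly written by binding a socket to port 0 and reading back the port the OS assigned. The sketch below is only an illustration of that common technique under that assumption; it is not taken from the repository, and the actual `get_free_port` in the network common utils may differ in signature and behavior.

```python
import socket


def get_free_port() -> int:
    """Illustrative sketch only: ask the OS for a currently unused TCP port."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        # Binding to port 0 lets the kernel pick any free ephemeral port.
        sock.bind(("127.0.0.1", 0))
        # getsockname() returns (host, port) for an AF_INET socket.
        return sock.getsockname()[1]


port = get_free_port()
print(f"free port: {port}")
```

The port is only guaranteed free at the moment of the call; another process could claim it before it is reused, so callers typically bind to it promptly.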
-
- 09 Jul, 2021 2 commits
-
-
guoshzhao authored
* Bug Fix - Fix race condition issue for multiple ranks (#117). Fix race condition issue when multiple ranks rotate the same directory.
* Update pipeline for release branch (#122)
* Bug Fix - Fix bug when converting bool config to store_true argument. (#120)

Co-authored-by: Yifan Xiong <yifan.xiong@microsoft.com>
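
To illustrate the kind of conversion the store_true fix concerns, here is a minimal sketch of turning a config dictionary into command-line arguments. The function name `config_to_args` and the parameter names `no_gpu` and `batch_size` are hypothetical and chosen only for this example; the project's real conversion code is not shown in this log. The key point is that an `argparse` flag declared with `action="store_true"` takes no value, so a `True` config entry must emit the bare flag and a `False` entry must emit nothing.

```python
import argparse


def config_to_args(params: dict) -> list:
    """Hypothetical helper: convert config key/value pairs to CLI arguments."""
    args = []
    for key, value in params.items():
        if isinstance(value, bool):
            # store_true flags take no value: emit the flag only when True.
            if value:
                args.append(f"--{key}")
        else:
            args.extend([f"--{key}", str(value)])
    return args


# Hypothetical parser with one store_true flag and one valued option.
parser = argparse.ArgumentParser()
parser.add_argument("--no_gpu", action="store_true")
parser.add_argument("--batch_size", type=int, default=32)

parsed = parser.parse_args(config_to_args({"no_gpu": True, "batch_size": 128}))
print(parsed.no_gpu, parsed.batch_size)
```

Passing `--no_gpu True` instead would make `argparse` reject the stray `True` token, which is the class of error this style of conversion avoids.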
-
guoshzhao authored
* add validation config file for AMD MI100
-
- 08 Jul, 2021 1 commit
-
-
Yifan Xiong authored
Support both NVIDIA and AMD GPUs and check GPU vendor during deployment and execution. * Add GPU environment check in sb deploy. * Check GPU vendor in executor.
-
- 02 Jul, 2021 2 commits
-
-
Yifan Xiong authored
Update README and version for v0.2 release.
-
Yifan Xiong authored
Fetch benchmark results from all nodes; results are rsynced after each benchmark. The results directory structure on the control node is as follows:
```
outputs/
└── datetime
    ├── nodes
    │   └── node-0
    │       ├── benchmarks
    │       │   ├── benchmark-0
    │       │   │   ├── rank-0
    │       │   │   │   └── results.json
    │       └── sb-exec.log
    ├── sb-run.log
    └── sb.config.yaml
```
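
Given the layout described in this commit message, the per-rank result files can be gathered with a single glob. The following is a minimal sketch, not part of the project: the function name `collect_results` is hypothetical, and it assumes only that each `results.json` holds valid JSON, without assuming anything about its schema.

```python
import json
from pathlib import Path


def collect_results(run_dir):
    """Hypothetical helper: load every results.json under one run directory.

    Returns a dict keyed by (node, benchmark, rank) directory names.
    """
    results = {}
    # Mirrors the layout: <run_dir>/nodes/<node>/benchmarks/<benchmark>/<rank>/results.json
    pattern = "nodes/*/benchmarks/*/rank-*/results.json"
    for path in sorted(Path(run_dir).glob(pattern)):
        rank, benchmark, node = path.parts[-2], path.parts[-3], path.parts[-5]
        with path.open() as f:
            results[(node, benchmark, rank)] = json.load(f)
    return results
```

For example, `collect_results("outputs/<datetime>")` would return one entry per rank directory that contains a `results.json`, where `<datetime>` stands for the run directory name.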
-
- 01 Jul, 2021 2 commits
-
-
Yifan Xiong authored
Support `--host-list` for deploy and run commands. Before this change, an inventory file was needed to use `sb deploy/run`. Now, `--host-list localhost` or `-l localhost` is sufficient for a quick try.
-
Yifan Xiong authored
* Support custom output directory. * Update document.
-
- 30 Jun, 2021 2 commits
- 29 Jun, 2021 1 commit
-
-
guoshzhao authored
* fix bug for nvidia v100 * hard code the supported dict for different arch.
-
- 28 Jun, 2021 3 commits
- 25 Jun, 2021 2 commits
-
-
Yifan Xiong authored
* Initialize SuperBench website. * Add GitHub Actions to automatically build and publish.
-
Yifan Xiong authored
Update SuperBench documents.
-
- 23 Jun, 2021 1 commit
-
-
Yifan Xiong authored
* Add `sb deploy` command content.
* Fix inline if-expression syntax in playbook.
* Fix quote escape issue in bash command.
* Add custom env in config.
* Update default config for multi GPU benchmarks.
* Update MANIFEST.in to include jinja2 template.
* Require jinja2 minimum version.
* Fix occasional duplicate output in Ansible runner.
* Fix mixed color from Ansible and Python colorlog.
* Update according to comments.
* Change superbench.env from list to dict in config file.
-
- 21 Jun, 2021 1 commit
-
-
guoshzhao authored
Benchmarks: Add Feature - Add DistributedImpl and DistributedBackend arguments for micro benchmark. (#100)
-
- 20 Jun, 2021 1 commit
-
-
Yuting Jiang authored
Rename bin name and result metric of cublas and cudnn microbenchmarks
-
- 16 Jun, 2021 2 commits
-
-
Yifan Xiong authored
Update packages and add build cache for CUDA 11.1.1 Dockerfile:
* Remove duplicate cmake and ompi, which are already in base image
* Add hpcx and sharp lib
* Add cache for gitmodules build
* Sort apt-get packages
-
Yifan Xiong authored
Fix bugs and refine log in single GPU benchmarks:
* Fix none framework issue
* Fix empty parameter bug
* Remove missed mobilenet_v3 models
* Change benchmark registration log to debug level
* Add pid in logging
* Add missing benchmarks in default config
* Fix deprecated logging warn
-
- 07 Jun, 2021 1 commit
-
-
guoshzhao authored
* Clean up the cache.
-
- 04 Jun, 2021 1 commit
-
-
guoshzhao authored
* fix return code reset issue
-
- 02 Jun, 2021 1 commit
-
-
Yifan Xiong authored
* Support local mode in runner.
-