Commits · c5aa4f4e382eeb1373fc715475911385e2241596 · tsoc / superbenchmark

22 Mar, 2022 1 commit
- Bug: Benchmarks - remove fp16 samples type converting time (#332) · c5aa4f4e
  user4543 authored Mar 22, 2022
```
**Description**
Remove fp16 samples type converting time for training cnn and lstm inference.
```
  c5aa4f4e
17 Mar, 2022 1 commit
- Bug: Benchmarks - remove fp16 samples type converting time for cnn and lstm models (#330) · 6e749180
  user4543 authored Mar 17, 2022
```
**Description**
Remove fp16 samples type converting time for cnn and lstm models.
```
  6e749180
06 Mar, 2022 1 commit

Benchmarks - Keep BatchNorm as fp32 for pytorch cnn models cast to fp16 (#322) · a9ef0f99

Jeff Daily authored Mar 06, 2022

**Description**
The BatchNorm operator is not numerically stable in fp16. The PyTorch documentation recommends keeping the BN op in fp32 for fp16 AMP models; see https://pytorch.org/docs/stable/amp.html#ops-that-can-autocast-to-float32. Preserving BN in fp32 lets superbench reflect real workloads more accurately.

a9ef0f99

10 Feb, 2022 1 commit

Benchmarks: Revise Code - Add support for pytorch>=1.9.0 of init_process_group (#305) · e31b8c9e

user4543 authored Feb 10, 2022

**Description**
Add support for pytorch>=1.9.0 of init_process_group.

**Major Revision**
- Use PrefixStore(TCPStore) to call init_process_group manually for each model run

e31b8c9e

28 Jan, 2022 1 commit

Benchmarks: Add Feature - Sync the E2E training results among all workers for each step. (#287) · d03d110f

guoshzhao authored Jan 28, 2022

**Major Revision**
- Sync (do allreduce max) the E2E training results among all workers.
- Avoid using ':0' in the metric name if only one rank has output.

d03d110f
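
The two revisions above can be illustrated with a small pure-Python sketch. It only simulates the behaviour: a real multi-worker run would presumably use a collective such as `torch.distributed.all_reduce` with `ReduceOp.MAX`. The helper names `allreduce_max` and `metric_names`, the metric name `steptime`, and the sample values are all hypothetical, not taken from the commit.

```python
def allreduce_max(per_rank_values):
    """Simulate an allreduce with MAX: every rank ends up with the largest value."""
    slowest = max(per_rank_values)
    return [slowest for _ in per_rank_values]


def metric_names(base_name, ranks_with_output):
    """Append ':<rank>' only when more than one rank reports output."""
    if len(ranks_with_output) == 1:
        return [base_name]
    return [f"{base_name}:{rank}" for rank in ranks_with_output]


# Hypothetical per-rank times (ms) for a single training step.
step_times = [10.2, 11.7, 10.9, 10.4]

# After the sync, every worker records the time of the slowest worker.
synced = allreduce_max(step_times)

single_rank_names = metric_names("steptime", [0])
multi_rank_names = metric_names("steptime", [0, 1])
```

Taking the maximum means each step is reported at the pace of the slowest worker, which is what bounds end-to-end throughput in synchronous training.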

19 Jan, 2022 1 commit
- Benchmarks: Add Feature - Add percentile metrics for ort and pytorch inference benchmarks (#283) · fd2bc9e0
  guoshzhao authored Jan 19, 2022
```
**Description**
Add 50th, 90th, 95th, 99th, 99.9th latency metrics for ORT and pytorch inference benchmarks.
```
  fd2bc9e0
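
As a rough illustration of the percentile metrics listed above, the following sketch computes them with the nearest-rank method. The commit does not say which percentile definition or metric names are used, so the method, the `percentile` helper, and the `p50`-style keys are assumptions.

```python
import math


def percentile(samples, p):
    """Nearest-rank percentile: smallest sample with at least p% of samples at or below it."""
    if not samples:
        raise ValueError("samples must not be empty")
    ordered = sorted(samples)
    # Round before ceil to guard against floating-point noise such as 999.0000000000001.
    rank = max(1, math.ceil(round(p / 100 * len(ordered), 9)))
    return ordered[rank - 1]


# Hypothetical latency samples in milliseconds: 1.0, 2.0, ..., 1000.0.
latencies_ms = [float(i) for i in range(1, 1001)]

report = {f"p{p}": percentile(latencies_ms, p) for p in (50, 90, 95, 99, 99.9)}
```

Tail percentiles such as the 99.9th need enough samples to be meaningful; with fewer than 1000 samples the nearest-rank 99.9th percentile is simply the maximum.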
13 Dec, 2021 1 commit
- Benchmarks - Add transformers for TensorRT inference (#254) · cb8a3cfb
  Yifan Xiong authored Dec 13, 2021
```
Add transformers for TensorRT inference.
```
  cb8a3cfb
09 Dec, 2021 1 commit
- Benchmarks: Unify metric names of benchmarks (#252) · 9f56b219
  Yuting Jiang authored Dec 09, 2021
```
**Description**
Unify metric names of benchmarks.
```
  9f56b219
28 Sep, 2021 1 commit
- Benchmarks: Fix bug - Fix bug when set force_fp32 option. (#214) · 1a86583b
  guoshzhao authored Sep 28, 2021
```
**Description**
Fix typo when setting the force_fp32 option.
```
  1a86583b
27 Sep, 2021 1 commit
- Benchmarks: Add Feature - Add option to use fp32 instead of tf32 (#213) · f9442456
  guoshzhao authored Sep 28, 2021
```
**Description**
Add option `force_fp32` to use fp32 instead of tf32, only takes effect on Ampere or newer GPUs.
```
  f9442456
26 Sep, 2021 1 commit

Release - SuperBench v0.3.0 (#212) · dfbd70b1

Yifan Xiong authored Sep 26, 2021



**Description**

Cherry-pick bug fixes from v0.3.0 to main.

**Major Revisions**
* Docs - Upgrade version and release note (#209)
* Benchmarks: Build Pipeline - Update rccl-test git submodule to dc1ad48 (#210)
* Benchmarks: Update - Update benchmarks in configuration file (#208)
* CI/CD - Update GitHub Action VM (#211)
* Benchmarks: Fix Bug - Fix wrong parameters for gpu-sm-copy-bw in configuration examples (#203)
* CI/CD - Fix bug in build image for push event (#205)
* Benchmark: Fix Bug - fix error message of communication-computation-overlap (#204)
* Tool: Fix bug - Fix function naming issue in system info  (#200)
* CI/CD - Push images in GitHub Action (#202)
* Bug - Fix torch.distributed command for single node (#201)
* CLI - Integrate system info for node (#199)
* Benchmarks: Code Revision - Revise CMake files for microbenchmarks. (#196)
* CI/CD - Add ROCm image build in GitHub Actions (#194)
* Bug: Fix bug - fix bug of hipBusBandwidth build (#193)
* Benchmarks: Build Pipeline - Restore rocblas build logic (#197)
* Bug: Fix Bug - Add barrier before 'destroy_process_group' in model benchmarks (#198)
* Bug - Revise 'docker run' in sb deploy (#195)
* Bug - Fix Bug : fix bug of error param operations to operation in rccl-bw of hpe config (#190)
Co-authored-by: Yuting Jiang <v-yujiang@microsoft.com>
Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>
Co-authored-by: Ziyue Yang <ziyyang@microsoft.com>

dfbd70b1

06 Aug, 2021 2 commits
- Benchmarks: Add Feature - Set reduce type for current benchmarks' metrics. (#149) · acf365a8
  guoshzhao authored Aug 06, 2021
```
**Description**
Set reduce type for current benchmarks' metrics, including model benchmarks and ShardingMatmul.
```
  acf365a8
- Benchmarks: Code Revision - Calculate average value by using statistics module. (#148) · bc1a61b9
  guoshzhao authored Aug 06, 2021
```
**Description**
Replace `sum(results) / len(results)` with `statistics.mean(results)`
```
  bc1a61b9
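
A minimal sketch of the replacement described above, using hypothetical timing values. Both forms give the same average for non-empty input; the variable names are illustrative only.

```python
import statistics

# Hypothetical per-step timings.
results = [12.5, 13.0, 12.8, 13.1]

# Old form: manual average.
manual_average = sum(results) / len(results)

# New form: standard-library helper.
module_average = statistics.mean(results)

# On empty input, statistics.mean raises statistics.StatisticsError,
# whereas the manual form raises ZeroDivisionError.
```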
29 Jul, 2021 1 commit

Release - SuperBench v0.2.1 (#142) · 69b2c631

Yifan Xiong authored Jul 29, 2021

__Description__
Cherry-pick bug fixes from v0.2.1 to main.

__Major Revisions__
* Fix bug where VGG models failed on A100 GPU with batch_size=128.
* Fix Ansible connection issue when running on localhost.
* Update version in packages and docs.

69b2c631

28 Jun, 2021 2 commits
- Benchmarks: Add Configuration - Add validation config file for azure NDv4. (#103) · f22bb3f2
  guoshzhao authored Jun 28, 2021
```
* add config file for ndv4.
```
  f22bb3f2
- Benchmarks: Code Revision - Replace torch.optim.AdamW with transformers.AdamW. (#106) · 9c748527
  guoshzhao authored Jun 28, 2021
```
* replace torch.optim.AdamW with transformers.AdamW.
```
  9c748527
21 Jun, 2021 1 commit
- Benchmarks: Add Feature - Add DistributedImpl and DistributedBackend arguments... · 216c5b5c
  guoshzhao authored Jun 21, 2021
```
Benchmarks: Add Feature - Add DistributedImpl and DistributedBackend arguments for micro benchmark. (#100)
```
  216c5b5c
16 Jun, 2021 1 commit

Bug bash - Fix bugs and refine log in single GPU benchmarks (#97) · ddbc51a1

Yifan Xiong authored Jun 16, 2021

Fix bugs and refine log in single GPU benchmarks:

* Fix none framework issue
* Fix empty parameter bug
* Remove missed mobilenet_v3 models
* Change benchmark registration log to debug level
* Add pid in logging
* Add missing benchmarks in default config
* Fix deprecated logging warn

ddbc51a1

07 Jun, 2021 1 commit
- Benchmarks: Fix Bug - Fix OOM issue when run pytorch models sequentially. (#93) · 03b41be1
  guoshzhao authored Jun 07, 2021
```
* Clean up the cache.
```
  03b41be1
04 Jun, 2021 1 commit
- Benchmarks: Fix Bug - Fix return code overwrite issue (#94) · 2d9be807
  guoshzhao authored Jun 04, 2021
```
* fix return code reset issue
```
  2d9be807
19 May, 2021 1 commit
- expose interface of pin memory and modify cnn configuration (#75) · b7d0ee32
  Yuting Jiang authored May 19, 2021
  
  b7d0ee32
26 Apr, 2021 2 commits
- Benchmarks: Fix Bug - Increase default sample count for benchmarking. (#64) · a7184da3
  guoshzhao authored Apr 26, 2021
  
  a7184da3
- Benchmarks: Fix Bug - Fix dataset precision for CNN and LSTM benchmarks. · 0324117f
  guoshzhao authored Apr 26, 2021
  
  0324117f
20 Apr, 2021 2 commits
- Benchmarks: Add Benchmark - Add LSTM model benchmarks. (#60) · 2a7ab691
  guoshzhao authored Apr 20, 2021
```
* Benchmarks: Add Benchmark - Add LSTM model benchmarks.
```
  2a7ab691
- Benchmarks: Add Benchmark - Add CNN model benchmarks. (#59) · 902ea211
  guoshzhao authored Apr 20, 2021
```
* Benchmarks: Add Benchmark - Add CNN model benchmarks.
```
  902ea211
16 Apr, 2021 2 commits
- Benchmarks: Code Revision - Fix some issue for BERT benchmark. (#58) · ce3ed24a
  guoshzhao authored Apr 16, 2021
```
Benchmarks: Code Revision - Fix some issue for BERT benchmark. (#58)
```
  ce3ed24a
- Benchmarks: Add Benchmark - Add GPT2 model benchmark. (#57) · af567cf6
  guoshzhao authored Apr 16, 2021
```
* Benchmarks: Add Benchmark - Add GPT2 model benchmark.
```
  af567cf6
12 Apr, 2021 1 commit
- add _post_process() implementation in pytorch_base.py to clean up distributed resource. (#45) · 1f726091
  guoshzhao authored Apr 12, 2021
```
Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>
```
  1f726091
08 Apr, 2021 1 commit
- Benchmarks: Code Revision - Revise result process interface and add result checking (#32) · 2871a68b
  guoshzhao authored Apr 08, 2021
```
* revise result process interface

* add more comments
Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>
```
  2871a68b
26 Mar, 2021 1 commit

Benchmarks: Add Benchmark - Add Pytorch BERT benchmarks, including bert-base... · 0972b223

guoshzhao authored Mar 26, 2021


Benchmarks: Add Benchmark - Add Pytorch BERT benchmarks, including bert-base and bert-large.   (#20)

* add pytorch bert benchmarks.

* revise code

* fix issue

* revise code.
Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>

0972b223

22 Mar, 2021 2 commits

Benchmarks: Code Revision - Move benchmarks auto-registration from registry.py to __init__.py (#24) · 8d24d03d
guoshzhao authored Mar 22, 2021
```
* move benchmarks registration from registry.py to __init__.py

* revise __init__.
Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>
```
8d24d03d

Benchmarks: Add Feature - Add benchmark finish check according to... · 5dfcc6be

guoshzhao authored Mar 22, 2021


Benchmarks: Add Feature - Add benchmark finish check according to num_warmup/num_steps and duration in ModelBenchmark class. (#25)

* add is_finished function

* reuse current time.
Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>

5dfcc6be
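
The finish check might look roughly like the sketch below. The exact rule is an assumption: here a run stops once `num_warmup + num_steps` steps have completed, or once `duration` seconds have elapsed when `duration` is positive. The standalone function and its parameters are hypothetical and do not reproduce the actual `ModelBenchmark` method.

```python
def is_finished(curr_step, curr_time, start_time, num_warmup, num_steps, duration):
    """Return True when the benchmark should stop (assumed rule, see lead-in).

    curr_time is passed in so the caller can reuse a timestamp it already took
    for measuring the step, instead of reading the clock a second time.
    """
    total_steps = num_warmup + num_steps
    # Stop on elapsed time when a positive duration is configured.
    if duration > 0 and (curr_time - start_time) >= duration:
        return True
    # Stop on step count when a positive number of steps is configured.
    if total_steps > 0 and curr_step >= total_steps:
        return True
    return False


# Step-bounded run: 2 warmup + 8 measured steps, no duration limit.
done_by_steps = is_finished(10, 5.0, 0.0, num_warmup=2, num_steps=8, duration=0)

# Duration-bounded run: 60 s limit reached before the step budget is used up.
done_by_time = is_finished(3, 61.0, 0.0, num_warmup=2, num_steps=8, duration=60)
```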

18 Mar, 2021 1 commit

Benchmarks: Add Feature - Add sample_count argument for ModelBenchmark. (#22) · c00dc670

guoshzhao authored Mar 18, 2021



* add sample_count argument.

* handle more conditions.

* address comments.
Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>

c00dc670

15 Mar, 2021 1 commit
- add more checks for PytorchBase module (#19) · 80f434cb
  guoshzhao authored Mar 15, 2021
```
Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>
```
  80f434cb
11 Mar, 2021 1 commit

Benchmarks: Add Feature - Add random dataset for Pytorch. (#17) · ebea2d50

guoshzhao authored Mar 12, 2021



* add random dataset.

* install pytorch-cpu for test docker.

* fix typo

* add more test cases.

* address comments.
Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>

ebea2d50

09 Mar, 2021 2 commits
- Benchmarks: Add Feature - Add flag to disable GPU. (#15) · 52848d2f
  guoshzhao authored Mar 10, 2021
```
* add flag to disable GPU.

* fix spelling

* fix unittest.

* address comments.
Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>
```
  52848d2f
- rename _cal_params_size as _cal_params_count. (#16) · 83a4e93f
  guoshzhao authored Mar 09, 2021
```
Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>
```
  83a4e93f
08 Mar, 2021 2 commits

Benchmarks: Add Feature - Add pytorch base class (#11) · 088aa19a

guoshzhao authored Mar 08, 2021



* add pytorch base class

* address comments
Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>

088aa19a

Benchmarks: Add Feature - Add optimizer definition in Model Base (#13) · 52b52c2c

guoshzhao authored Mar 08, 2021



* add optimizer definition and function to create torch optimizer.

* move optimizer enum into model_base module.
Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>

52b52c2c

04 Mar, 2021 1 commit
- add more checks for model base (#12) · 9388f8f5
  guoshzhao authored Mar 04, 2021
```
Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>
```
  9388f8f5