Commits · 34cd2e8ca1ff7dde08373b8c66afa861fdf6fddf · tsoc / superbenchmark

26 Aug, 2021 1 commit

Benchmarks: Code Revision - Rename computation_communication_overlap microbenchmark metric (#167) · 34cd2e8c

Yuting Jiang authored Aug 26, 2021

**Description**
Rename computation_communication_overlap microbenchmark metric .

**Major Revision**
- remove rank info in metric.
- simplify and rename metric.

34cd2e8c

25 Aug, 2021 1 commit

Benchmarks: Code Revision - Extract base class for memory bandwidth microbenchmark (#159) · e5e84a2e

Yuting Jiang authored Aug 26, 2021

**Description**
extract base class for memory bandwidth microbenchmark.

**Major Revision**
- revise and optimize cuda_memory_bandwidth_performance
- extract base class for memory bandwidth microbenchmark
- add test for base class

e5e84a2e

22 Aug, 2021 1 commit
- Benchmarks: Revise Benchmark - Add readwrite I/O pattern (#161) · 6774d7b7
  Ziyue Yang authored Aug 22, 2021
```
**Description**
This commit adds readwrite I/O pattern for FIO benchmark. Read/write ratio is fixed at 4:1.
```
  6774d7b7
20 Aug, 2021 1 commit

Runner: Add Feature - Generate summarized output files. (#157) · 7595d794

guoshzhao authored Aug 20, 2021

**Description**
Generate the summarized output files from all nodes. For each metric, do the reduce operation according to the `reduce_op`

**Major Revision**
- Generate the summarized json file per node:
For microbenchmark, the format is `{benchmark_name}/[{run_count}/]{metric_name}[:rank]`
For modelbenchmark, the format is `{benchmark_name}/{sub_benchmark_name}/[{run_count}/]{metric_name}`
`[]` means optional.
```
{
  "kernel-launch/overhead_event:0": 0.00583,
  "kernel-launch/overhead_event:1": 0.00545,
  "kernel-launch/overhead_event:2": 0.00581,
  "kernel-launch/overhead_event:3": 0.00572,
  "kernel-launch/overhead_event:4": 0.00559,
  "kernel-launch/overhead_event:5": 0.00591,
  "kernel-launch/overhead_event:6": 0.00562,
  "kernel-launch/overhead_event:7": 0.00586,
  "resnet_models/pytorch-resnet50/steptime-train-float32": 544.0827468410134,
  "resnet_models/pytorch-resnet50/throughput-train-float32": 353.7607016465773,
  "resnet_models/pytorch-resnet50/steptime-train-float16": 425.40482617914677,
  "resnet_models/pytorch-resnet50/throughput-train-float16": 454.0142363793973,
  "pytorch-sharding-matmul/0/allreduce": 10.561786651611328,
  "pytorch-sharding-matmul/1/allreduce": 10.561786651611328,
  "pytorch-sharding-matmul/0/allgather": 10.088025093078613,
  "pytorch-sharding-matmul/1/allgather": 10.088025093078613
}
```
- Generate the summarized jsonl file for all nodes, each line is the result from one node in json format.

7595d794

16 Aug, 2021 1 commit
- Benchmarks: Code Revision - change 'reduce' to 'reduce_op' (#156) · 7293e783
  guoshzhao authored Aug 16, 2021
```
**Description**
Change the field name `reduce` to `reduce_op`.
```
  7293e783
06 Aug, 2021 2 commits
- Benchmarks: Add Feature - Set reduce type for current benchmarks' metrics. (#149) · acf365a8
  guoshzhao authored Aug 06, 2021
```
**Description**
Set reduce type for current benchmarks' metrics, including model benchmarks and ShardingMatmul.
```
  acf365a8
- Benchmarks: Code Revision - Calculate average value by using statistics module. (#148) · bc1a61b9
  guoshzhao authored Aug 06, 2021
```
**Description**
Replace `sum(results) / len(results)` with `statistics.mean(results)`
```
  bc1a61b9
05 Aug, 2021 1 commit

Benchmarks: Add Feature - Add reduce function support for output summary. (#147) · e41b1f62

guoshzhao authored Aug 05, 2021

**Description**
Add reduce function support for output summary.

**Major Revision**
- Add reducer class to maintain all reduce functions.
- Save reduce type of each metric into `BenchmarkResult`
- Fix UT.

e41b1f62

30 Jul, 2021 1 commit
- Benchmarks: Add Benchmark - Revise and add rccl microbenchmark for rocm (#143) · 157b4e2d
  Yuting Jiang authored Jul 30, 2021
```
**Description**
Add rccl bandwidth microbenchmark for rocm.

**Major Revision**
- Register rccl-bw benchmark.
```
  157b4e2d
29 Jul, 2021 1 commit

Release - SuperBench v0.2.1 (#142) · 69b2c631

Yifan Xiong authored Jul 29, 2021

__Description__
Cherry-pick bug fixes from v0.2.1 to main.

__Major Revisions__
* Fix bug of VGG models failed on A100 GPU with batch_size=128.
* Fix Ansible connection issue when running in localhost.
* Update version in packages and docs.

69b2c631

27 Jul, 2021 2 commits

Benchmarks: Add Benchmark - Add the source code of rocm kernel launch overhead benchmark. (#136) · 1ee8f7dc

Yuting Jiang authored Jul 27, 2021

**Description**
Add the source code of rocm kernel launch overhead benchmark. 

**Major Revision**
- Revise cmake build logic to support both cuda and rocm

1ee8f7dc

Benchmarks: Build Pipeline - Support rocm cmake build (#137) · fdc33f40

Yuting Jiang authored Jul 27, 2021

**Description**
Support rocm cmake build. 

**Major Revision**
- Add  some envs in rocm_common.cmake to support rocm cmake build.

fdc33f40

26 Jul, 2021 1 commit

Benchmarks: Add Benchmark - Add NCCL performance benchmark (#113) · e083a598

Yuting Jiang authored Jul 26, 2021

**Description**
Add NCCL performance microbenchmark.

**Major Revision**
- Add microbenchmark, example, test, config for NCCL

e083a598

23 Jul, 2021 2 commits

Benchmarks: Add Benchmark - Add IB Loopback performance benchmark. (#112) · b0c5addc

Yuting Jiang authored Jul 24, 2021

**Description**
Add RDMA Loopback performance microbenchmark.

**Major Revision**
- Add microbenchmark, example, test, config for RDMA Loopback

b0c5addc

Benchmarks: Add Benchmark - Add disk performance benchmark (#132) · db297fb4

Ziyue Yang authored Jul 23, 2021

**Description**
Add disk performance microbenchmark.

**Major Revision**
- Add microbenchmark, example, test, config for disk performance.

**Minor Revision**
- Fix bugs in executor unit test related to default enabled tests.

db297fb4

13 Jul, 2021 1 commit

Benchmarks: Add Benchmark - Add memory bandwidth benchmark for cuda. (#114) · f9550bd6

Yuting Jiang authored Jul 13, 2021

Add microbenchmark, example, test, config for cuda memory performance and Add cuda-samples(tag with cuda version) as git submodule and update related makefile

f9550bd6

30 Jun, 2021 1 commit
- Benchmarks: Fix Bug - Fix typo in gemm-flops benchmark. (#109) · 1e96c27e
  guoshzhao authored Jun 30, 2021
  
  1e96c27e
29 Jun, 2021 1 commit
- Benchmarks: Fix Bug - Fix gemm kernel bug for nvidia v100. (#105) · 8ffaddfa
  guoshzhao authored Jun 29, 2021
```
* fix bug for nvidia v100
* hard code the supported dict for different arch.
```
  8ffaddfa
28 Jun, 2021 2 commits
- Benchmarks: Add Configuration - Add validation config file for azure NDv4. (#103) · f22bb3f2
  guoshzhao authored Jun 28, 2021
```
* add config file for ndv4.
```
  f22bb3f2
- Benchmarks: Code Revision - Replace torch.optim.AdamW with transformers.AdamW. (#106) · 9c748527
  guoshzhao authored Jun 28, 2021
```
* replace torch.optim.AdamW with transformers.AdamW.
```
  9c748527
21 Jun, 2021 1 commit
- Benchmarks: Add Feature - Add DistributedImpl and DistributedBackend arguments... · 216c5b5c
  guoshzhao authored Jun 21, 2021
```
Benchmarks: Add Feature - Add DistributedImpl and DistributedBackend arguments for micro benchmark. (#100)
```
  216c5b5c
20 Jun, 2021 1 commit
- Bug bash - Rename bin name and metric name of cublas and cudnn microbenchmark (#99) · 3d72c078
  Yuting Jiang authored Jun 20, 2021
```
rename bin name and result metric of cublas and cudnn microbenchmark
```
  3d72c078
16 Jun, 2021 1 commit

Bug bash - Fix bugs and refine log in single GPU benchmarks (#97) · ddbc51a1

Yifan Xiong authored Jun 16, 2021

Fix bugs and refine log in single GPU benchmarks:

* Fix none framework issue
* Fix empty parameter bug
* Remove missed mobilenet_v3 models
* Change benchmark registration log to debug level
* Add pid in logging
* Add missing benchmarks in default config
* Fix deprecated logging warn

ddbc51a1

07 Jun, 2021 1 commit
- Benchmarks: Fix Bug - Fix OOM issue when run pytorch models sequentially. (#93) · 03b41be1
  guoshzhao authored Jun 07, 2021
```
* Clean up the cache.
```
  03b41be1
04 Jun, 2021 1 commit
- Benchmarks: Fix Bug - Fix return code overwrite issue (#94) · 2d9be807
  guoshzhao authored Jun 04, 2021
```
* fix return code reset issue
```
  2d9be807
02 Jun, 2021 2 commits
- Benchmarks: Code Revision - Change default shape of sharding-matmul. (#92) · 44c5103b
  guoshzhao authored Jun 02, 2021
```
* Change default shape of sharding-matmul.
```
  44c5103b
- Benchmarks: Add Benchmark - Add FLOPs performance benchmark for cuda. (#87) · 6c6f5269
  guoshzhao authored Jun 02, 2021
```
* add cuda flops performance benchmark.
```
  6c6f5269
01 Jun, 2021 3 commits
- Benchmarks: Add benchmark - add micro benchmark for cudnn test (#89) · 83235433
  Yuting Jiang authored Jun 01, 2021
```
* add python related cudnn microbenchmark
```
  83235433
- Benchmarks: Code Revision - add error return code for cublas microbenchmark (#90) · 08317481
  Yuting Jiang authored Jun 01, 2021
```
* add error return code for cublas micro benchmark
```
  08317481
- Benchmarks: Add benchmark - add source code of cudnn function micro benchmark (#78) · 61c258fe
  Yuting Jiang authored Jun 01, 2021
```
* Benchmarks: Add benchmark - add source code of cudnn function micro benchmark
```
  61c258fe
31 May, 2021 1 commit

Benchmarks: Add benchmark - add micro benchmark for cublas test (#80) · 18398fba

Yuting Jiang authored May 31, 2021



* add benchmark for cublas test

* format

* revise error handling and test

* add interface to read json file, revise json file path and include .json in packaging

* add random_seed in arguments

* revise preprocess of cublas benchmark

* fix lint error and note error in source code

* update according comments

* revise input arguments from json file to custom str and convert json file to built-in dict list

* restore package config

* fit lint issue

* update platform and comments

* rename files to match source code dir and fix comments error
Co-authored-by: root <root@sb-validation-000001.51z1chmys5fuzfqyo4niepozre.bx.internal.cloudapp.net>

18398fba

27 May, 2021 1 commit

Benchmarks: Add benchmark - add source code of cublas function micro benchmark (#77) · 87f6b371

Yuting Jiang authored May 27, 2021



* Superbenchmark: Add benchmarks - add cublas function micro benchmark

* format

* add python benchmark for cublas functions, example and test file

* detele python related and rename some files

* revise cmd_helper and move json package to cmake

* resolve conflict

* revise error handing to try-catch and update some code style

* revise cmd_helper.h, cublas_helper.h, cublas_helper.cpp

* revise structure of the cublas function

* add some comments and move cuda_init and cuda_free

* add comments for class member

* add ramdom seed, revise input from file to json string, simplify cmake

* delete json file in source code of cublas

* update according comments

* limit batchcount=1 in initialization of cublas function which do not use batch count

* revise and fix some errors of annotations

* update according comments and revise construction of CublasFunction
Co-authored-by: root <root@sb-validation-000001.51z1chmys5fuzfqyo4niepozre.bx.internal.cloudapp.net>

87f6b371

26 May, 2021 1 commit
- Benchmarks: Build Pipeline - Revise path of installing cmake projects (#83) · e9965162
  Yuting Jiang authored May 26, 2021
```
* Unify SB_MICRO_PATH and SB_MICRO_LIB

* fix bug of lib path
```
  e9965162
19 May, 2021 2 commits
- Benchmarks: Add Benchmark - Add kernel launch overhead benchmark. (#74) · e977bbc1
  guoshzhao authored May 19, 2021
```
* add kernel launch overhead benchmark.
```
  e977bbc1
- expose interface of pin memory and modify cnn configuration (#75) · b7d0ee32
  Yuting Jiang authored May 19, 2021
  
  b7d0ee32
18 May, 2021 1 commit

Benchmarks: Add Benchmark - Add the source code of cuda kernel launch overhead benchmark. (#71) · 7cfe7c16

guoshzhao authored May 18, 2021

* add cuda kernel launch overhead benchmark - source part.
* can customize the nvcc_archs_support.
* set SB_MICRO_PATH for azure pipeline tests.

7cfe7c16

17 May, 2021 1 commit
- Benchmarks: Add Feature - Add script to build all cmake benchmark projects. (#72) · 2bc7ada1
  guoshzhao authored May 17, 2021
```
* add script to build all native benchmarks with cmake.
```
  2bc7ada1
13 May, 2021 1 commit

Benchmarks: Code Revision - Revise MicroBenchmark class to be more flexible. (#66) · 729e04ab

guoshzhao authored May 13, 2021

* Revise MicroBenchmark class to be more flexible.
* use command index not the command as the parameter.
* changes according to discussion.

729e04ab

11 May, 2021 1 commit

Utils - Support lazy import (#67) · 57ce473a

Yifan Xiong authored May 11, 2021

__Major Revision__

* Support lazy import.
* Not importing benchmarks when running `help`, `version`, `deploy` commands, etc.

57ce473a

26 Apr, 2021 1 commit
- Benchmarks: Fix Bug - Increase default sample count for benchmarking. (#64) · a7184da3
  guoshzhao authored Apr 26, 2021
  
  a7184da3