- 30 Aug, 2021 2 commits
-
-
Yuting Jiang authored
**Description** Add gemm flops microbenchmark for amd. **Major Revision** - Add gemm flops microbenchmark for amd. - Add related example and test file.
-
Yuting Jiang authored
**Description** Extract base class for gemm flops microbenchmark. **Major Revision** - extract base class for gemm flops microbenchmark and add related test. - revise gemm_flops_performance for cuda.
-
- 27 Aug, 2021 4 commits
-
-
guoshzhao authored
**Description** Rename `kernel_launch_overhead_event` to `event_overhead`, `kernel_launch_overhead_wall` to `wall_overhead`.
-
Yuting Jiang authored
**Description** Add memory bus bandwidth performance microbenchmark for amd. **Major Revision** - Add memory bus bandwidth performance microbenchmark for amd. - Add related example and test file.
-
Ziyue Yang authored
**Description** This commit adds the benchmark program for GPU-initiated data transfer benchmark.
-
Yuting Jiang authored
Benchmarks: Fix Bug - fix bug of microbenmark building cublas and cudnn for amd in build pipeline (#166) **Description** Fix bug of microbenmark building cublas and cudnn for amd **Major Revision** - remove cuda LANGUAGES in project() - check CUDAToolkit quiet and then build if found
-
- 26 Aug, 2021 1 commit
-
-
Yuting Jiang authored
**Description** Rename computation_communication_overlap microbenchmark metric . **Major Revision** - remove rank info in metric. - simplify and rename metric.
-
- 25 Aug, 2021 1 commit
-
-
Yuting Jiang authored
**Description** extract base class for memory bandwidth microbenchmark. **Major Revision** - revise and optimize cuda_memory_bandwidth_performance - extract base class for memory bandwidth microbenchmark - add test for base class
-
- 22 Aug, 2021 1 commit
-
-
Ziyue Yang authored
**Description** This commit adds readwrite I/O pattern for FIO benchmark. Read/write ratio is fixed at 4:1.
-
- 06 Aug, 2021 2 commits
- 05 Aug, 2021 1 commit
-
-
guoshzhao authored
**Description** Add reduce function support for output summary. **Major Revision** - Add reducer class to maintain all reduce functions. - Save reduce type of each metric into `BenchmarkResult` - Fix UT.
-
- 30 Jul, 2021 1 commit
-
-
Yuting Jiang authored
**Description** Add rccl bandwidth microbenchmark for rocm. **Major Revision** - Register rccl-bw benchmark.
-
- 27 Jul, 2021 2 commits
-
-
Yuting Jiang authored
**Description** Add the source code of rocm kernel launch overhead benchmark. **Major Revision** - Revise cmake build logic to support both cuda and rocm
-
Yuting Jiang authored
**Description** Support rocm cmake build. **Major Revision** - Add some envs in rocm_common.cmake to support rocm cmake build.
-
- 26 Jul, 2021 1 commit
-
-
Yuting Jiang authored
**Description** Add NCCL performance microbenchmark. **Major Revision** - Add microbenchmark, example, test, config for NCCL
-
- 23 Jul, 2021 2 commits
-
-
Yuting Jiang authored
**Description** Add RDMA Loopback performance microbenchmark. **Major Revision** - Add microbenchmark, example, test, config for RDMA Loopback
-
Ziyue Yang authored
**Description** Add disk performance microbenchmark. **Major Revision** - Add microbenchmark, example, test, config for disk performance. **Minor Revision** - Fix bugs in executor unit test related to default enabled tests.
-
- 13 Jul, 2021 1 commit
-
-
Yuting Jiang authored
Add microbenchmark, example, test, config for cuda memory performance and Add cuda-samples(tag with cuda version) as git submodule and update related makefile
-
- 30 Jun, 2021 1 commit
-
-
guoshzhao authored
-
- 29 Jun, 2021 1 commit
-
-
guoshzhao authored
* fix bug for nvidia v100 * hard code the supported dict for different arch.
-
- 21 Jun, 2021 1 commit
-
-
guoshzhao authored
Benchmarks: Add Feature - Add DistributedImpl and DistributedBackend arguments for micro benchmark. (#100)
-
- 20 Jun, 2021 1 commit
-
-
Yuting Jiang authored
rename bin name and result metric of cublas and cudnn microbenchmark
-
- 02 Jun, 2021 2 commits
- 01 Jun, 2021 3 commits
-
-
Yuting Jiang authored
* add python related cudnn microbenchmark
-
Yuting Jiang authored
* add error return code for cublas micro benchmark
-
Yuting Jiang authored
* Benchmarks: Add benchmark - add source code of cudnn function micro benchmark
-
- 31 May, 2021 1 commit
-
-
Yuting Jiang authored
* add benchmark for cublas test * format * revise error handling and test * add interface to read json file, revise json file path and include .json in packaging * add random_seed in arguments * revise preprocess of cublas benchmark * fix lint error and note error in source code * update according comments * revise input arguments from json file to custom str and convert json file to built-in dict list * restore package config * fit lint issue * update platform and comments * rename files to match source code dir and fix comments error Co-authored-by:root <root@sb-validation-000001.51z1chmys5fuzfqyo4niepozre.bx.internal.cloudapp.net>
-
- 27 May, 2021 1 commit
-
-
Yuting Jiang authored
* Superbenchmark: Add benchmarks - add cublas function micro benchmark * format * add python benchmark for cublas functions, example and test file * detele python related and rename some files * revise cmd_helper and move json package to cmake * resolve conflict * revise error handing to try-catch and update some code style * revise cmd_helper.h, cublas_helper.h, cublas_helper.cpp * revise structure of the cublas function * add some comments and move cuda_init and cuda_free * add comments for class member * add ramdom seed, revise input from file to json string, simplify cmake * delete json file in source code of cublas * update according comments * limit batchcount=1 in initialization of cublas function which do not use batch count * revise and fix some errors of annotations * update according comments and revise construction of CublasFunction Co-authored-by:root <root@sb-validation-000001.51z1chmys5fuzfqyo4niepozre.bx.internal.cloudapp.net>
-
- 26 May, 2021 1 commit
-
-
Yuting Jiang authored
* Unify SB_MICRO_PATH and SB_MICRO_LIB * fix bug of lib path
-
- 19 May, 2021 1 commit
-
-
guoshzhao authored
* add kernel launch overhead benchmark.
-
- 18 May, 2021 1 commit
-
-
guoshzhao authored
* add cuda kernel launch overhead benchmark - source part. * can customize the nvcc_archs_support. * set SB_MICRO_PATH for azure pipeline tests.
-
- 13 May, 2021 1 commit
-
-
guoshzhao authored
* Revise MicroBenchmark class to be more flexible. * use command index not the command as the parameter. * changes according to discussion.
-
- 14 Apr, 2021 1 commit
-
-
Yuting Jiang authored
* Benchmarks: Add Benchmark - add computation and communication overlap micro benchmark * Benchmarks: Add benchmark - fix some format issues and typo * Benchmarks: Add Benchmark - update according comments and add test * revise tests * skip multi gpu test due to no multi gpu Co-authored-by:v-yujiang <v-yujiang@microsoft.com>
-
- 12 Apr, 2021 4 commits
-
-
guoshzhao authored
Co-authored-by:Guoshuai Zhao <guzhao@microsoft.com>
-
guoshzhao authored
Co-authored-by:Guoshuai Zhao <guzhao@microsoft.com>
-
guoshzhao authored
Co-authored-by:Guoshuai Zhao <guzhao@microsoft.com>
-
guoshzhao authored
Co-authored-by:Guoshuai Zhao <guzhao@microsoft.com>
-
- 09 Apr, 2021 1 commit
-
-
guoshzhao authored
Benchmarks: Add Benchmark - Add op-sharding microbenchmark, including matmul and sharding_matmul. (#36) * add microbenchmark - sharding matmul. * address comments. Co-authored-by:Guoshuai Zhao <guzhao@microsoft.com>
-