- 28 Jan, 2022 1 commit
-
-
guoshzhao authored
**Major Revision**
- Sync the E2E training results among all workers (all-reduce with max).
- Avoid using ':0' in the metric name when only one rank produces output.
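The two changes above can be sketched in plain Python. This is only an illustration: the all-reduce is simulated on in-memory lists rather than a real collective, and the helper names `allreduce_max` and `metric_names` as well as the `name:rank` suffix format are assumptions, not the project's actual code.

```python
def allreduce_max(per_worker_results):
    """Simulate an all-reduce MAX: element-wise max over equal-length result lists."""
    if not per_worker_results:
        raise ValueError("need results from at least one worker")
    length = len(per_worker_results[0])
    if any(len(r) != length for r in per_worker_results):
        raise ValueError("all workers must report the same number of values")
    # zip(*...) groups the i-th value of every worker together.
    return [max(values) for values in zip(*per_worker_results)]


def metric_names(base, ranks_with_output):
    """Return metric names, adding a ':<rank>' suffix only when several ranks report."""
    if len(ranks_with_output) == 1:
        # A single reporting rank needs no suffix such as ':0'.
        return [base]
    return [f"{base}:{rank}" for rank in ranks_with_output]
```

Taking the maximum means every worker ends up with the slowest worker's step times, which is a conservative choice for end-to-end results.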
-
- 19 Jan, 2022 1 commit
-
-
guoshzhao authored
**Description** Add 50th, 90th, 95th, 99th, and 99.9th percentile latency metrics for ORT and PyTorch inference benchmarks.
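A minimal sketch of how such percentile metrics could be derived from a list of measured latencies. It uses the nearest-rank definition of a percentile, which is an assumption (the commit does not say which interpolation method is used), and the function name and `latency_<p>` metric keys are hypothetical.

```python
import math
from fractions import Fraction

# Percentiles kept as strings so that 99.9 is represented exactly.
PERCENTILES = ("50", "90", "95", "99", "99.9")


def latency_percentiles(latencies_ms):
    """Return nearest-rank percentile latencies keyed by a hypothetical metric name."""
    if not latencies_ms:
        raise ValueError("no latency samples")
    ordered = sorted(latencies_ms)
    n = len(ordered)
    result = {}
    for p in PERCENTILES:
        # Nearest-rank: the smallest rank r such that r / n >= p / 100.
        rank = max(1, math.ceil(Fraction(p) * n / 100))
        result[f"latency_{p}"] = ordered[rank - 1]
    return result
```

Exact fractions avoid floating-point rounding pushing the 99.9th percentile rank one position too high.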
-
- 09 Dec, 2021 1 commit
-
-
Yuting Jiang authored
**Description** Unify metric names of benchmarks.
-
- 27 Sep, 2021 1 commit
-
-
guoshzhao authored
**Description** Add option `force_fp32` to use FP32 instead of TF32. The option only takes effect on Ampere or newer GPUs.
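One way such an option could be wired up is sketched below, assuming a PyTorch backend. The parser and the `apply_force_fp32` helper are hypothetical; the two `allow_tf32` switches are PyTorch's documented TF32 controls, but whether the project uses exactly these is an assumption.

```python
import argparse


def build_parser():
    """Hypothetical CLI exposing only the force_fp32 flag."""
    parser = argparse.ArgumentParser(description="benchmark options (sketch)")
    parser.add_argument(
        "--force_fp32",
        action="store_true",
        default=False,
        help="Use FP32 instead of TF32 (only matters on Ampere or newer GPUs).",
    )
    return parser


def apply_force_fp32(force_fp32):
    """Disable TF32 in PyTorch when requested; return True if the switches were set."""
    if not force_fp32:
        return False
    try:
        import torch  # imported lazily so the sketch also loads without PyTorch
    except ImportError:
        return False
    # On pre-Ampere GPUs these switches have no effect, since TF32 is unavailable there.
    torch.backends.cuda.matmul.allow_tf32 = False
    torch.backends.cudnn.allow_tf32 = False
    return True
```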
-
- 06 Aug, 2021 2 commits
- 28 Jun, 2021 1 commit
-
-
guoshzhao authored
* add config file for ndv4.
-
- 21 Jun, 2021 1 commit
-
-
guoshzhao authored
Benchmarks: Add Feature - Add DistributedImpl and DistributedBackend arguments for micro benchmark. (#100)
-
- 04 Jun, 2021 1 commit
-
-
guoshzhao authored
* fix return code reset issue
-
- 19 May, 2021 1 commit
-
-
Yuting Jiang authored
-
- 26 Apr, 2021 1 commit
-
-
guoshzhao authored
-
- 08 Apr, 2021 1 commit
-
-
guoshzhao authored
* revise result process interface
* add more comments

Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>
-
- 22 Mar, 2021 1 commit
-
-
guoshzhao authored
Benchmarks: Add Feature - Add benchmark finish check according to num_warmup/num_steps and duration in ModelBenchmark class. (#25)

* add is_finished function
* reuse current time.

Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>
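A finish check of this kind could look roughly like the sketch below. The exact rule in `ModelBenchmark` is not shown in this log, so the logic here (stop once warmup plus measured steps are done, or once the time budget is spent) and the function signature are assumptions.

```python
def is_finished(curr_step, curr_time, start_time, num_warmup, num_steps, duration):
    """Return True when the step budget or the time budget is exhausted.

    curr_time is passed in by the caller so that one timestamp taken per
    step can be reused for both timing and the finish check.
    """
    total_steps = num_warmup + num_steps
    # Step-based stop: all warmup and measured steps have run.
    if total_steps > 0 and curr_step >= total_steps:
        return True
    # Time-based stop: only active when a positive duration (seconds) is given.
    if duration > 0 and (curr_time - start_time) >= duration:
        return True
    return False
```

A training loop would call it once per step with the timestamp it already recorded, for example `is_finished(step, now, start, 2, 8, 60)`.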
-
- 18 Mar, 2021 1 commit
-
-
guoshzhao authored
* add sample_count argument.
* handle more conditions.
* address comments.

Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>
-
- 09 Mar, 2021 2 commits
-
-
guoshzhao authored
* add flag to disable GPU.
* fix spelling
* fix unittest.
* address comments.

Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>
-
guoshzhao authored
Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>
-
- 08 Mar, 2021 1 commit
-
-
guoshzhao authored
* add optimizer definition and function to create torch optimizer.
* move optimizer enum into model_base module.

Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>
-
- 04 Mar, 2021 1 commit
-
-
guoshzhao authored
Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>
-
- 24 Feb, 2021 1 commit
-
-
guoshzhao authored
* benchmarks init.

Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>
-