Commits · b5b1c3dac7831f3568fb6b574f6ade2c7dc5b575 · tsoc / superbenchmark

25 Apr, 2022 1 commit

Bug - Fix bug of duration feature for model benchmarks in distributed mode. (#347) · b5b1c3da

user4543 authored Apr 25, 2022

**Description**
Fix bug of duration feature for model benchmarks in distributed mode.

**Major Revision**
- Add all_reduce to sync the result of is_finished(the function to judge whether the model benchmark should be stopped) in each step 
  - to avoid inconsistency between different ranks to determine duration end (some rank may enter one more step and can never finish)
- Add torch.cuda.synchronize() before and after step time measuring in train_step() for all model benchmarks
  - some operations in train_step() maybe async resulting incorrect step time records (for example, lstm)

b5b1c3da

20 Apr, 2021 1 commit
- Benchmarks: Add Benchmark - Add LSTM model benchmarks. (#60) · 2a7ab691
  guoshzhao authored Apr 20, 2021
```
* Benchmarks: Add Benchmark - Add LSTM model benchmarks.
```
  2a7ab691
16 Apr, 2021 1 commit
- Benchmarks: Add Benchmark - Add GPT2 model benchmark. (#57) · af567cf6
  guoshzhao authored Apr 16, 2021
```
* Benchmarks: Add Benchmark - Add GPT2 model benchmark.
```
  af567cf6
26 Mar, 2021 1 commit

Benchmarks: Add Benchmark - Add Pytorch BERT benchmarks, including bert-base... · 0972b223

guoshzhao authored Mar 26, 2021


Benchmarks: Add Benchmark - Add Pytorch BERT benchmarks, including bert-base and bert-large.   (#20)

* add pytorch bert benchmarks.

* revise code

* fix issue

* revise code.
Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>

0972b223