- 13 Sep, 2021 1 commit
-
-
Yuting Jiang authored
**Description** Add barrier before 'destroy_process_group' to resolve the bug due to when multi models in one model benchmark, some processes haven't finished the previous process group while others failed to initialize new process group for the next model on rocm4.x when running bert_models. **Major Revision** - Add barrier before 'destroy_process_group'.
-
- 29 Jul, 2021 1 commit
-
-
Yifan Xiong authored
__Description__ Cherry-pick bug fixes from v0.2.1 to main. __Major Revisions__ * Fix bug of VGG models failed on A100 GPU with batch_size=128. * Fix Ansible connection issue when running in localhost. * Update version in packages and docs.
-
- 28 Jun, 2021 1 commit
-
-
guoshzhao authored
* replace torch.optim.AdamW with transformers.AdamW.
-
- 07 Jun, 2021 1 commit
-
-
guoshzhao authored
* Clean up the cache.
-
- 19 May, 2021 1 commit
-
-
Yuting Jiang authored
-
- 12 Apr, 2021 1 commit
-
-
guoshzhao authored
Co-authored-by:Guoshuai Zhao <guzhao@microsoft.com>
-
- 15 Mar, 2021 1 commit
-
-
guoshzhao authored
Co-authored-by:Guoshuai Zhao <guzhao@microsoft.com>
-
- 09 Mar, 2021 2 commits
-
-
guoshzhao authored
* add flag to disable GPU. * fix spelling * fix unittest. * address comments. Co-authored-by:Guoshuai Zhao <guzhao@microsoft.com>
-
guoshzhao authored
Co-authored-by:Guoshuai Zhao <guzhao@microsoft.com>
-
- 08 Mar, 2021 1 commit
-
-
guoshzhao authored
* add pytorch base class * address comments Co-authored-by:Guoshuai Zhao <guzhao@microsoft.com>
-