- 22 Mar, 2022 1 commit
-
-
user4543 authored
**Description** Remove fp16 samples type converting time for training cnn and lstm inference.
-
- 17 Mar, 2022 1 commit
-
-
user4543 authored
**Description** Remove fp16 samples type converting time for cnn and lstm models.
-
- 06 Mar, 2022 1 commit
-
-
Jeff Daily authored
**Description** The BatchNorm operator is not numerically stable in fp16. PyTorch documentation recommends to keep the BN op in fp32 for fp16 AMP models. Refer to https://pytorch.org/docs/stable/amp.html#ops-that-can-autocast-to-float32. Preserving BN in fp32 for superbench more accurately reflects real workloads.
-
- 16 Jun, 2021 1 commit
-
-
Yifan Xiong authored
Fix bugs and refine log in single GPU benchmarks: * Fix none framework issue * Fix empty parameter bug * Remove missed mobilenet_v3 models * Change benchmark registration log to debug level * Add pid in logging * Add missing benchmarks in default config * Fix deprecated logging warn
-
- 26 Apr, 2021 1 commit
-
-
guoshzhao authored
-
- 20 Apr, 2021 1 commit
-
-
guoshzhao authored
* Benchmarks: Add Benchmark - Add CNN model benchmarks.
-
- 16 Apr, 2021 1 commit
-
-
guoshzhao authored
Benchmarks: Code Revision - Fix some issue for BERT benchmark. (#58)
-
- 26 Mar, 2021 1 commit
-
-
guoshzhao authored
Benchmarks: Add Benchmark - Add Pytorch BERT benchmarks, including bert-base and bert-large. (#20) * add pytorch bert benchmarks. * revise code * fix issue * revise code. Co-authored-by:Guoshuai Zhao <guzhao@microsoft.com>
-