superbench/benchmarks/context.py · 5197cdf5cb305f053c20e48e69cec5efa36871ca · tsoc / superbenchmark

Benchmarks - Support FP8 in BERT models (#446) · 5197cdf5

Yifan Xiong authored Jan 04, 2023

Support FP8 in PyTorch BERT models:

* add fp8 hybrid/e4m3/e5m2 in precision arguments
* build BERT encoders with `te.TransformerLayer` to repalce
`transformers.BertModel`
* wrap forward steps with fp8 autocast

5197cdf5

context.py 2.94 KB

Replace context.py