Benchmarks - Support TE FP8 in BERT/GPT2 models (#496)
Support Transformer Engine (TE) FP8 in the existing PyTorch BERT/GPT2 models by converting their linear and layernorm layers to TE layers.
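The conversion described above can be sketched roughly as follows. This is a minimal illustration, not necessarily the code in this commit: the helper names `replace_children` and `to_te_layer` are hypothetical, and the TE-specific part assumes `transformer_engine.pytorch` is installed and a GPU that TE supports is available.

```python
def replace_children(module, convert):
    """Recursively swap child modules in place.

    `convert(child)` returns a replacement module, or None to keep the
    child and recurse into its own children instead.
    (Hypothetical helper name, used for illustration only.)
    """
    # Materialize the list first so assignment does not disturb iteration.
    for name, child in list(module.named_children()):
        new_child = convert(child)
        if new_child is None:
            replace_children(child, convert)
        else:
            setattr(module, name, new_child)
    return module


def to_te_layer(child):
    """Map torch.nn.Linear / torch.nn.LayerNorm to their TE counterparts.

    Imports are done lazily because Transformer Engine is an optional
    dependency. TE modules are created on the GPU by default.
    """
    import torch
    import transformer_engine.pytorch as te

    if isinstance(child, torch.nn.Linear):
        new = te.Linear(
            child.in_features,
            child.out_features,
            bias=child.bias is not None,
            params_dtype=child.weight.dtype,
        )
    elif isinstance(child, torch.nn.LayerNorm):
        # Assumes normalization over the last dimension only.
        new = te.LayerNorm(
            child.normalized_shape[0],
            eps=child.eps,
            params_dtype=child.weight.dtype,
        )
    else:
        return None  # not a layer we convert; the caller recurses into it

    # Carry over the existing parameters so the model's outputs are preserved.
    with torch.no_grad():
        new.weight.copy_(child.weight)
        if child.bias is not None:
            new.bias.copy_(child.bias)
    return new
```

With a model in hand, `replace_children(model, to_te_layer)` would perform the swap, and the forward pass would then typically run inside `te.fp8_autocast(enabled=True)` so that the TE layers execute in FP8. FP8 kernels place alignment constraints on tensor dimensions, so a real implementation may need to skip layers whose shapes do not qualify.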