
bert-large_infer-pytorch

This example fine-tunes the BERT Whole Word Masking uncased model on the SQuAD dataset, using distributed training on 8 DCUs, to reach an F1 score above 90 on SQuAD.
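For context on the F1 target, the sketch below shows how a SQuAD-style token-overlap F1 is commonly computed. It is a minimal illustration modeled on the SQuAD v1.1 evaluation procedure, not the evaluation code shipped with this repository. The function names `normalize_answer`, `f1_score`, and `corpus_f1` are chosen here for illustration.

```python
import re
import string
from collections import Counter


def normalize_answer(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    punctuation = set(string.punctuation)
    text = "".join(ch for ch in text if ch not in punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def f1_score(prediction: str, ground_truth: str) -> float:
    """Token-level F1 between one predicted answer and one reference answer."""
    pred_tokens = normalize_answer(prediction).split()
    gold_tokens = normalize_answer(ground_truth).split()
    # Multiset intersection counts each shared token at most as often
    # as it appears in both answers.
    common = Counter(pred_tokens) & Counter(gold_tokens)
    num_same = sum(common.values())
    if num_same == 0:
        return 0.0
    precision = num_same / len(pred_tokens)
    recall = num_same / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)


def corpus_f1(predictions, references):
    """Mean of the best per-question F1 over all references, scaled to 0-100."""
    scores = [
        max(f1_score(pred, ref) for ref in refs)
        for pred, refs in zip(predictions, references)
    ]
    return 100.0 * sum(scores) / len(scores)
```

Under this definition, "F1 > 90" means that `corpus_f1` over the SQuAD development set exceeds 90, where each prediction is scored against the closest of its reference answers.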