official/modeling/model_training_utils.py · 11ccb99eae06471822a80beb4ba3f65faedd0fe8 · ModelZoo / ResNet50_tensorflow

Temporarily disable explicit allreduce in BERT SQuAD · 11ccb99e

Zongwei Zhou authored Mar 05, 2020

In BERT SQuAD, disable explicit allreduce for now to keep the original clip_by_global_norm math. With explicit allreduce, the gradients before allreduce are scaled so even if we move clip_by_global_norm before allreduce (as in TF1 and pre-TF 2.2) it will operate on scaled gradients, the math will be changed. So with explicit allreduce, it is better to move clip_by_global_norm to after allreduce.

PiperOrigin-RevId: 299278082

11ccb99e

model_training_utils.py 18.6 KB

Replace model_training_utils.py