-
Pasquale Minervini authored
gradient norm clipping should be done right before calling the optimiser - fixing run_glue and run_ner as well
abd7110e
gradient norm clipping should be done right before calling the optimiser - fixing run_glue and run_ner as well