Unverified commit 20d00ab1 authored by Tomasz Grel, committed by GitHub

Do not unscale the gradients if the loss scale equals 1 (#748)

* Do not unscale the gradients if the loss scale equals 1

* Skip unscaling for loss scale == 1 only when static scaling is used
parent 5633f6db
@@ -92,6 +92,14 @@ def lazy_init_with_master_weights(self):

 def post_backward_models_are_masters(scaler, params, stashed_grads, scale_override=None):
     grads_have_scale, stashed_have_scale, out_scale = scaler.loss_scale(), 1.0, 1.0
+
+    # not much to do if scale == 1.0 and static scaling
+    if scaler.loss_scale() == 1.0 and not scaler.dynamic:
+        # Clear the stash.
+        for i in range(len(stashed_grads)):
+            stashed_grads[i] = None
+        return
+
     if scale_override is not None:
         grads_have_scale, stashed_have_scale, out_scale = scale_override
...
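For context, here is a minimal standalone sketch of the optimization this commit applies. It is not apex's actual implementation; the StaticScaler class and the unscale_grads helper are hypothetical stand-ins. The point it illustrates: when the loss scale is statically fixed at 1.0, dividing every gradient by the scale is a no-op, so the per-parameter unscaling pass can be skipped entirely.

import torch

class StaticScaler:
    """Hypothetical stand-in for a static (non-dynamic) loss scaler."""
    def __init__(self, loss_scale=1.0):
        self._loss_scale = loss_scale
        self.dynamic = False  # static scaling: the scale never changes at runtime

    def loss_scale(self):
        return self._loss_scale

def unscale_grads(scaler, params):
    # With a static scale of exactly 1.0, multiplying every gradient by
    # 1/scale changes nothing, so the whole pass (and its kernel launches)
    # can be skipped, mirroring the early return added in this commit.
    if scaler.loss_scale() == 1.0 and not scaler.dynamic:
        return
    inv_scale = 1.0 / scaler.loss_scale()
    for p in params:
        if p.grad is not None:
            p.grad.mul_(inv_scale)

# Usage: with a static scale of 1.0 the gradients are left untouched.
p = torch.nn.Parameter(torch.ones(3))
p.grad = torch.ones(3)
unscale_grads(StaticScaler(1.0), [p])
assert torch.equal(p.grad, torch.ones(3))

For dynamic scaling the early return is deliberately not taken, since a dynamic scaler may still need to inspect the gradients (e.g. for overflow) even when the current scale happens to be 1.0.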