Fix order of variable update and moving average update

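While the original code works fine in practice, it does not constrain the order in which the gradient application and the moving average update run, so the two ops may execute in either order. The moving averages can therefore be computed from the variable values either before or after the gradient step, which deviates from the mathematical specification. This change makes the moving average update depend on the gradient application, so the averages always see the updated variables.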
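
To illustrate why the order matters, here is a minimal pure-Python sketch, not the tutorial's code. It models one trainable variable and its moving average as plain floats and runs a single training step in both orders. The names `DECAY`, `LEARNING_RATE`, `apply_gradient`, `update_average`, and the two `step_*` functions are hypothetical, and the values are chosen only for illustration. It also assumes a fixed decay, ignoring the `num_updates`-based decay adjustment that `tf.train.ExponentialMovingAverage` applies when `global_step` is passed.

```python
# Hypothetical scalar stand-ins for one trainable variable and its
# shadow value (the moving average). All constants are illustrative.
DECAY = 0.9
LEARNING_RATE = 0.1


def apply_gradient(var, grad):
    """Plain gradient-descent update of a single scalar variable."""
    return var - LEARNING_RATE * grad


def update_average(shadow, var):
    """Exponential moving average update with a fixed decay."""
    return DECAY * shadow + (1.0 - DECAY) * var


def step_gradient_first(var, shadow, grad):
    """Order enforced by this commit: update the variable, then average it."""
    var = apply_gradient(var, grad)
    shadow = update_average(shadow, var)
    return var, shadow


def step_average_first(var, shadow, grad):
    """The other order the original graph allowed: average, then update."""
    shadow = update_average(shadow, var)
    var = apply_gradient(var, grad)
    return var, shadow


if __name__ == "__main__":
    var0, shadow0, grad = 1.0, 1.0, 2.0
    print("gradient first:", step_gradient_first(var0, shadow0, grad))
    print("average first: ", step_average_first(var0, shadow0, grad))
```

Both orders leave the variable at the same value, but the shadow value differs: in the gradient-first order the average already includes the updated variable, while in the average-first order it lags one step behind.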

Commit 720d3363, authored Apr 11, 2018 by Igor Ganichev, committed by GitHub Apr 11, 2018

Showing with 3 additions and 5 deletions

tutorials/image/cifar10/cifar10.py +3 -5

--- a/tutorials/image/cifar10/cifar10.py
+++ b/tutorials/image/cifar10/cifar10.py
@@ -370,12 +370,10 @@ def train(total_loss, global_step):
  # Track the moving averages of all trainable variables.
  variable_averages = tf.train.ExponentialMovingAverage(
      MOVING_AVERAGE_DECAY, global_step)
-  variables_averages_op = variable_averages.apply(tf.trainable_variables())
+  with tf.control_dependencies([apply_gradient_op]):
+    variables_averages_op = variable_averages.apply(tf.trainable_variables())

-  with tf.control_dependencies([apply_gradient_op, variables_averages_op]):
-    train_op = tf.no_op(name='train')
-
-  return train_op
+  return variables_averages_op


 def maybe_download_and_extract():