"megatron/git@developer.sourcefind.cn:OpenDAS/megatron-lm.git" did not exist on "cfe2c2be5d4e384b4efd9c2f2266edd68876b34e"
Commit 841bf60b authored by Hongkun Yu, committed by A. Unique TensorFlower

Update readme to emphasize 'global batch size'.

PiperOrigin-RevId: 269376599
parent c21bec54
@@ -96,6 +96,11 @@ tensorboard --logdir=$MODEL_DIR
 Users need to adjust `batch_size` and `num_gpus` to get good performance
 running multiple GPUs.
 
+**Note that:**
+when using multiple GPUs or TPUs, `batch_size` is the global batch size for
+all devices. For example, if the batch size is `4096*4` and there are 4
+devices, each device will take 4096 tokens as a batch budget.
+
 Command to run:
 ```
 python3 transformer_main.py --data_dir=$DATA_DIR --model_dir=$MODEL_DIR \
...
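
To make the note above concrete, here is a minimal sketch of the per-device arithmetic. It is not part of the commit, and the names `global_batch_size`, `num_devices`, and `per_device_batch_size` are illustrative rather than actual variables from `transformer_main.py`:

```
# Illustrative only: how a global batch budget is split evenly across devices.
global_batch_size = 4096 * 4  # the value passed via --batch_size
num_devices = 4               # e.g. 4 GPUs (--num_gpus=4)

# Each replica processes an equal share of the global batch every step.
per_device_batch_size = global_batch_size // num_devices
print(per_device_batch_size)  # 4096 tokens per device per step
```

The same split applies to TPU cores: the flag always describes the total work per step, so scaling up the device count without raising `batch_size` shrinks each replica's share.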