Unverified commit cdd12bde authored by RandomGamingDev, committed by GitHub

Added Code for Gradient Accumulation to work for basic_training (#8961)

Added a line allowing gradient accumulation to work in the basic_training example.
parent 2c25b98c
@@ -340,6 +340,7 @@ Now you can wrap all these components together in a training loop with 🤗 Acce
 ...         loss = F.mse_loss(noise_pred, noise)
 ...         accelerator.backward(loss)
+...         if (step + 1) % config.gradient_accumulation_steps == 0:
 ...             accelerator.clip_grad_norm_(model.parameters(), 1.0)
 ...             optimizer.step()
 ...             lr_scheduler.step()
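For context, gradient accumulation runs several micro-batches between optimizer updates, so the effective batch size grows without extra memory; the guard added above clips, steps, and advances the scheduler only once every `config.gradient_accumulation_steps` steps. Below is a minimal, self-contained sketch of that pattern in plain PyTorch. The linear model, random data, and hyperparameters are illustrative stand-ins (the tutorial trains a UNet through Accelerate); only the modulo guard mirrors the committed change.

    # Minimal gradient-accumulation sketch in plain PyTorch (Accelerate left
    # out to isolate the logic). Model, data, and hyperparameters below are
    # illustrative, not the tutorial's.
    import torch
    import torch.nn.functional as F

    torch.manual_seed(0)
    model = torch.nn.Linear(8, 1)                   # stand-in for the UNet
    optimizer = torch.optim.AdamW(model.parameters(), lr=1e-3)
    gradient_accumulation_steps = 4                 # mirrors config.gradient_accumulation_steps

    # Dummy micro-batches of (inputs, targets).
    batches = [(torch.randn(16, 8), torch.randn(16, 1)) for _ in range(12)]

    optimizer.zero_grad()
    for step, (inputs, targets) in enumerate(batches):
        loss = F.mse_loss(model(inputs), targets)
        # Scale the loss so the accumulated gradient is the average over the
        # window, matching what one large batch would produce.
        (loss / gradient_accumulation_steps).backward()

        # Only every `gradient_accumulation_steps` micro-batches: clip, step,
        # and reset the gradients -- the same guard the commit adds.
        if (step + 1) % gradient_accumulation_steps == 0:
            torch.nn.utils.clip_grad_norm_(model.parameters(), 1.0)
            optimizer.step()
            optimizer.zero_grad()

With 12 micro-batches and an accumulation window of 4, the loop above performs exactly 3 optimizer updates, each computed from gradients averaged over 4 micro-batches.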