sampling bug fix in diffusers tutorial "basic_training.md" (#8223)

sampling bug fix in basic_training.md In the diffusers basic training tutorial, setting the manual seed argument (generator=torch.manual_seed(config.seed)) in the pipeline call inside evaluate() function rewinds the dataloader shuffling, leading to overfitting due to the model seeing same sequence of training examples after every evaluation call. Using generator=torch.Generator(device='cpu').manual_seed(config.seed) avoids this.

sampling bug fix in diffusers tutorial "basic_training.md" (#8223)
sampling bug fix in basic_training.md In the diffusers basic training tutorial, setting the manual seed argument (generator=torch.manual_seed(config.seed)) in the pipeline call inside evaluate() function rewinds the dataloader shuffling, leading to overfitting due to the model seeing same sequence of training examples after every evaluation call. Using generator=torch.Generator(device='cpu').manual_seed(config.seed) avoids this.
1096f88e · Yue Wu · GitHub · cef4a512 · 1096f88e
Unverified Commit 1096f88e authored May 24, 2024 by Yue Wu Committed by GitHub May 24, 2024
Show whitespace changes
Inline Side-by-side

Showing with 1 addition and 1 deletion

docs/source/en/tutorials/basic_training.md docs/source/en/tutorials/basic_training.md +1 -1

No files found.
--- a/docs/source/en/tutorials/basic_training.md
+++ b/docs/source/en/tutorials/basic_training.md
@@ -260,7 +260,7 @@ Then, you'll need a way to evaluate the model. For evaluation, you can use the [
 ...     # The default pipeline output type is `List[PIL.Image]`
 ...     images = pipeline(
 ...         batch_size=config.eval_batch_size,
-...         generator=torch.manual_seed(config.seed),
+...         generator=torch.Generator(device='cpu').manual_seed(config.seed), # Use a separate torch generator to avoid rewinding the random state of the main training loop
 ...     ).images
 ...     # Make a grid out of the images