Speedup training by using numpy instead of jnp for batch shuffling (#15963)

Speedup training by using numpy instead of jnp for batch shuffling Co-authored-by: Yeb Havinga <y.t.havinga@mgrid.net>

Speedup training by using numpy instead of jnp for batch shuffling (#15963)
Speedup training by using numpy instead of jnp for batch shuffling Co-authored-by: Yeb Havinga <y.t.havinga@mgrid.net>
91fb62d0 · Yeb Havinga · GitHub · ea07064a · 91fb62d0
Unverified Commit 91fb62d0 authored Mar 08, 2022 by Yeb Havinga Committed by GitHub Mar 08, 2022
Show whitespace changes
Inline Side-by-side

Showing with 1 addition and 1 deletion

examples/flax/language-modeling/run_t5_mlm_flax.py examples/flax/language-modeling/run_t5_mlm_flax.py +1 -1

No files found.
--- a/examples/flax/language-modeling/run_t5_mlm_flax.py
+++ b/examples/flax/language-modeling/run_t5_mlm_flax.py
@@ -810,7 +810,7 @@ def main():
        # Generate an epoch by shuffling sampling indices from the train dataset
        num_train_samples = len(tokenized_datasets["train"])
-        train_samples_idx = jax.random.permutation(input_rng, jnp.arange(num_train_samples))
+        train_samples_idx = np.random.permutation(np.arange(num_train_samples))
        train_batch_idx = generate_batch_splits(train_samples_idx, train_batch_size)
        # Gather the indexes for creating the batch and do a training step