Added a --reduce_memory option to the training script to keep training

data on disc as a memmap rather than in memory

Added a --reduce_memory option to the training script to keep training
data on disc as a memmap rather than in memory
06a30cfd · Matthew Carrigan · 7d1ae644 · 06a30cfd
Commit 06a30cfd authored Mar 21, 2019 by Matthew Carrigan
Show whitespace changes
Inline Side-by-side

Showing with 3 additions and 1 deletion

examples/lm_finetuning/README.md examples/lm_finetuning/README.md +3 -1

No files found.
--- a/examples/lm_finetuning/README.md
+++ b/examples/lm_finetuning/README.md
@@ -58,7 +58,9 @@ recent GPUs. `--max_seq_len` defaults to 128 but can be set as high as 512.
 Higher values may yield stronger language models at the cost of slower and more memory-intensive training
 In addition, if memory usage is an issue, especially when training on a single GPU, reducing `--train_batch_size` from
-the default 32 to a lower number (4-16) can be helpful.
+the default 32 to a lower number (4-16) can be helpful. There is also a `--reduce_memory` option for both the
+`pregenerate_training_data.py` and `finetune_on_pregenerated.py` scripts that spills data to disc in shelf objects
+or numpy memmaps rather than retaining it in memory, which hugely reduces memory usage with little performance impact.
 ###Examples
 #####Simple fine-tuning