Commit eb501b56 authored by Le Hou, committed by A. Unique TensorFlower

Add instructions on how to run BERT with configs.

PiperOrigin-RevId: 414577439
# TF-NLP Model Garden
## Introduction
The TF-NLP library provides a collection of scripts for training and
evaluating transformer-based models on tasks such as sentence
classification, question answering, and translation. Additionally, we provide
checkpoints of pretrained models that can be fine-tuned on downstream tasks.
### How to Train Models
Model Garden can be installed with pip
(`pip install tf-models-nightly`). After installation, see
[these instructions](https://github.com/tensorflow/models/blob/master/official/nlp/docs/train.md)
for how to train models with this codebase.
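A minimal install-and-check sequence is sketched below; the `official.nlp` import name is our assumption about how the package is exposed after installation:

```shell
pip install tf-models-nightly
# Sanity check (assumption: the library is importable as the `official` package).
python3 -c "import official.nlp"
```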
## Available Tasks
There are two available experiment configs (we will add more) under
`configs/experiments/`:
| Dataset | Task | Config | Example command |
| ----------------- | ------------------------ | ------- | ---- |
| GLUE/MNLI-matched | bert/sentence_prediction | [glue_mnli_matched.yaml](https://github.com/tensorflow/models/blob/master/official/nlp/configs/experiments/glue_mnli_matched.yaml) | <details> <summary>finetune BERT-base on this task</summary> PARAMS=runtime.distribution_strategy=mirrored<br/>PARAMS=${PARAMS},task.train_data.input_path=/path-to-your-training-data/<br/>PARAMS=${PARAMS},task.hub_module_url=https://tfhub.dev/tensorflow/bert_en_uncased_L-12_H-768_A-12/4<br/><br/>python3 train.py \\<br/> --experiment=bert/sentence_prediction \\<br/> --mode=train \\<br/> --model_dir=/a-folder-to-hold-checkpoints-and-logs/ \\<br/> --config_file=configs/models/bert_en_uncased_base.yaml \\<br/> --config_file=configs/experiments/glue_mnli_matched.yaml \\<br/> --params_override=${PARAMS}</details> |
| SQuAD v1.1 | bert/squad | [squad_v1.yaml](https://github.com/tensorflow/models/blob/master/official/nlp/configs/experiments/squad_v1.yaml) | <details> <summary>finetune BERT-base on this task</summary> PARAMS=runtime.distribution_strategy=mirrored<br/>PARAMS=${PARAMS},task.train_data.input_path=/path-to-your-training-data/<br/>PARAMS=${PARAMS},task.hub_module_url=https://tfhub.dev/tensorflow/bert_en_uncased_L-12_H-768_A-12/4<br/><br/>python3 train.py \\<br/> --experiment=bert/squad \\<br/> --mode=train \\<br/> --model_dir=/a-folder-to-hold-checkpoints-and-logs/ \\<br/> --config_file=configs/models/bert_en_uncased_base.yaml \\<br/> --config_file=configs/experiments/squad_v1.yaml \\<br/> --params_override=${PARAMS}</details> |
One example of how to use these config files: to work on the SQuAD
question answering task, set
`--config_file=configs/experiments/squad_v1.yaml` and
`--experiment=bert/squad`
as arguments to `train.py`.
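Putting this together with a model config and your data path, a complete fine-tuning command looks like the one below (taken from the BERT-base row of the table above; all paths are placeholders):

```shell
# Build the comma-separated override string incrementally.
PARAMS=runtime.distribution_strategy=mirrored
PARAMS=${PARAMS},task.train_data.input_path=/path-to-your-training-data/
PARAMS=${PARAMS},task.hub_module_url=https://tfhub.dev/tensorflow/bert_en_uncased_L-12_H-768_A-12/4

python3 train.py \
  --experiment=bert/squad \
  --mode=train \
  --model_dir=/a-folder-to-hold-checkpoints-and-logs/ \
  --config_file=configs/models/bert_en_uncased_base.yaml \
  --config_file=configs/experiments/squad_v1.yaml \
  --params_override=${PARAMS}
```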
## Available Model Configs
There are two available model configs (we will add more) under
`configs/models/`:
| Model | Config | Pretrained checkpoint & Vocabulary | TF-HUB SavedModel | Example command |
| ------------ | ------- | ---------------------------------- | ----------------- | --------------- |
| BERT-base | [bert_en_uncased_base.yaml](https://github.com/tensorflow/models/blob/master/official/nlp/configs/models/bert_en_uncased_base.yaml) | [uncased_L-12_H-768_A-12](https://storage.googleapis.com/tf_model_garden/nlp/bert/v3/uncased_L-12_H-768_A-12.tar.gz) | [uncased_L-12_H-768_A-12](https://tfhub.dev/tensorflow/bert_en_uncased_L-12_H-768_A-12/) | <details> <summary>finetune on SQuAD v1.1</summary> PARAMS=runtime.distribution_strategy=mirrored<br/>PARAMS=${PARAMS},task.train_data.input_path=/path-to-your-training-data/<br/>PARAMS=${PARAMS},task.hub_module_url=https://tfhub.dev/tensorflow/bert_en_uncased_L-12_H-768_A-12/4<br/><br/>python3 train.py \\<br/> --experiment=bert/squad \\<br/> --mode=train \\<br/> --model_dir=/a-folder-to-hold-checkpoints-and-logs/ \\<br/> --config_file=configs/models/bert_en_uncased_base.yaml \\<br/> --config_file=configs/experiments/squad_v1.yaml \\<br/> --params_override=${PARAMS}</details> |
| ALBERT-base | [albert_base.yaml](https://github.com/tensorflow/models/blob/master/official/nlp/configs/models/albert_base.yaml) | [albert_en_base](https://storage.googleapis.com/tf_model_garden/nlp/albert/albert_base.tar.gz) | [albert_en_base](https://tfhub.dev/tensorflow/albert_en_base/3) | <details> <summary>finetune on SQuAD v1.1</summary> PARAMS=runtime.distribution_strategy=mirrored<br/>PARAMS=${PARAMS},task.train_data.input_path=/path-to-your-training-data/<br/>PARAMS=${PARAMS},task.hub_module_url=https://tfhub.dev/tensorflow/albert_en_base/3<br/><br/>python3 train.py \\<br/> --experiment=bert/squad \\<br/> --mode=train \\<br/> --model_dir=/a-folder-to-hold-checkpoints-and-logs/ \\<br/> --config_file=configs/models/albert_base.yaml \\<br/> --config_file=configs/experiments/squad_v1.yaml \\<br/> --params_override=${PARAMS}</details> |
One example of how to use these config files: to train an ALBERT-base
model, set `--config_file=configs/models/albert_base.yaml` as an argument to
`train.py`.
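Spelled out as a full command (from the ALBERT row in the table above; paths are placeholders), note that `--config_file` appears twice: the model config chooses the architecture and the experiment config chooses the task. We assume, as is common for such multi-value flags, that the files are applied in order:

```shell
# Only the training-data path is overridden here; add more entries as needed.
PARAMS=task.train_data.input_path=/path-to-your-training-data/

python3 train.py \
  --experiment=bert/squad \
  --mode=train \
  --model_dir=/a-folder-to-hold-checkpoints-and-logs/ \
  --config_file=configs/models/albert_base.yaml \
  --config_file=configs/experiments/squad_v1.yaml \
  --params_override=${PARAMS}
```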
## Useful links
*   [How to Train Models](https://github.com/tensorflow/models/blob/master/official/nlp/docs/train.md)
*   [List of Pretrained Models](https://github.com/tensorflow/models/blob/master/official/nlp/docs/pretrained_models.md)
*   [How to Publish Models](https://github.com/tensorflow/models/blob/master/official/nlp/docs/tfhub.md)
*   [TensorFlow blog on Model Garden](https://blog.tensorflow.org/2020/03/introducing-model-garden-for-tensorflow-2.html)
In addition, experiment configuration can be further overridden by
`--params_override`, for example:

```shell
--params_override=task.train_data.input_path=/some/path,task.hub_module_url=/some/tfhub
```
## Run locally on GPUs
An example command for training a model on local GPUs is shown below. This
command trains a BERT-base model on GLUE/MNLI-matched, which is a sentence
prediction task.
```shell
PARAMS=runtime.distribution_strategy=mirrored  # Train on GPUs.
PARAMS=${PARAMS},task.train_data.input_path=/path-to-your-training-data/

python3 train.py \
  --experiment=bert/sentence_prediction \
  --mode=train \
  --model_dir=/a-folder-to-hold-checkpoints-and-logs/ \
  --config_file=configs/models/bert_en_uncased_base.yaml \
  --config_file=configs/experiments/glue_mnli_matched.yaml \
  --params_override=${PARAMS}
```
Note that you can override any configuration entry by appending
it to the `PARAMS` variable. For example, if you want to initialize from a
pretrained checkpoint (instead of random initialization):
```shell
PARAMS=${PARAMS},task.hub_module_url=https://tfhub.dev/tensorflow/bert_en_uncased_L-12_H-768_A-12/4
```
The configuration entry `task.hub_module_url` takes the URL of an officially
pretrained TF-Hub model. See
[List of Pretrained Models](https://github.com/tensorflow/models/blob/master/official/nlp/docs/pretrained_models.md)
for the complete list of pretrained models on TF-Hub. When initializing from a
pretrained model, the encoder architecture of the pretrained model will be used
and the encoder architecture you set in the config
(`configs/models/bert_en_uncased_base.yaml` in this case) will be ignored.
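If you have a downloaded checkpoint (e.g. the tarball linked in the model table above) instead of a TF-Hub module, a sketch of the equivalent override is below. Both `task.init_checkpoint` as the entry name and `bert_model.ckpt` as the checkpoint prefix are assumptions; verify them against the codebase and your extracted files:

```shell
# Assumption: initialize from a locally extracted checkpoint instead of TF-Hub.
# Adjust the path and checkpoint prefix to match your download.
PARAMS=${PARAMS},task.init_checkpoint=/path-to-extracted-checkpoint/bert_model.ckpt
```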
You can change `--mode=train` to `--mode=train_and_eval` if you want to see
evaluation results, but you then also need to specify the path to the
evaluation data by setting `task.validation_data.input_path` in `PARAMS`.
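For example (the path is a placeholder for your own evaluation data):

```shell
PARAMS=${PARAMS},task.validation_data.input_path=/path-to-your-eval-data/
```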
## Run on Cloud TPUs
Next, we will describe how to run [train.py](https://github.com/tensorflow/models/blob/master/official/nlp/train.py) on Cloud TPUs.