@@ -211,9 +211,8 @@ To evaluate a model, (e.g. GPT-2) on NLU tasks (e.g. RTE, Winograd Scheme Challe
```bash
python main.py \
--model gpt2 \
--model_argsdevice=cuda:0 \
--tasks rte,wsc \
--provide_description\
--device cuda:0 \
--tasks lambada,hellaswag \
--num_fewshot 2
```
...
...
@@ -223,11 +222,23 @@ If you have access to an OpenAI API key, you can also evaluate GPT-3 on various
export OPENAI_API_SECRET_KEY=YOUR_KEY_HERE
python main.py \
--model gpt3 \
--tasksrte,wsc\
--taskslambada,hellaswag\
--provide_description\
--num_fewshot 2
```
Additional arguments can be provided to the model constructor using the `--model_args` flag. Most importantly, the `gpt2` model can be used to load an arbitrary HuggingFace model as follows:
```bash
python main.py \
--model gpt2 \
--model_argspretrained=EleutherAI/gpt-neo-1.3B \
--device cuda:0 \
--tasks lambada,hellaswag \
--num_fewshot 2
```
To inspect what the LM inputs look like, you can run the following command: