Unverified commit 9d030712, authored by Hailey Schoelkopf, committed by GitHub

Merge pull request #1020 from baberabb/big-refactor-readme

[Refactor] Update README
Parents: 50107e10 4fd6a731
@@ -194,7 +194,7 @@ python -m lm_eval \
    --check_integrity
```
## Advanced Usage Tips
For models loaded with the HuggingFace `transformers` library, any arguments provided via `--model_args` get passed to the relevant constructor directly. This means that anything you can do with `AutoModel` can be done with our library. For example, you can pass a local path via `pretrained=` or use models finetuned with [PEFT](https://github.com/huggingface/peft) by taking the call you would run to evaluate the base model and adding `,peft=PATH` to the `model_args` argument:
```bash
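# The diff elides the example command here; a minimal sketch, assuming the
# `hf` model type (model name and task are illustrative, and PATH stands in
# for your PEFT adapter path as described above):
python -m lm_eval \
    --model hf \
    --model_args pretrained=EleutherAI/gpt-j-6b,peft=PATH \
    --tasks hellaswag
```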
@@ -216,6 +216,11 @@ python -m lm_eval \
We support wildcards in task names, for example, you can run all of the machine-translated lambada tasks via `--task lambada_openai_mt_*`.
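As a minimal sketch of a wildcard run (the model name is illustrative; the pattern is quoted so the shell does not expand it against local filenames):

```bash
# Run every task whose name matches the wildcard pattern; lm_eval resolves
# the pattern against its registered task names, not the filesystem.
python -m lm_eval \
    --model hf \
    --model_args pretrained=EleutherAI/pythia-160m \
    --tasks "lambada_openai_mt_*"
```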
To save evaluation results, provide an `--output_path`. We also support logging model responses with the `--log_samples` flag for post-hoc analysis.
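For instance, a hedged sketch (model, task, and output directory are illustrative):

```bash
# Write aggregate results under results/pythia-160m and, with --log_samples,
# also save the per-document model responses for post-hoc analysis.
python -m lm_eval \
    --model hf \
    --model_args pretrained=EleutherAI/pythia-160m \
    --tasks hellaswag \
    --output_path results/pythia-160m \
    --log_samples
```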
Additionally, one can provide a directory with `--use_cache` to cache the results of prior runs. This allows you to avoid repeated execution of the same (model, task) pairs for re-scoring.
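A sketch of a cached run (the cache location is illustrative); repeating the same command later reuses the cached responses rather than re-running the requests:

```bash
# Cache model responses under lm_cache so an identical (model, task) pair
# can later be re-scored without re-executing the requests.
python -m lm_eval \
    --model hf \
    --model_args pretrained=EleutherAI/pythia-160m \
    --tasks hellaswag \
    --use_cache lm_cache
```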
For a full list of supported arguments, check out the [interface](https://github.com/EleutherAI/lm-evaluation-harness/blob/big-refactor/docs/interface.md) guide in our documentation!
## How to Contribute or Learn More?