README.md 1.45 KB
Newer Older
haileyschoelkopf's avatar
haileyschoelkopf committed
1
2
# Eval Harness Documentation

lintangsutawika's avatar
lintangsutawika committed
3
Welcome to the docs for the LM Evaluation Harness!
haileyschoelkopf's avatar
haileyschoelkopf committed
4
5
6

## Table of Contents

7
* To learn about the public interface of the library, as well as how to evaluate via the commandline or as integrated into an external library, see the [Interface](https://github.com/EleutherAI/lm-evaluation-harness/blob/big-refactor/docs/user_guide.md)
haileyschoelkopf's avatar
haileyschoelkopf committed
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
* To learn how to add a new library, API, or model type to the library, as well as a quick explainer on the types of ways to evaluate an LM, see the [Model Guide](https://github.com/EleutherAI/lm-evaluation-harness/blob/big-refactor/docs/model_guide.md).
* For a crash course on adding new tasks to the library, see our [New Task Guide](https://github.com/EleutherAI/lm-evaluation-harness/blob/big-refactor/docs/new_task_guide.md).
* To learn more about pushing the limits of task configuration that the Eval Harness supports, see the [Advanced Task Guide](https://github.com/EleutherAI/lm-evaluation-harness/blob/big-refactor/docs/advanced_task_guide.md).

## Progress on Revamp

Tracking progress on revamping documentation pages for the refactor of LM-Evaluation-Harness.

### Desired Pages

* [ ] YAML explainer
  * [ ] Explainer on filters + advanced features
  * [ ] Walkthrough start-to-finish of adding a new task to codebase
* [ ] Explaining registries + decorators
* [ ] model_guide.md for adding new model API
  * [ ] guide to writing an adapter to new advanced codebase (e.g. NeoX)
lintangsutawika's avatar
lintangsutawika committed
24
* [ ] Parallelism guide (?)