"tests/vscode:/vscode.git/clone" did not exist on "0114f4fd79ce8552533b063b5a75ac9c2a3f9b54"
Unverified Commit 070b6b9c authored by Hailey Schoelkopf's avatar Hailey Schoelkopf Committed by GitHub
Browse files

Update README.md

parent 0d8e206c
...@@ -20,7 +20,7 @@ Boxes should be checked iff tasks are implemented in the refactor and tested for ...@@ -20,7 +20,7 @@ Boxes should be checked iff tasks are implemented in the refactor and tested for
- [ ] QA4MRE - [ ] QA4MRE
- [ ] TriviaQA - [ ] TriviaQA
- [x] AI2 ARC - [x] AI2 ARC
- [ ] LogiQA - [ ] LogiQA (WIP)
- [x] HellaSwag - [x] HellaSwag
- [ ] SWAG (WIP) - [ ] SWAG (WIP)
- [x] OpenBookQA - [x] OpenBookQA
...@@ -57,6 +57,7 @@ Boxes should be checked iff tasks are implemented in the refactor and tested for ...@@ -57,6 +57,7 @@ Boxes should be checked iff tasks are implemented in the refactor and tested for
- [ ] MGSM - [ ] MGSM
- [ ] SCROLLS - [ ] SCROLLS
- [ ] JSON Task (reference: https://github.com/EleutherAI/lm-evaluation-harness/pull/481) - [ ] JSON Task (reference: https://github.com/EleutherAI/lm-evaluation-harness/pull/481)
- [ ] Babi
# Novel Tasks # Novel Tasks
Tasks added in the revamped harness that were not previously available. Again, a strikethrough denotes checking performed *against the original task's implementation or published results introducing the task*. Tasks added in the revamped harness that were not previously available. Again, a strikethrough denotes checking performed *against the original task's implementation or published results introducing the task*.
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment