"examples/git@developer.sourcefind.cn:zhaoyu6/sglang.git" did not exist on "69276f619a1f03d35948719ff2460f79279bcc90"
Unverified Commit 15821d20 authored by Lintang Sutawika's avatar Lintang Sutawika Committed by GitHub
Browse files

Merge pull request #745 from EleutherAI/haileyschoelkopf-patch-1

Update README.md
parents 2f53b190 7483a7ea
......@@ -18,9 +18,9 @@ Boxes should be checked iff tasks are implemented in the refactor and tested for
- [x] SciQ
- [ ] QASPER
- [x] QA4MRE
- [ ] TriviaQA
- [ ] TriviaQA (Lintang)
- [x] AI2 ARC
- [ ] LogiQA [(WIP)](https://github.com/EleutherAI/lm-evaluation-harness/pull/711)
- [x] LogiQA
- [x] HellaSwag
- [x] SWAG
- [x] OpenBookQA
......@@ -28,7 +28,7 @@ Boxes should be checked iff tasks are implemented in the refactor and tested for
- [x] RACE
- [x] HeadQA
- [x] MathQA
- [ ] WebQs
- [x] WebQs
- [ ] WSC273
- [x] Winogrande
- [x] ANLI
......@@ -50,15 +50,15 @@ Boxes should be checked iff tasks are implemented in the refactor and tested for
- [ ] StoryCloze
- [ ] NaturalQs
- [x] CrowS-Pairs
- [ ] XCopa
- [ ] BIG-Bench
- [ ] XStoryCloze
- [ ] XCopa (Lintang)
- [ ] BIG-Bench (Hailey)
- [ ] XStoryCloze (Lintang)
- [x] XWinograd
- [ ] PAWS-X
- [ ] XNLI
- [ ] PAWS-X (Lintang)
- [ ] XNLI (Lintang)
- [ ] MGSM
- [ ] SCROLLS
- [ ] Babi
- [x] Babi
# Novel Tasks
Tasks added in the revamped harness that were not previously available. Again, a strikethrough denotes checking performed *against the original task's implementation or published results introducing the task*.
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment