Unverified Commit 884ba785 authored by Hailey Schoelkopf's avatar Hailey Schoelkopf Committed by GitHub
Browse files

Update README.md

parent 71305a78
...@@ -24,7 +24,6 @@ Boxes should be checked iff tasks are implemented in the refactor and tested for ...@@ -24,7 +24,6 @@ Boxes should be checked iff tasks are implemented in the refactor and tested for
- [x] HellaSwag - [x] HellaSwag
- [x] SWAG - [x] SWAG
- [x] OpenBookQA - [x] OpenBookQA
- [ ] SQuADv2
- [x] RACE - [x] RACE
- [ ] LogiQA (WIP) - [ ] LogiQA (WIP)
- [x] HellaSwag - [x] HellaSwag
...@@ -42,15 +41,15 @@ Boxes should be checked iff tasks are implemented in the refactor and tested for ...@@ -42,15 +41,15 @@ Boxes should be checked iff tasks are implemented in the refactor and tested for
- [ ] TruthfulQA - [ ] TruthfulQA
- [ ] MuTual - [ ] MuTual
- [ ] Hendrycks Math (WIP) - [ ] Hendrycks Math (WIP)
- [ ] Asdiv - [ ] Asdiv (WIP)
- [ ] GSM8k - [ ] GSM8k
- [x] Arithmetic - [x] Arithmetic
- [ ] MMMLU - [ ] MMMLU
- [ ] Translation (WMT) suite - [ ] Translation (WMT) suite
- [ ] Unscramble - [ ] Unscramble (WIP)
- [x] ~~Pile (perplexity)~~ - [x] ~~Pile (perplexity)~~
- [ ] BLiMP - [ ] BLiMP
- [ ] ToxiGen - [ ] ToxiGen (WIP)
- [ ] StoryCloze - [ ] StoryCloze
- [ ] NaturalQs - [ ] NaturalQs
- [ ] CrowS-Pairs - [ ] CrowS-Pairs
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment