Unverified Commit 605964ad authored by Hailey Schoelkopf's avatar Hailey Schoelkopf Committed by GitHub
Browse files

Update README.md

parent 0fe7e8eb
...@@ -24,7 +24,7 @@ Boxes should be checked iff tasks are implemented in the refactor and tested for ...@@ -24,7 +24,7 @@ Boxes should be checked iff tasks are implemented in the refactor and tested for
- [x] HellaSwag - [x] HellaSwag
- [ ] SWAG (WIP) - [ ] SWAG (WIP)
- [x] OpenBookQA - [x] OpenBookQA
- [ ] SQuADv2 - [ ] SQuADv2 (WIP)
- [ ] RACE (WIP) - [ ] RACE (WIP)
- [ ] HeadQA - [ ] HeadQA
- [ ] MathQA - [ ] MathQA
...@@ -35,7 +35,7 @@ Boxes should be checked iff tasks are implemented in the refactor and tested for ...@@ -35,7 +35,7 @@ Boxes should be checked iff tasks are implemented in the refactor and tested for
- [ ] Hendrycks Ethics - [ ] Hendrycks Ethics
- [ ] TruthfulQA - [ ] TruthfulQA
- [ ] MuTual - [ ] MuTual
- [ ] Hendrycks Math - [ ] Hendrycks Math (WIP)
- [ ] Asdiv - [ ] Asdiv
- [ ] GSM8k - [ ] GSM8k
- [ ] Arithmetic (WIP) - [ ] Arithmetic (WIP)
...@@ -45,6 +45,8 @@ Boxes should be checked iff tasks are implemented in the refactor and tested for ...@@ -45,6 +45,8 @@ Boxes should be checked iff tasks are implemented in the refactor and tested for
- [x] ~~Pile (perplexity)~~ - [x] ~~Pile (perplexity)~~
- [ ] BLiMP - [ ] BLiMP
- [ ] ToxiGen - [ ] ToxiGen
- [ ] StoryCloze
- [ ] NaturalQs
- [ ] CrowS-Pairs - [ ] CrowS-Pairs
- [ ] XCopa - [ ] XCopa
- [ ] BIG-Bench - [ ] BIG-Bench
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment