Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
fdcdc494
Unverified
Commit
fdcdc494
authored
Jul 06, 2023
by
Hailey Schoelkopf
Committed by
GitHub
Jul 06, 2023
Browse files
Update task statuses
parent
94c465fd
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
5 additions
and
5 deletions
+5
-5
lm_eval/tasks/README.md
lm_eval/tasks/README.md
+5
-5
No files found.
lm_eval/tasks/README.md
View file @
fdcdc494
...
@@ -12,15 +12,15 @@ Boxes should be checked iff tasks are implemented in the refactor and tested for
...
@@ -12,15 +12,15 @@ Boxes should be checked iff tasks are implemented in the refactor and tested for
-
[x] ~~Lambada (Multilingual)~~
-
[x] ~~Lambada (Multilingual)~~
-
[x] Wikitext
-
[x] Wikitext
-
[x] PiQA
-
[x] PiQA
-
[
] PROST
(WIP)
-
[
x
] PROST
-
[ ] MCTACO
-
[ ] MCTACO
-
[x] Pubmed QA
-
[x] Pubmed QA
-
[x] SciQ
-
[x] SciQ
-
[ ] QASPER
-
[ ] QASPER
-
[
] QA4MRE
(WIP)
-
[
x
] QA4MRE
-
[ ] TriviaQA
-
[ ] TriviaQA
-
[x] AI2 ARC
-
[x] AI2 ARC
-
[ ] LogiQA
-
[ ] LogiQA
(WIP)
-
[x] HellaSwag
-
[x] HellaSwag
-
[x] SWAG
-
[x] SWAG
-
[x] OpenBookQA
-
[x] OpenBookQA
...
@@ -28,10 +28,10 @@ Boxes should be checked iff tasks are implemented in the refactor and tested for
...
@@ -28,10 +28,10 @@ Boxes should be checked iff tasks are implemented in the refactor and tested for
-
[x] RACE
-
[x] RACE
-
[ ] LogiQA (WIP)
-
[ ] LogiQA (WIP)
-
[x] HellaSwag
-
[x] HellaSwag
-
[
] SWAG
(WIP)
-
[
x
] SWAG
-
[x] OpenBookQA
-
[x] OpenBookQA
-
[ ] SQuADv2 (WIP)
-
[ ] SQuADv2 (WIP)
-
[
] RACE
(WIP)
-
[
x
] RACE
-
[ ] HeadQA (WIP)
-
[ ] HeadQA (WIP)
-
[ ] MathQA
-
[ ] MathQA
-
[ ] WebQs
-
[ ] WebQs
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment