Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
7f69c48b
Unverified
Commit
7f69c48b
authored
Aug 08, 2023
by
Lintang Sutawika
Committed by
GitHub
Aug 08, 2023
Browse files
Update README.md
parent
e4ceb227
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
5 additions
and
5 deletions
+5
-5
lm_eval/tasks/README.md
lm_eval/tasks/README.md
+5
-5
No files found.
lm_eval/tasks/README.md
View file @
7f69c48b
...
@@ -18,7 +18,7 @@ Boxes should be checked iff tasks are implemented in the refactor and tested for
...
@@ -18,7 +18,7 @@ Boxes should be checked iff tasks are implemented in the refactor and tested for
-
[x] SciQ
-
[x] SciQ
-
[ ] QASPER
-
[ ] QASPER
-
[x] QA4MRE
-
[x] QA4MRE
-
[ ] TriviaQA
-
[ ] TriviaQA
(Lintang)
-
[x] AI2 ARC
-
[x] AI2 ARC
-
[x] LogiQA
-
[x] LogiQA
-
[x] HellaSwag
-
[x] HellaSwag
...
@@ -50,12 +50,12 @@ Boxes should be checked iff tasks are implemented in the refactor and tested for
...
@@ -50,12 +50,12 @@ Boxes should be checked iff tasks are implemented in the refactor and tested for
-
[ ] StoryCloze
-
[ ] StoryCloze
-
[ ] NaturalQs
-
[ ] NaturalQs
-
[ ] CrowS-Pairs (Hailey?)
-
[ ] CrowS-Pairs (Hailey?)
-
[ ] XCopa
-
[ ] XCopa
(Lintang)
-
[ ] BIG-Bench (Hailey)
-
[ ] BIG-Bench (Hailey)
-
[ ] XStoryCloze
-
[ ] XStoryCloze
(Lintang)
-
[x] XWinograd
-
[x] XWinograd
-
[ ] PAWS-X
-
[ ] PAWS-X
(Lintang)
-
[ ] XNLI
-
[ ] XNLI
(Lintang)
-
[ ] MGSM
-
[ ] MGSM
-
[ ] SCROLLS
-
[ ] SCROLLS
-
[ ] Babi (Hailey)
-
[ ] Babi (Hailey)
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment