Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
884ba785
Unverified
Commit
884ba785
authored
Jul 07, 2023
by
Hailey Schoelkopf
Committed by
GitHub
Jul 07, 2023
Browse files
Update README.md
parent
71305a78
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
3 additions
and
4 deletions
+3
-4
lm_eval/tasks/README.md
lm_eval/tasks/README.md
+3
-4
No files found.
lm_eval/tasks/README.md
View file @
884ba785
...
...
@@ -24,7 +24,6 @@ Boxes should be checked iff tasks are implemented in the refactor and tested for
-
[x] HellaSwag
-
[x] SWAG
-
[x] OpenBookQA
-
[ ] SQuADv2
-
[x] RACE
-
[ ] LogiQA (WIP)
-
[x] HellaSwag
...
...
@@ -42,15 +41,15 @@ Boxes should be checked iff tasks are implemented in the refactor and tested for
-
[ ] TruthfulQA
-
[ ] MuTual
-
[ ] Hendrycks Math (WIP)
-
[ ] Asdiv
-
[ ] Asdiv
(WIP)
-
[ ] GSM8k
-
[x] Arithmetic
-
[ ] MMMLU
-
[ ] Translation (WMT) suite
-
[ ] Unscramble
-
[ ] Unscramble
(WIP)
-
[x] ~~Pile (perplexity)~~
-
[ ] BLiMP
-
[ ] ToxiGen
-
[ ] ToxiGen
(WIP)
-
[ ] StoryCloze
-
[ ] NaturalQs
-
[ ] CrowS-Pairs
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment