gaoqiong / lm-evaluation-harness
Commit 057dc2d7
Authored Dec 14, 2023 by Baber Abbasi; committed by GitHub, Dec 13, 2023
`qqp`, `mnli_mismatch`: remove unlabeled test sets (#1114)
* remove unlabeled test sets
* add note to readme
Parent: 9ef853ac
Showing 3 changed files with 1 addition and 2 deletions
lm_eval/tasks/glue/README.md (+1 -0)
lm_eval/tasks/glue/mnli/mismatch.yaml (+0 -1)
lm_eval/tasks/glue/qqp/default.yaml (+0 -1)
lm_eval/tasks/glue/README.md
  # GLUE
+ **NOTE**: GLUE benchmark tasks do not provide publicly accessible labels for their test sets, so we default to the validation sets for all sub-tasks.
  ### Paper
  ...
lm_eval/tasks/glue/mnli/mismatch.yaml
  include: default.yaml
  task: mnli_mismatch
  validation_split: validation_mismatched
- test_split: test_mismatched
lm_eval/tasks/glue/qqp/default.yaml
  ...
@@ -5,7 +5,6 @@ dataset_name: qqp
  output_type: multiple_choice
  training_split: train
  validation_split: validation
- test_split: test
  doc_to_text: "\nSentence 1: {{question1}}\nSentence 2: {{question2}}\nAnswer:"
  doc_to_target: label
  doc_to_choice: ["no", "yes"]
  ...
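The effect of dropping `test_split` can be sketched in a few lines of Python. This is an illustration only: `pick_eval_split` is a hypothetical helper, not a function from the harness, and it assumes the harness evaluates on a configured test split when one exists and otherwise falls back to the validation split. The dictionaries mirror only the split keys from the YAML files in this commit.

```python
from typing import Optional


def pick_eval_split(config: dict) -> Optional[str]:
    """Hypothetical helper: choose the split a task would be scored on.

    Assumption (not confirmed by this commit): a configured test split
    takes priority over the validation split.
    """
    return config.get("test_split") or config.get("validation_split")


# Split keys for qqp before and after this commit.
qqp_before = {
    "training_split": "train",
    "validation_split": "validation",
    "test_split": "test",
}
qqp_after = {
    "training_split": "train",
    "validation_split": "validation",
}

# Split keys for mnli_mismatch after this commit.
mnli_mismatch_after = {"validation_split": "validation_mismatched"}

for name, cfg in [
    ("qqp (before)", qqp_before),
    ("qqp (after)", qqp_after),
    ("mnli_mismatch (after)", mnli_mismatch_after),
]:
    print(f"{name}: evaluates on {pick_eval_split(cfg)!r}")
```

Under that assumption, the old `qqp` config would have selected the unlabeled `test` split, which is why removing the key matters: with it gone, both tasks resolve to their labeled validation splits.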