Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
b0597a94
Commit
b0597a94
authored
Jul 14, 2023
by
lintangsutawika
Browse files
adjustments for super_glue
parent
ee633332
Changes
4
Hide whitespace changes
Inline
Side-by-side
Showing
4 changed files
with
5 additions
and
4 deletions
+5
-4
lm_eval/api/task.py
lm_eval/api/task.py
+1
-0
lm_eval/tasks/super_glue/boolq/default.yaml
lm_eval/tasks/super_glue/boolq/default.yaml
+1
-1
lm_eval/tasks/super_glue/multirc/default.yaml
lm_eval/tasks/super_glue/multirc/default.yaml
+1
-1
lm_eval/tasks/super_glue/record/default.yaml
lm_eval/tasks/super_glue/record/default.yaml
+2
-2
No files found.
lm_eval/api/task.py
View file @
b0597a94
...
...
@@ -23,6 +23,7 @@ from lm_eval.api.filter import FilterEnsemble
from
lm_eval.logger
import
eval_logger
from
lm_eval.prompts
import
get_prompt
from
lm_eval.filters
import
build_filter_ensemble
from
lm_eval.metrics
import
metric_max_over_ground_truths
from
lm_eval.api.metrics
import
(
mean
,
weighted_perplexity
,
...
...
lm_eval/tasks/super_glue/boolq/default.yaml
View file @
b0597a94
...
...
@@ -8,7 +8,7 @@ training_split: train
validation_split
:
validation
doc_to_text
:
"
{{passage}}
\n
Question:
{{question}}?
\n
Answer:"
doc_to_target
:
label
doc_to_choice
:
{
0
:
"
no"
,
1
:
"
yes"
}
doc_to_choice
:
[
"
no"
,
"
yes"
]
should_decontaminate
:
true
doc_to_decontamination_query
:
passage
metric_list
:
...
...
lm_eval/tasks/super_glue/multirc/default.yaml
View file @
b0597a94
...
...
@@ -8,6 +8,6 @@ training_split: train
validation_split
:
validation
doc_to_text
:
"
{{paragraph}}
\n
Question:
{{question}}
\n
Answer:"
doc_to_target
:
label
doc_to_choice
:
"
[
\"
{{answer}}
\\
nIs
the
answer
correct?
yes
\"
,
\"
{{answer}}
\\
nIs
the
answer
correct?
no
\"
]"
doc_to_choice
:
"
[
'''
{{answer}}
\\
nIs
the
answer
correct?
yes
''',
'''
{{answer}}
\\
nIs
the
answer
correct?
no
'''
]"
metric_list
:
-
metric
:
acc
lm_eval/tasks/super_glue/record/default.yaml
View file @
b0597a94
group
:
-
super-glue-lm-eval-v1
#
group:
#
- super-glue-lm-eval-v1
task
:
record
dataset_path
:
super_glue
dataset_name
:
record
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment