gaoqiong / lm-evaluation-harness
Commit 799359ad, authored Feb 10, 2025 by Baber
add openbookqa generative
Parent: a40fe42a
Showing 1 changed file with 32 additions and 0 deletions
lm_eval/tasks/openbookqa/openbookqa_generative.yaml (new file, 0 → 100644)
task: openbookqa_generative
dataset_path: openbookqa
dataset_name: main
output_type: generate_until
training_split: train
validation_split: validation
test_split: test
doc_to_text: |-
  Given the following question and four candidate answers (A, B, C and D), choose the best answer.
  Question: {{question_stem}}
  A. {{choices.text[0]}}
  B. {{choices.text[1]}}
  C. {{choices.text[2]}}
  D. {{choices.text[3]}}
  Your response should end with "The best answer is [the_answer_letter]" where the [the_answer_letter] is one of choice letters, A, B, C or D.
doc_to_target: answerKey
should_decontaminate: true
filter_list:
  - name: "strict-match"
    filter:
      - function: "multi_choice_regex"
      - function: "take_first"
gen_kwargs:
  until: []
  max_gen_toks: 10
metric_list:
  - metric: exact_match
    aggregation: mean
    higher_is_better: true
metadata:
  version: 1.0
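To make the flow of this config concrete, here is a minimal Python sketch of what the task asks for: render the prompt from a document, pull the answer letter out of a generated response, and score it by exact match against `answerKey`. It is an illustration only and does not use the harness. The sample `doc`, the helper names (`render_prompt`, `extract_letter`, `exact_match_mean`) and the regex are all assumptions made for this example. In particular, the pattern is a simplified stand-in and not the harness's own `multi_choice_regex` filter.

```python
import re

# Hypothetical row with the fields the YAML references; not real dataset content.
doc = {
    "question_stem": "Which of these is a source of light?",
    "choices": {"text": ["a rock", "the sun", "a puddle", "a shadow"]},
    "answerKey": "B",
}


def render_prompt(doc):
    """Plain-Python stand-in for the Jinja template in doc_to_text."""
    options = "\n".join(
        f"{letter}. {text}" for letter, text in zip("ABCD", doc["choices"]["text"])
    )
    return (
        "Given the following question and four candidate answers "
        "(A, B, C and D), choose the best answer.\n"
        f"Question: {doc['question_stem']}\n"
        f"{options}\n"
        'Your response should end with "The best answer is [the_answer_letter]" '
        "where the [the_answer_letter] is one of choice letters, A, B, C or D."
    )


# Simplified pattern (an assumption, not the harness's multi_choice_regex).
ANSWER_RE = re.compile(r"best answer is \(?([ABCD])\)?")


def extract_letter(response, fallback="[invalid]"):
    """Return the last answer letter found in the response, or the fallback."""
    matches = ANSWER_RE.findall(response)
    return matches[-1] if matches else fallback


def exact_match_mean(predictions, targets):
    """Mean of per-example exact matches, mirroring `aggregation: mean`."""
    hits = [float(p == t) for p, t in zip(predictions, targets)]
    return sum(hits) / len(hits)


if __name__ == "__main__":
    print(render_prompt(doc))
    prediction = extract_letter("Stars emit light. The best answer is B")
    print(exact_match_mean([prediction], [doc["answerKey"]]))
```

A response with no recognizable answer letter falls back to a sentinel string in this sketch, so it can never match a target and counts as a miss in the mean.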