gaoqiong / lm-evaluation-harness
Commit 9fa5126a, authored Feb 06, 2025 by Baber
fix
Parent: 89b6bdb3
Showing 1 changed file with 14 additions and 6 deletions
lm_eval/tasks/ai2d/ai2d.yaml
 dataset_path: lmms-lab/ai2d
+task: ai2d
 test_split: test
 output_type: generate_until
-doc_to_image: !function utils.doc_to_image
-doc_to_text: "<image>{{question | capitalize}}
-{{% for option in options %}}
-{{loop.index | chr(64)}}. {{option | capitalize}}
-{{% endfor %}}"
-doc_to_target: "{{options[answer]}}"
+doc_to_image:
+  - image
+doc_to_text: "<image>Look at the scientific diagram carefully and answer the following question: {{question | capitalize}}\n{% for option in options -%}{{['A', 'B', 'C', 'D', 'E', 'F'][loop.index0]}}. {{option | capitalize}}\n{% endfor -%}
+
+Think step by step and finally respond to the question with only the correct option number as \"FINAL ANSWER\"."
+gen_prefix: "Let's think step by step."
+doc_to_target: "{{ ['A', 'B', 'C', 'D', 'E', 'F'][answer|int] }}"
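The new `doc_to_text` and `doc_to_target` templates label each option with a letter and use the letter at index `answer` as the target. The following is a plain-Python approximation of what those templates produce, not the harness's own rendering code: the helper names `build_prompt` and `build_target` and the sample document are hypothetical, and the exact whitespace in the real output depends on Jinja's whitespace control.

```python
# Letters used by the new templates to label options and the target.
LETTERS = ["A", "B", "C", "D", "E", "F"]


def build_prompt(doc):
    """Approximate the text rendered by the new doc_to_text template."""
    lines = [
        "<image>Look at the scientific diagram carefully and answer "
        "the following question: " + doc["question"].capitalize()
    ]
    # loop.index0 in the template is the zero-based position of the option.
    for index0, option in enumerate(doc["options"]):
        lines.append(f"{LETTERS[index0]}. {option.capitalize()}")
    lines.append(
        "Think step by step and finally respond to the question with "
        'only the correct option number as "FINAL ANSWER".'
    )
    return "\n".join(lines)


def build_target(doc):
    """Approximate doc_to_target: the letter at position answer|int."""
    return LETTERS[int(doc["answer"])]


# Hypothetical sample document using the field names from the templates.
sample_doc = {"question": "what is shown?", "options": ["sun", "moon"], "answer": "1"}
prompt = build_prompt(sample_doc)
target = build_target(sample_doc)
```

With this sample, the target is a letter rather than the option text, which is what the removed `{{options[answer]}}` template would have produced.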
 generation_kwargs:
   until: []
   temperature: 0.0
   do_sample: false
   max_gen_toks: 512
+filter_list:
+  - name: "strict-match"
+    filter:
+      - function: "regex"
+        regex_pattern: "(?:[`\\*_]*(?:FINAL ANSWER|Final Answer)[`\\*_]*)[:\\s]*[`\\*_]*([A-D])[`\\*_]*"
+      - function: "take_first"
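To see what the `strict-match` pattern accepts, here is a minimal sketch using Python's standard `re` module. The pattern is the YAML string above with its `\\` escapes resolved to single backslashes. The helper `extract_answer` and its `"[invalid]"` fallback value are assumptions for illustration, not the harness's filter implementation.

```python
import re

# The regex_pattern from the diff, after YAML double-quote unescaping.
PATTERN = re.compile(
    r"(?:[`\*_]*(?:FINAL ANSWER|Final Answer)[`\*_]*)[:\s]*[`\*_]*([A-D])[`\*_]*"
)


def extract_answer(response, fallback="[invalid]"):
    """Return the first captured option letter, or the fallback if none."""
    matches = PATTERN.findall(response)
    # Keep only the first match, loosely mirroring a take_first step.
    return matches[0] if matches else fallback


bold = extract_answer("The arrows point to the moon. **FINAL ANSWER: B**")
ticked = extract_answer("Final Answer: `C`")
lowercase = extract_answer("final answer: B")
out_of_range = extract_answer("FINAL ANSWER: E")
```

Note that the capture group is `[A-D]`, while the prompt templates can label options up to `F`. Under this sketch, a response ending in `FINAL ANSWER: E` is not extracted, and neither is an all-lowercase `final answer`.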
 metric_list:
   - metric: acc
     aggregation: mean