Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
6d0c60d7
Commit
6d0c60d7
authored
Dec 10, 2024
by
Baber
Browse files
test arc_challenge
parent
31631407
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
24 additions
and
0 deletions
+24
-0
lm_eval/tasks/llama3/base/arc_easy.yaml
lm_eval/tasks/llama3/base/arc_easy.yaml
+24
-0
No files found.
lm_eval/tasks/llama3/base/arc_easy.yaml
0 → 100644
View file @
6d0c60d7
tag
:
-
llama
task
:
arc_challenge_chat
dataset_path
:
allenai/ai2_arc
dataset_name
:
ARC-Challenge
output_type
:
multiple_choice
training_split
:
train
validation_split
:
validation
test_split
:
test
#doc_to_text: "Question: {{question}}\nAnswer:"
doc_to_text
:
"
Question:
{{question.strip()}}
\n
A.
{{choices.text[0]}}
\n
B.
{{choices.text[1]}}
\n
C.
{{choices.text[2]}}{%
if
choices.text|length
>
3
%}
\n
D.
{{choices.text[3]}}{%
endif
%}
\n
Answer:"
fewshot_delimiter
:
"
\n\n
"
doc_to_target
:
"
{{answerKey}}"
doc_to_choice
:
"
{{choices.label}}"
num_fewshot
:
25
metric_list
:
-
metric
:
acc
aggregation
:
mean
higher_is_better
:
true
-
metric
:
acc_norm
aggregation
:
mean
higher_is_better
:
true
metadata
:
version
:
1.0
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment