Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
9e9b8a09
"src/vscode:/vscode.git/clone" did not exist on "db4bc97050c1781fcfb2bf5ca85db84484366ab4"
Commit
9e9b8a09
authored
Jul 16, 2024
by
Yu Shi Jie
Browse files
changed choices -> options in yaml config to fit dataset schema
parent
ad01e887
Changes
5
Hide whitespace changes
Inline
Side-by-side
Showing
5 changed files
with
5 additions
and
5 deletions
+5
-5
lm_eval/tasks/mmlu_pro/continuation/_continuation_template_yaml
...l/tasks/mmlu_pro/continuation/_continuation_template_yaml
+1
-1
lm_eval/tasks/mmlu_pro/default/_default_template_yaml
lm_eval/tasks/mmlu_pro/default/_default_template_yaml
+1
-1
lm_eval/tasks/mmlu_pro/flan_cot_fewshot/_mmlu_pro_flan_cot_fewshot_template_yaml
...flan_cot_fewshot/_mmlu_pro_flan_cot_fewshot_template_yaml
+1
-1
lm_eval/tasks/mmlu_pro/flan_cot_zeroshot/_mmlu_pro_flan_cot_zeroshot_template_yaml
...an_cot_zeroshot/_mmlu_pro_flan_cot_zeroshot_template_yaml
+1
-1
lm_eval/tasks/mmlu_pro/generative/_default_template_yaml
lm_eval/tasks/mmlu_pro/generative/_default_template_yaml
+1
-1
No files found.
lm_eval/tasks/mmlu_pro/continuation/_continuation_template_yaml
View file @
9e9b8a09
...
...
@@ -4,7 +4,7 @@ fewshot_split: dev
fewshot_config:
sampler: first_n
doc_to_text: "Question: {{question.strip()}}\nAnswer:"
doc_to_choice: "{{
choice
s}}"
doc_to_choice: "{{
option
s}}"
doc_to_target: "{{answer}}"
metadata:
version: 0.0
lm_eval/tasks/mmlu_pro/default/_default_template_yaml
View file @
9e9b8a09
...
...
@@ -4,7 +4,7 @@ fewshot_split: dev
fewshot_config:
sampler: first_n
output_type: multiple_choice
doc_to_text: "{{question.strip()}}\nA. {{
choice
s[0]}}\nB. {{
choice
s[1]}}\nC. {{
choice
s[2]}}\nD. {{
choice
s[3]}}\nE. {{
choice
s[4]}}\nF. {{
choice
s[5]}}\nG. {{
choice
s[6]}}\nH. {{
choice
s[7]}}\nI. {{
choice
s[8]}}\nJ. {{
choice
s[9]}}\nAnswer:"
doc_to_text: "{{question.strip()}}\nA. {{
option
s[0]}}\nB. {{
option
s[1]}}\nC. {{
option
s[2]}}\nD. {{
option
s[3]}}\nE. {{
option
s[4]}}\nF. {{
option
s[5]}}\nG. {{
option
s[6]}}\nH. {{
option
s[7]}}\nI. {{
option
s[8]}}\nJ. {{
option
s[9]}}\nAnswer:"
doc_to_choice: ["A", "B", "C", "D", "E","F","G","H","I","J"]
doc_to_target: answer
metric_list:
...
...
lm_eval/tasks/mmlu_pro/flan_cot_fewshot/_mmlu_pro_flan_cot_fewshot_template_yaml
View file @
9e9b8a09
...
...
@@ -5,7 +5,7 @@ fewshot_split: dev
fewshot_config:
sampler: first_n
output_type: generate_until
doc_to_text: "Q: {{question.strip()}}\n(A) {{
choice
s[0]}} (B) {{
choice
s[1]}} (C) {{
choice
s[2]}} (D) {{
choice
s[3]}} (E) {{
choice
s[4]}} (F) {{
choice
s[5]}} (G) {{
choice
s[6]}} (H) {{
choice
s[7]}} (I) {{
choice
s[8]}} (J) {{
choice
s[9]}}\nA: Let's think step by step."
doc_to_text: "Q: {{question.strip()}}\n(A) {{
option
s[0]}} (B) {{
option
s[1]}} (C) {{
option
s[2]}} (D) {{
option
s[3]}} (E) {{
option
s[4]}} (F) {{
option
s[5]}} (G) {{
option
s[6]}} (H) {{
option
s[7]}} (I) {{
option
s[8]}} (J) {{
option
s[9]}}\nA: Let's think step by step."
doc_to_target: "{{['(A)', '(B)', '(C)', '(D)', '(E)', '(F)', '(G)', '(H)', '(I)', '(J)'][answer]}}"
filter_list:
- name: "get-answer"
...
...
lm_eval/tasks/mmlu_pro/flan_cot_zeroshot/_mmlu_pro_flan_cot_zeroshot_template_yaml
View file @
9e9b8a09
...
...
@@ -2,7 +2,7 @@ dataset_path: sjyuxyz/MMLU-Pro-with-subset
validation_split: validation
fewshot_split: dev
output_type: generate_until
doc_to_text: "Q: {{question.strip()}}\n(A) {{
choice
s[0]}} (B) {{
choice
s[1]}} (C) {{
choice
s[2]}} (D) {{
choice
s[3]}} (E) {{
choice
s[4]}} (F) {{
choice
s[5]}} (G) {{
choice
s[6]}} (H) {{
choice
s[7]}} (I) {{
choice
s[8]}} (J) {{
choice
s[9]}}\nA: Let's think step by step."
doc_to_text: "Q: {{question.strip()}}\n(A) {{
option
s[0]}} (B) {{
option
s[1]}} (C) {{
option
s[2]}} (D) {{
option
s[3]}} (E) {{
option
s[4]}} (F) {{
option
s[5]}} (G) {{
option
s[6]}} (H) {{
option
s[7]}} (I) {{
option
s[8]}} (J) {{
option
s[9]}}\nA: Let's think step by step."
doc_to_target: "{{['(A)', '(B)', '(C)', '(D)', '(E)', '(F)', '(G)', '(H)', '(I)', '(J)'][answer]}}"
filter_list:
- name: "strict-match"
...
...
lm_eval/tasks/mmlu_pro/generative/_default_template_yaml
View file @
9e9b8a09
...
...
@@ -4,7 +4,7 @@ fewshot_split: dev
fewshot_config:
sampler: first_n
output_type: generate_until
doc_to_text: "{{question.strip()}}\nA. {{
choice
s[0]}}\nB. {{
choice
s[1]}}\nC. {{
choice
s[2]}}\nD. {{
choice
s[3]}}\nE. {{
choice
s[4]}}\nF. {{
choice
s[5]}}\nG. {{
choice
s[6]}}\nH. {{
choice
s[7]}}\nI. {{
choice
s[8]}}\nJ. {{
choice
s[9]}}\nAnswer:"
doc_to_text: "{{question.strip()}}\nA. {{
option
s[0]}}\nB. {{
option
s[1]}}\nC. {{
option
s[2]}}\nD. {{
option
s[3]}}\nE. {{
option
s[4]}}\nF. {{
option
s[5]}}\nG. {{
option
s[6]}}\nH. {{
option
s[7]}}\nI. {{
option
s[8]}}\nJ. {{
option
s[9]}}\nAnswer:"
doc_to_target: "{{['A', 'B', 'C', 'D', 'E', 'F', 'G', 'H', 'I', 'J'][answer]}}"
generation_kwargs:
until:
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment