Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
2493c6cc
Commit
2493c6cc
authored
Oct 06, 2023
by
lintangsutawika
Browse files
changed default to 5-shot
parent
f2c396ab
Changes
4
Hide whitespace changes
Inline
Side-by-side
Showing
4 changed files
with
8 additions
and
4 deletions
+8
-4
lm_eval/tasks/benchmarks/flan/flan_held_out.yaml
lm_eval/tasks/benchmarks/flan/flan_held_out.yaml
+5
-4
lm_eval/tasks/mmlu/default/_default_template_yaml
lm_eval/tasks/mmlu/default/_default_template_yaml
+1
-0
lm_eval/tasks/mmlu/flan_n_shot/_mmlu_flan_generative_template_yaml
...asks/mmlu/flan_n_shot/_mmlu_flan_generative_template_yaml
+1
-0
lm_eval/tasks/mmlu/flan_n_shot/_mmlu_flan_loglikelihood_template_yaml
...s/mmlu/flan_n_shot/_mmlu_flan_loglikelihood_template_yaml
+1
-0
No files found.
lm_eval/tasks/benchmarks/flan/flan_held_out.yaml
View file @
2493c6cc
...
@@ -3,10 +3,11 @@ task:
...
@@ -3,10 +3,11 @@ task:
# BBH
# BBH
-
bbh_flan_zeroshot
-
bbh_flan_zeroshot
-
bbh_flan_fewshot
-
bbh_flan_fewshot
#
- bbh_flan_cot_fewshot
-
bbh_flan_cot_fewshot
#
- bbh_flan_cot_zeroshot
-
bbh_flan_cot_zeroshot
# MMLU
# MMLU
-
mmlu
-
mmlu_flan_n_shot_generative
-
mmlu_flan_n_shot_generative
-
mmlu_flan_n_shot_loglikelihood
-
mmlu_flan_n_shot_loglikelihood
#
- mmlu_flan_cot_zeroshot
-
mmlu_flan_cot_zeroshot
#
- mmlu_flan_cot_fewshot
-
mmlu_flan_cot_fewshot
lm_eval/tasks/mmlu/default/_default_template_yaml
View file @
2493c6cc
...
@@ -15,3 +15,4 @@ metric_list:
...
@@ -15,3 +15,4 @@ metric_list:
- metric: acc_norm
- metric: acc_norm
aggregation: mean
aggregation: mean
higher_is_better: true
higher_is_better: true
num_fewshot: 5
lm_eval/tasks/mmlu/flan_n_shot/_mmlu_flan_generative_template_yaml
View file @
2493c6cc
...
@@ -12,3 +12,4 @@ metric_list:
...
@@ -12,3 +12,4 @@ metric_list:
- metric: exact_match
- metric: exact_match
aggregation: mean
aggregation: mean
higher_is_better: true
higher_is_better: true
num_fewshot: 5
lm_eval/tasks/mmlu/flan_n_shot/_mmlu_flan_loglikelihood_template_yaml
View file @
2493c6cc
...
@@ -13,3 +13,4 @@ metric_list:
...
@@ -13,3 +13,4 @@ metric_list:
- metric: acc_norm
- metric: acc_norm
aggregation: mean
aggregation: mean
higher_is_better: true
higher_is_better: true
num_fewshot: 5
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment