Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
2041dc34
Commit
2041dc34
authored
Oct 03, 2023
by
haileyschoelkopf
Browse files
Merge branch 'big-refactor' into bigbench
parents
67c0f73a
15f4a3ef
Changes
201
Hide whitespace changes
Inline
Side-by-side
Showing
20 changed files
with
80 additions
and
0 deletions
+80
-0
lm_eval/tasks/cmmlu/cmmlu_default_economics.yaml
lm_eval/tasks/cmmlu/cmmlu_default_economics.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_default_education.yaml
lm_eval/tasks/cmmlu/cmmlu_default_education.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_default_electrical_engineering.yaml
...val/tasks/cmmlu/cmmlu_default_electrical_engineering.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_default_elementary_chinese.yaml
lm_eval/tasks/cmmlu/cmmlu_default_elementary_chinese.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_default_elementary_commonsense.yaml
...val/tasks/cmmlu/cmmlu_default_elementary_commonsense.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_default_elementary_information_and_technology.yaml
.../cmmlu_default_elementary_information_and_technology.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_default_elementary_mathematics.yaml
...val/tasks/cmmlu/cmmlu_default_elementary_mathematics.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_default_ethnology.yaml
lm_eval/tasks/cmmlu/cmmlu_default_ethnology.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_default_food_science.yaml
lm_eval/tasks/cmmlu/cmmlu_default_food_science.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_default_genetics.yaml
lm_eval/tasks/cmmlu/cmmlu_default_genetics.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_default_global_facts.yaml
lm_eval/tasks/cmmlu/cmmlu_default_global_facts.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_default_high_school_biology.yaml
lm_eval/tasks/cmmlu/cmmlu_default_high_school_biology.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_default_high_school_chemistry.yaml
lm_eval/tasks/cmmlu/cmmlu_default_high_school_chemistry.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_default_high_school_geography.yaml
lm_eval/tasks/cmmlu/cmmlu_default_high_school_geography.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_default_high_school_mathematics.yaml
...al/tasks/cmmlu/cmmlu_default_high_school_mathematics.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_default_high_school_physics.yaml
lm_eval/tasks/cmmlu/cmmlu_default_high_school_physics.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_default_high_school_politics.yaml
lm_eval/tasks/cmmlu/cmmlu_default_high_school_politics.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_default_human_sexuality.yaml
lm_eval/tasks/cmmlu/cmmlu_default_human_sexuality.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_default_international_law.yaml
lm_eval/tasks/cmmlu/cmmlu_default_international_law.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_default_journalism.yaml
lm_eval/tasks/cmmlu/cmmlu_default_journalism.yaml
+4
-0
No files found.
lm_eval/tasks/cmmlu/cmmlu_default_economics.yaml
0 → 100644
View file @
2041dc34
"
dataset_name"
:
"
economics"
"
description"
:
"
以下是关于经济学的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_economics"
lm_eval/tasks/cmmlu/cmmlu_default_education.yaml
0 → 100644
View file @
2041dc34
"
dataset_name"
:
"
education"
"
description"
:
"
以下是关于教育学的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_education"
lm_eval/tasks/cmmlu/cmmlu_default_electrical_engineering.yaml
0 → 100644
View file @
2041dc34
"
dataset_name"
:
"
electrical_engineering"
"
description"
:
"
以下是关于电气工程的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_electrical_engineering"
lm_eval/tasks/cmmlu/cmmlu_default_elementary_chinese.yaml
0 → 100644
View file @
2041dc34
"
dataset_name"
:
"
elementary_chinese"
"
description"
:
"
以下是关于小学语文的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_elementary_chinese"
lm_eval/tasks/cmmlu/cmmlu_default_elementary_commonsense.yaml
0 → 100644
View file @
2041dc34
"
dataset_name"
:
"
elementary_commonsense"
"
description"
:
"
以下是关于小学常识的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_elementary_commonsense"
lm_eval/tasks/cmmlu/cmmlu_default_elementary_information_and_technology.yaml
0 → 100644
View file @
2041dc34
"
dataset_name"
:
"
elementary_information_and_technology"
"
description"
:
"
以下是关于小学信息技术的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_elementary_information_and_technology"
lm_eval/tasks/cmmlu/cmmlu_default_elementary_mathematics.yaml
0 → 100644
View file @
2041dc34
"
dataset_name"
:
"
elementary_mathematics"
"
description"
:
"
以下是关于初等数学的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_elementary_mathematics"
lm_eval/tasks/cmmlu/cmmlu_default_ethnology.yaml
0 → 100644
View file @
2041dc34
"
dataset_name"
:
"
ethnology"
"
description"
:
"
以下是关于民族学的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_ethnology"
lm_eval/tasks/cmmlu/cmmlu_default_food_science.yaml
0 → 100644
View file @
2041dc34
"
dataset_name"
:
"
food_science"
"
description"
:
"
以下是关于食品科学的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_food_science"
lm_eval/tasks/cmmlu/cmmlu_default_genetics.yaml
0 → 100644
View file @
2041dc34
"
dataset_name"
:
"
genetics"
"
description"
:
"
以下是关于遗传学的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_genetics"
lm_eval/tasks/cmmlu/cmmlu_default_global_facts.yaml
0 → 100644
View file @
2041dc34
"
dataset_name"
:
"
global_facts"
"
description"
:
"
以下是关于全球事实的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_global_facts"
lm_eval/tasks/cmmlu/cmmlu_default_high_school_biology.yaml
0 → 100644
View file @
2041dc34
"
dataset_name"
:
"
high_school_biology"
"
description"
:
"
以下是关于高中生物的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_high_school_biology"
lm_eval/tasks/cmmlu/cmmlu_default_high_school_chemistry.yaml
0 → 100644
View file @
2041dc34
"
dataset_name"
:
"
high_school_chemistry"
"
description"
:
"
以下是关于高中化学的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_high_school_chemistry"
lm_eval/tasks/cmmlu/cmmlu_default_high_school_geography.yaml
0 → 100644
View file @
2041dc34
"
dataset_name"
:
"
high_school_geography"
"
description"
:
"
以下是关于高中地理的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_high_school_geography"
lm_eval/tasks/cmmlu/cmmlu_default_high_school_mathematics.yaml
0 → 100644
View file @
2041dc34
"
dataset_name"
:
"
high_school_mathematics"
"
description"
:
"
以下是关于高中数学的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_high_school_mathematics"
lm_eval/tasks/cmmlu/cmmlu_default_high_school_physics.yaml
0 → 100644
View file @
2041dc34
"
dataset_name"
:
"
high_school_physics"
"
description"
:
"
以下是关于高中物理学的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_high_school_physics"
lm_eval/tasks/cmmlu/cmmlu_default_high_school_politics.yaml
0 → 100644
View file @
2041dc34
"
dataset_name"
:
"
high_school_politics"
"
description"
:
"
以下是关于高中政治的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_high_school_politics"
lm_eval/tasks/cmmlu/cmmlu_default_human_sexuality.yaml
0 → 100644
View file @
2041dc34
"
dataset_name"
:
"
human_sexuality"
"
description"
:
"
以下是关于人类性行为的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_human_sexuality"
lm_eval/tasks/cmmlu/cmmlu_default_international_law.yaml
0 → 100644
View file @
2041dc34
"
dataset_name"
:
"
international_law"
"
description"
:
"
以下是关于国际法学的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_international_law"
lm_eval/tasks/cmmlu/cmmlu_default_journalism.yaml
0 → 100644
View file @
2041dc34
"
dataset_name"
:
"
journalism"
"
description"
:
"
以下是关于新闻学的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_journalism"
Prev
1
2
3
4
5
6
7
8
9
10
11
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment