gaoqiong / lm-evaluation-harness
Commit 13c6f5e7, authored Sep 15, 2023 by haileyschoelkopf
add draft cmmlu port
Parent: 9d0df41b
Changes: 71
Showing 20 changed files with 80 additions and 0 deletions
lm_eval/tasks/cmmlu/cmmlu_default_college_engineering_hydrology.yaml (0 → 100644)
"dataset_name": "college_engineering_hydrology"
"description": "以下是关于大学工程水文学的单项选择题,请直接给出正确答案的选项。\n\n"
"include": "_default_template_yaml"
"task": "cmmlu_college_engineering_hydrology"
lm_eval/tasks/cmmlu/cmmlu_default_college_law.yaml (0 → 100644)
"dataset_name": "college_law"
"description": "以下是关于大学法律的单项选择题,请直接给出正确答案的选项。\n\n"
"include": "_default_template_yaml"
"task": "cmmlu_college_law"
lm_eval/tasks/cmmlu/cmmlu_default_college_mathematics.yaml (0 → 100644)
"dataset_name": "college_mathematics"
"description": "以下是关于大学数学的单项选择题,请直接给出正确答案的选项。\n\n"
"include": "_default_template_yaml"
"task": "cmmlu_college_mathematics"
lm_eval/tasks/cmmlu/cmmlu_default_college_medical_statistics.yaml (0 → 100644)
"dataset_name": "college_medical_statistics"
"description": "以下是关于大学医学统计的单项选择题,请直接给出正确答案的选项。\n\n"
"include": "_default_template_yaml"
"task": "cmmlu_college_medical_statistics"
lm_eval/tasks/cmmlu/cmmlu_default_college_medicine.yaml (0 → 100644)
"dataset_name": "college_medicine"
"description": "以下是关于大学医学的单项选择题,请直接给出正确答案的选项。\n\n"
"include": "_default_template_yaml"
"task": "cmmlu_college_medicine"
lm_eval/tasks/cmmlu/cmmlu_default_computer_science.yaml (0 → 100644)
"dataset_name": "computer_science"
"description": "以下是关于计算机科学的单项选择题,请直接给出正确答案的选项。\n\n"
"include": "_default_template_yaml"
"task": "cmmlu_computer_science"
lm_eval/tasks/cmmlu/cmmlu_default_computer_security.yaml (0 → 100644)
"dataset_name": "computer_security"
"description": "以下是关于计算机安全的单项选择题,请直接给出正确答案的选项。\n\n"
"include": "_default_template_yaml"
"task": "cmmlu_computer_security"
lm_eval/tasks/cmmlu/cmmlu_default_conceptual_physics.yaml (0 → 100644)
"dataset_name": "conceptual_physics"
"description": "以下是关于概念物理学的单项选择题,请直接给出正确答案的选项。\n\n"
"include": "_default_template_yaml"
"task": "cmmlu_conceptual_physics"
lm_eval/tasks/cmmlu/cmmlu_default_construction_project_management.yaml (0 → 100644)
"dataset_name": "construction_project_management"
"description": "以下是关于建设工程管理的单项选择题,请直接给出正确答案的选项。\n\n"
"include": "_default_template_yaml"
"task": "cmmlu_construction_project_management"
lm_eval/tasks/cmmlu/cmmlu_default_economics.yaml (0 → 100644)
"dataset_name": "economics"
"description": "以下是关于经济学的单项选择题,请直接给出正确答案的选项。\n\n"
"include": "_default_template_yaml"
"task": "cmmlu_economics"
lm_eval/tasks/cmmlu/cmmlu_default_education.yaml (0 → 100644)
"dataset_name": "education"
"description": "以下是关于教育学的单项选择题,请直接给出正确答案的选项。\n\n"
"include": "_default_template_yaml"
"task": "cmmlu_education"
lm_eval/tasks/cmmlu/cmmlu_default_electrical_engineering.yaml (0 → 100644)
"dataset_name": "electrical_engineering"
"description": "以下是关于电气工程的单项选择题,请直接给出正确答案的选项。\n\n"
"include": "_default_template_yaml"
"task": "cmmlu_electrical_engineering"
lm_eval/tasks/cmmlu/cmmlu_default_elementary_chinese.yaml (0 → 100644)
"dataset_name": "elementary_chinese"
"description": "以下是关于小学语文的单项选择题,请直接给出正确答案的选项。\n\n"
"include": "_default_template_yaml"
"task": "cmmlu_elementary_chinese"
lm_eval/tasks/cmmlu/cmmlu_default_elementary_commonsense.yaml (0 → 100644)
"dataset_name": "elementary_commonsense"
"description": "以下是关于小学常识的单项选择题,请直接给出正确答案的选项。\n\n"
"include": "_default_template_yaml"
"task": "cmmlu_elementary_commonsense"
lm_eval/tasks/cmmlu/cmmlu_default_elementary_information_and_technology.yaml (0 → 100644)
"dataset_name": "elementary_information_and_technology"
"description": "以下是关于小学信息技术的单项选择题,请直接给出正确答案的选项。\n\n"
"include": "_default_template_yaml"
"task": "cmmlu_elementary_information_and_technology"
lm_eval/tasks/cmmlu/cmmlu_default_elementary_mathematics.yaml (0 → 100644)
"dataset_name": "elementary_mathematics"
"description": "以下是关于初等数学的单项选择题,请直接给出正确答案的选项。\n\n"
"include": "_default_template_yaml"
"task": "cmmlu_elementary_mathematics"
lm_eval/tasks/cmmlu/cmmlu_default_ethnology.yaml (0 → 100644)
"dataset_name": "ethnology"
"description": "以下是关于民族学的单项选择题,请直接给出正确答案的选项。\n\n"
"include": "_default_template_yaml"
"task": "cmmlu_ethnology"
lm_eval/tasks/cmmlu/cmmlu_default_food_science.yaml (0 → 100644)
"dataset_name": "food_science"
"description": "以下是关于食品科学的单项选择题,请直接给出正确答案的选项。\n\n"
"include": "_default_template_yaml"
"task": "cmmlu_food_science"
lm_eval/tasks/cmmlu/cmmlu_default_genetics.yaml (0 → 100644)
"dataset_name": "genetics"
"description": "以下是关于遗传学的单项选择题,请直接给出正确答案的选项。\n\n"
"include": "_default_template_yaml"
"task": "cmmlu_genetics"
lm_eval/tasks/cmmlu/cmmlu_default_global_facts.yaml (0 → 100644)
"dataset_name": "global_facts"
"description": "以下是关于全球事实的单项选择题,请直接给出正确答案的选项。\n\n"
"include": "_default_template_yaml"
"task": "cmmlu_global_facts"
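
The per-subject files in this commit all follow one four-key pattern, differing only in the subject identifier and its Chinese name. The sketch below shows how such files could be rendered from a subject mapping. It is only an illustration, not the script used for this commit: `SUBJECTS`, `render_config`, and `config_filename` are hypothetical names, the two mapping entries are copied from the files above, and `json.dumps` is used merely as a convenient way to emit double-quoted scalars (which YAML also accepts).

```python
import json

# Hypothetical subset of the subject mapping; both entries are taken from
# the files shown in this diff.
SUBJECTS = {
    "college_law": "大学法律",
    "genetics": "遗传学",
}


def config_filename(subject: str) -> str:
    """File name following the cmmlu_default_<subject>.yaml pattern above."""
    return f"cmmlu_default_{subject}.yaml"


def render_config(subject: str, subject_zh: str) -> str:
    """Render the four key/value lines seen in each per-subject file."""
    fields = {
        "dataset_name": subject,
        "description": (
            f"以下是关于{subject_zh}的单项选择题,请直接给出正确答案的选项。\n\n"
        ),
        "include": "_default_template_yaml",
        "task": f"cmmlu_{subject}",
    }
    # json.dumps double-quotes each scalar and escapes the trailing newlines
    # as \n, matching the quoting style of the files in the diff.
    lines = [
        f"{json.dumps(key)}: {json.dumps(value, ensure_ascii=False)}"
        for key, value in fields.items()
    ]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    for subject, subject_zh in SUBJECTS.items():
        print(f"# {config_filename(subject)}")
        print(render_config(subject, subject_zh))
```

Keeping the shared settings in the included template and only the subject-specific fields in each file is what lets every file here stay at four lines.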