Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
741a6a69
Commit
741a6a69
authored
Aug 20, 2024
by
lintangsutawika
Browse files
Merge branch 'main' of
https://github.com/EleutherAI/lm-evaluation-harness
into mela
parents
494a4515
b536f067
Changes
1000
Hide whitespace changes
Inline
Side-by-side
Showing
20 changed files
with
80 additions
and
0 deletions
+80
-0
lm_eval/tasks/cmmlu/cmmlu_chinese_driving_rule.yaml
lm_eval/tasks/cmmlu/cmmlu_chinese_driving_rule.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_chinese_food_culture.yaml
lm_eval/tasks/cmmlu/cmmlu_chinese_food_culture.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_chinese_foreign_policy.yaml
lm_eval/tasks/cmmlu/cmmlu_chinese_foreign_policy.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_chinese_history.yaml
lm_eval/tasks/cmmlu/cmmlu_chinese_history.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_chinese_literature.yaml
lm_eval/tasks/cmmlu/cmmlu_chinese_literature.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_chinese_teacher_qualification.yaml
lm_eval/tasks/cmmlu/cmmlu_chinese_teacher_qualification.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_clinical_knowledge.yaml
lm_eval/tasks/cmmlu/cmmlu_clinical_knowledge.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_college_actuarial_science.yaml
lm_eval/tasks/cmmlu/cmmlu_college_actuarial_science.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_college_education.yaml
lm_eval/tasks/cmmlu/cmmlu_college_education.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_college_engineering_hydrology.yaml
lm_eval/tasks/cmmlu/cmmlu_college_engineering_hydrology.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_college_law.yaml
lm_eval/tasks/cmmlu/cmmlu_college_law.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_college_mathematics.yaml
lm_eval/tasks/cmmlu/cmmlu_college_mathematics.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_college_medical_statistics.yaml
lm_eval/tasks/cmmlu/cmmlu_college_medical_statistics.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_college_medicine.yaml
lm_eval/tasks/cmmlu/cmmlu_college_medicine.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_computer_science.yaml
lm_eval/tasks/cmmlu/cmmlu_computer_science.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_computer_security.yaml
lm_eval/tasks/cmmlu/cmmlu_computer_security.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_conceptual_physics.yaml
lm_eval/tasks/cmmlu/cmmlu_conceptual_physics.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_construction_project_management.yaml
...al/tasks/cmmlu/cmmlu_construction_project_management.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_economics.yaml
lm_eval/tasks/cmmlu/cmmlu_economics.yaml
+4
-0
lm_eval/tasks/cmmlu/cmmlu_education.yaml
lm_eval/tasks/cmmlu/cmmlu_education.yaml
+4
-0
No files found.
Too many changes to show.
To preserve performance only
1000 of 1000+
files are displayed.
Plain diff
Email patch
lm_eval/tasks/cmmlu/cmmlu_chinese_driving_rule.yaml
0 → 100644
View file @
741a6a69
"
dataset_name"
:
"
chinese_driving_rule"
"
description"
:
"
以下是关于中国驾驶规则的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_chinese_driving_rule"
lm_eval/tasks/cmmlu/cmmlu_chinese_food_culture.yaml
0 → 100644
View file @
741a6a69
"
dataset_name"
:
"
chinese_food_culture"
"
description"
:
"
以下是关于中国饮食文化的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_chinese_food_culture"
lm_eval/tasks/cmmlu/cmmlu_chinese_foreign_policy.yaml
0 → 100644
View file @
741a6a69
"
dataset_name"
:
"
chinese_foreign_policy"
"
description"
:
"
以下是关于中国外交政策的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_chinese_foreign_policy"
lm_eval/tasks/cmmlu/cmmlu_chinese_history.yaml
0 → 100644
View file @
741a6a69
"
dataset_name"
:
"
chinese_history"
"
description"
:
"
以下是关于中国历史的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_chinese_history"
lm_eval/tasks/cmmlu/cmmlu_chinese_literature.yaml
0 → 100644
View file @
741a6a69
"
dataset_name"
:
"
chinese_literature"
"
description"
:
"
以下是关于中国文学的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_chinese_literature"
lm_eval/tasks/cmmlu/cmmlu_chinese_teacher_qualification.yaml
0 → 100644
View file @
741a6a69
"
dataset_name"
:
"
chinese_teacher_qualification"
"
description"
:
"
以下是关于中国教师资格的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_chinese_teacher_qualification"
lm_eval/tasks/cmmlu/cmmlu_clinical_knowledge.yaml
0 → 100644
View file @
741a6a69
"
dataset_name"
:
"
clinical_knowledge"
"
description"
:
"
以下是关于临床知识的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_clinical_knowledge"
lm_eval/tasks/cmmlu/cmmlu_college_actuarial_science.yaml
0 → 100644
View file @
741a6a69
"
dataset_name"
:
"
college_actuarial_science"
"
description"
:
"
以下是关于大学精算学的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_college_actuarial_science"
lm_eval/tasks/cmmlu/cmmlu_college_education.yaml
0 → 100644
View file @
741a6a69
"
dataset_name"
:
"
college_education"
"
description"
:
"
以下是关于大学教育学的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_college_education"
lm_eval/tasks/cmmlu/cmmlu_college_engineering_hydrology.yaml
0 → 100644
View file @
741a6a69
"
dataset_name"
:
"
college_engineering_hydrology"
"
description"
:
"
以下是关于大学工程水文学的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_college_engineering_hydrology"
lm_eval/tasks/cmmlu/cmmlu_college_law.yaml
0 → 100644
View file @
741a6a69
"
dataset_name"
:
"
college_law"
"
description"
:
"
以下是关于大学法律的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_college_law"
lm_eval/tasks/cmmlu/cmmlu_college_mathematics.yaml
0 → 100644
View file @
741a6a69
"
dataset_name"
:
"
college_mathematics"
"
description"
:
"
以下是关于大学数学的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_college_mathematics"
lm_eval/tasks/cmmlu/cmmlu_college_medical_statistics.yaml
0 → 100644
View file @
741a6a69
"
dataset_name"
:
"
college_medical_statistics"
"
description"
:
"
以下是关于大学医学统计的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_college_medical_statistics"
lm_eval/tasks/cmmlu/cmmlu_college_medicine.yaml
0 → 100644
View file @
741a6a69
"
dataset_name"
:
"
college_medicine"
"
description"
:
"
以下是关于大学医学的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_college_medicine"
lm_eval/tasks/cmmlu/cmmlu_computer_science.yaml
0 → 100644
View file @
741a6a69
"
dataset_name"
:
"
computer_science"
"
description"
:
"
以下是关于计算机科学的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_computer_science"
lm_eval/tasks/cmmlu/cmmlu_computer_security.yaml
0 → 100644
View file @
741a6a69
"
dataset_name"
:
"
computer_security"
"
description"
:
"
以下是关于计算机安全的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_computer_security"
lm_eval/tasks/cmmlu/cmmlu_conceptual_physics.yaml
0 → 100644
View file @
741a6a69
"
dataset_name"
:
"
conceptual_physics"
"
description"
:
"
以下是关于概念物理学的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_conceptual_physics"
lm_eval/tasks/cmmlu/cmmlu_construction_project_management.yaml
0 → 100644
View file @
741a6a69
"
dataset_name"
:
"
construction_project_management"
"
description"
:
"
以下是关于建设工程管理的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_construction_project_management"
lm_eval/tasks/cmmlu/cmmlu_economics.yaml
0 → 100644
View file @
741a6a69
"
dataset_name"
:
"
economics"
"
description"
:
"
以下是关于经济学的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_economics"
lm_eval/tasks/cmmlu/cmmlu_education.yaml
0 → 100644
View file @
741a6a69
"
dataset_name"
:
"
education"
"
description"
:
"
以下是关于教育学的单项选择题,请直接给出正确答案的选项。
\n\n
"
"
include"
:
"
_default_template_yaml"
"
task"
:
"
cmmlu_education"
Prev
1
…
23
24
25
26
27
28
29
30
31
…
50
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment