Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
f8c2cfcb
Unverified
Commit
f8c2cfcb
authored
Dec 12, 2023
by
Hanwool Albert Lee
Committed by
GitHub
Dec 12, 2023
Browse files
Merge pull request #1089 from h-albert-lee/K-MMLU
Add kmmlu evaluation to tasks
parents
c86f527f
c256eda8
Changes
48
Hide whitespace changes
Inline
Side-by-side
Showing
20 changed files
with
60 additions
and
0 deletions
+60
-0
lm_eval/tasks/kmmlu/kmmlu_food_processing.yaml
lm_eval/tasks/kmmlu/kmmlu_food_processing.yaml
+3
-0
lm_eval/tasks/kmmlu/kmmlu_gas_technology_and_engineering.yaml
...val/tasks/kmmlu/kmmlu_gas_technology_and_engineering.yaml
+3
-0
lm_eval/tasks/kmmlu/kmmlu_general_physics.yaml
lm_eval/tasks/kmmlu/kmmlu_general_physics.yaml
+3
-0
lm_eval/tasks/kmmlu/kmmlu_geomatics.yaml
lm_eval/tasks/kmmlu/kmmlu_geomatics.yaml
+3
-0
lm_eval/tasks/kmmlu/kmmlu_health.yaml
lm_eval/tasks/kmmlu/kmmlu_health.yaml
+3
-0
lm_eval/tasks/kmmlu/kmmlu_industrial_engineer.yaml
lm_eval/tasks/kmmlu/kmmlu_industrial_engineer.yaml
+3
-0
lm_eval/tasks/kmmlu/kmmlu_information_technology.yaml
lm_eval/tasks/kmmlu/kmmlu_information_technology.yaml
+3
-0
lm_eval/tasks/kmmlu/kmmlu_interior_architecture_and_design.yaml
...l/tasks/kmmlu/kmmlu_interior_architecture_and_design.yaml
+3
-0
lm_eval/tasks/kmmlu/kmmlu_korean_language.yaml
lm_eval/tasks/kmmlu/kmmlu_korean_language.yaml
+3
-0
lm_eval/tasks/kmmlu/kmmlu_law.yaml
lm_eval/tasks/kmmlu/kmmlu_law.yaml
+3
-0
lm_eval/tasks/kmmlu/kmmlu_machine_design_and_manufacturing.yaml
...l/tasks/kmmlu/kmmlu_machine_design_and_manufacturing.yaml
+3
-0
lm_eval/tasks/kmmlu/kmmlu_management.yaml
lm_eval/tasks/kmmlu/kmmlu_management.yaml
+3
-0
lm_eval/tasks/kmmlu/kmmlu_maritime_engineering.yaml
lm_eval/tasks/kmmlu/kmmlu_maritime_engineering.yaml
+3
-0
lm_eval/tasks/kmmlu/kmmlu_marketing.yaml
lm_eval/tasks/kmmlu/kmmlu_marketing.yaml
+3
-0
lm_eval/tasks/kmmlu/kmmlu_materials_engineering.yaml
lm_eval/tasks/kmmlu/kmmlu_materials_engineering.yaml
+3
-0
lm_eval/tasks/kmmlu/kmmlu_mechanical_engineering.yaml
lm_eval/tasks/kmmlu/kmmlu_mechanical_engineering.yaml
+3
-0
lm_eval/tasks/kmmlu/kmmlu_nondestructive_testing.yaml
lm_eval/tasks/kmmlu/kmmlu_nondestructive_testing.yaml
+3
-0
lm_eval/tasks/kmmlu/kmmlu_patent.yaml
lm_eval/tasks/kmmlu/kmmlu_patent.yaml
+3
-0
lm_eval/tasks/kmmlu/kmmlu_political_science_and_sociology.yaml
...al/tasks/kmmlu/kmmlu_political_science_and_sociology.yaml
+3
-0
lm_eval/tasks/kmmlu/kmmlu_psychology.yaml
lm_eval/tasks/kmmlu/kmmlu_psychology.yaml
+3
-0
No files found.
lm_eval/tasks/kmmlu/kmmlu_food_processing.yaml
0 → 100644
View file @
f8c2cfcb
"
dataset_name"
:
"
Food
Processing"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_food_processing"
lm_eval/tasks/kmmlu/kmmlu_gas_technology_and_engineering.yaml
0 → 100644
View file @
f8c2cfcb
"
dataset_name"
:
"
Gas
Technology
and
Engineering"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_gas_technology_and_engineering"
lm_eval/tasks/kmmlu/kmmlu_general_physics.yaml
0 → 100644
View file @
f8c2cfcb
"
dataset_name"
:
"
General
Physics"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_general_physics"
lm_eval/tasks/kmmlu/kmmlu_geomatics.yaml
0 → 100644
View file @
f8c2cfcb
"
dataset_name"
:
"
Geomatics"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_geomatics"
lm_eval/tasks/kmmlu/kmmlu_health.yaml
0 → 100644
View file @
f8c2cfcb
"
dataset_name"
:
"
Health"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_health"
lm_eval/tasks/kmmlu/kmmlu_industrial_engineer.yaml
0 → 100644
View file @
f8c2cfcb
"
dataset_name"
:
"
Industrial
Engineer"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_industrial_engineer"
lm_eval/tasks/kmmlu/kmmlu_information_technology.yaml
0 → 100644
View file @
f8c2cfcb
"
dataset_name"
:
"
Information
Technology"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_information_technology"
lm_eval/tasks/kmmlu/kmmlu_interior_architecture_and_design.yaml
0 → 100644
View file @
f8c2cfcb
"
dataset_name"
:
"
Interior
Architecture
and
Design"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_interior_architecture_and_design"
lm_eval/tasks/kmmlu/kmmlu_korean_language.yaml
0 → 100644
View file @
f8c2cfcb
"
dataset_name"
:
"
Korean
Language"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_korean_language"
lm_eval/tasks/kmmlu/kmmlu_law.yaml
0 → 100644
View file @
f8c2cfcb
"
dataset_name"
:
"
Law"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_law"
lm_eval/tasks/kmmlu/kmmlu_machine_design_and_manufacturing.yaml
0 → 100644
View file @
f8c2cfcb
"
dataset_name"
:
"
Machine
Design
and
Manufacturing"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_machine_design_and_manufacturing"
lm_eval/tasks/kmmlu/kmmlu_management.yaml
0 → 100644
View file @
f8c2cfcb
"
dataset_name"
:
"
Management"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_management"
lm_eval/tasks/kmmlu/kmmlu_maritime_engineering.yaml
0 → 100644
View file @
f8c2cfcb
"
dataset_name"
:
"
Maritime
Engineering"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_maritime_engineering"
lm_eval/tasks/kmmlu/kmmlu_marketing.yaml
0 → 100644
View file @
f8c2cfcb
"
dataset_name"
:
"
Marketing"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_marketing"
lm_eval/tasks/kmmlu/kmmlu_materials_engineering.yaml
0 → 100644
View file @
f8c2cfcb
"
dataset_name"
:
"
Materials
Engineering"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_materials_engineering"
lm_eval/tasks/kmmlu/kmmlu_mechanical_engineering.yaml
0 → 100644
View file @
f8c2cfcb
"
dataset_name"
:
"
Mechanical
Engineering"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_mechanical_engineering"
lm_eval/tasks/kmmlu/kmmlu_nondestructive_testing.yaml
0 → 100644
View file @
f8c2cfcb
"
dataset_name"
:
"
Nondestructive
Testing"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_nondestructive_testing"
lm_eval/tasks/kmmlu/kmmlu_patent.yaml
0 → 100644
View file @
f8c2cfcb
"
dataset_name"
:
"
Patent"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_patent"
lm_eval/tasks/kmmlu/kmmlu_political_science_and_sociology.yaml
0 → 100644
View file @
f8c2cfcb
"
dataset_name"
:
"
Political
Science
and
Sociology"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_political_science_and_sociology"
lm_eval/tasks/kmmlu/kmmlu_psychology.yaml
0 → 100644
View file @
f8c2cfcb
"
dataset_name"
:
"
Psychology"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_psychology"
Prev
1
2
3
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment