gaoqiong / lm-evaluation-harness
Commit 543617fe
Authored Sep 05, 2024 by Hailey Schoelkopf; committed by GitHub, Sep 05, 2024
Bump version to v0.4.4; Fixes to TMMLUplus (#2280)
Parent: 7a1614eb
Changes: 76
Showing 20 changed files with 40 additions and 60 deletions
lm_eval/tasks/tmmluplus/default/tmmluplus_linear_algebra.yaml  +2 -3
lm_eval/tasks/tmmluplus/default/tmmluplus_logic_reasoning.yaml  +2 -3
lm_eval/tasks/tmmluplus/default/tmmluplus_macroeconomics.yaml  +2 -3
lm_eval/tasks/tmmluplus/default/tmmluplus_management_accounting.yaml  +2 -3
lm_eval/tasks/tmmluplus/default/tmmluplus_marketing_management.yaml  +2 -3
lm_eval/tasks/tmmluplus/default/tmmluplus_mechanical.yaml  +2 -3
lm_eval/tasks/tmmluplus/default/tmmluplus_music.yaml  +2 -3
lm_eval/tasks/tmmluplus/default/tmmluplus_national_protection.yaml  +2 -3
lm_eval/tasks/tmmluplus/default/tmmluplus_nautical_science.yaml  +2 -3
lm_eval/tasks/tmmluplus/default/tmmluplus_occupational_therapy_for_psychological_disorders.yaml  +2 -3
lm_eval/tasks/tmmluplus/default/tmmluplus_official_document_management.yaml  +2 -3
lm_eval/tasks/tmmluplus/default/tmmluplus_optometry.yaml  +2 -3
lm_eval/tasks/tmmluplus/default/tmmluplus_organic_chemistry.yaml  +2 -3
lm_eval/tasks/tmmluplus/default/tmmluplus_pharmacology.yaml  +2 -3
lm_eval/tasks/tmmluplus/default/tmmluplus_pharmacy.yaml  +2 -3
lm_eval/tasks/tmmluplus/default/tmmluplus_physical_education.yaml  +2 -3
lm_eval/tasks/tmmluplus/default/tmmluplus_physics.yaml  +2 -3
lm_eval/tasks/tmmluplus/default/tmmluplus_politic_science.yaml  +2 -3
lm_eval/tasks/tmmluplus/default/tmmluplus_real_estate.yaml  +2 -3
lm_eval/tasks/tmmluplus/default/tmmluplus_secondary_physics.yaml  +2 -3
lm_eval/tasks/tmmluplus/default/tmmluplus_linear_algebra.yaml
 "dataset_name": "linear_algebra"
 "description": "以下為線代的單選題,請提供正確答案的選項。\n\n"
-"group": "tmmluplus_STEM"
-"group_alias": "STEM"
-"include": "_default_template_yaml"
+"tag": "tmmluplus_STEM_tasks"
+"include": "_tmmluplus_template_yaml"
 "task": "tmmluplus_linear_algebra"
 "task_alias": "linear algebra"
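Every file on this page is listed with the same +2 -3 footprint, and the entries suggest one pattern: `group`, `group_alias`, and the `_default_template_yaml` include are dropped, while a `tag` and the `_tmmluplus_template_yaml` include are added. The Python sketch below illustrates that rewrite on a plain dictionary. The helper name `migrate_task_config` is hypothetical, and the rule that the tag equals the old group name plus `_tasks` is an assumption read off the files shown here, not something the commit states.

```python
from typing import Dict

# Template names as they appear in the files on this page.
OLD_TEMPLATE = "_default_template_yaml"
NEW_TEMPLATE = "_tmmluplus_template_yaml"


def migrate_task_config(old: Dict[str, str]) -> Dict[str, str]:
    """Return a copy of a TMMLU+ subtask config in the post-commit shape.

    Hypothetical helper for illustration only; it is not part of
    lm-evaluation-harness.
    """
    new: Dict[str, str] = {}
    for key, value in old.items():
        if key == "group":
            # Assumption: the new tag is the old group name plus "_tasks",
            # which matches every file shown on this page.
            new["tag"] = f"{value}_tasks"
        elif key == "group_alias":
            # Dropped entirely; the new files carry no alias for the tag.
            continue
        elif key == "include" and value == OLD_TEMPLATE:
            new["include"] = NEW_TEMPLATE
        else:
            new[key] = value
    return new


before = {
    "dataset_name": "linear_algebra",
    "description": "以下為線代的單選題,請提供正確答案的選項。\n\n",
    "group": "tmmluplus_STEM",
    "group_alias": "STEM",
    "include": "_default_template_yaml",
    "task": "tmmluplus_linear_algebra",
    "task_alias": "linear algebra",
}
after = migrate_task_config(before)
```

Because Python dictionaries preserve insertion order, `after` keeps the key order seen in the new files: `dataset_name`, `description`, `tag`, `include`, `task`, `task_alias`.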
lm_eval/tasks/tmmluplus/default/tmmluplus_logic_reasoning.yaml
 "dataset_name": "logic_reasoning"
 "description": "以下為邏輯思維的單選題,請提供正確答案的選項。\n\n"
-"group": "tmmluplus_other"
-"group_alias": "other"
-"include": "_default_template_yaml"
+"tag": "tmmluplus_other_tasks"
+"include": "_tmmluplus_template_yaml"
 "task": "tmmluplus_logic_reasoning"
 "task_alias": "logic reasoning"
lm_eval/tasks/tmmluplus/default/tmmluplus_macroeconomics.yaml
 "dataset_name": "macroeconomics"
 "description": "以下為總經的單選題,請提供正確答案的選項。\n\n"
-"group": "tmmluplus_social_sciences"
-"group_alias": "social sciences"
-"include": "_default_template_yaml"
+"tag": "tmmluplus_social_sciences_tasks"
+"include": "_tmmluplus_template_yaml"
 "task": "tmmluplus_macroeconomics"
 "task_alias": "macroeconomics"
lm_eval/tasks/tmmluplus/default/tmmluplus_management_accounting.yaml
 "dataset_name": "management_accounting"
 "description": "以下為管理會計的單選題,請提供正確答案的選項。\n\n"
-"group": "tmmluplus_other"
-"group_alias": "other"
-"include": "_default_template_yaml"
+"tag": "tmmluplus_other_tasks"
+"include": "_tmmluplus_template_yaml"
 "task": "tmmluplus_management_accounting"
 "task_alias": "management accounting"
lm_eval/tasks/tmmluplus/default/tmmluplus_marketing_management.yaml
 "dataset_name": "marketing_management"
 "description": "以下為行銷管理的單選題,請提供正確答案的選項。\n\n"
-"group": "tmmluplus_other"
-"group_alias": "other"
-"include": "_default_template_yaml"
+"tag": "tmmluplus_other_tasks"
+"include": "_tmmluplus_template_yaml"
 "task": "tmmluplus_marketing_management"
 "task_alias": "marketing management"
lm_eval/tasks/tmmluplus/default/tmmluplus_mechanical.yaml
 "dataset_name": "mechanical"
 "description": "以下為機械與機電概論的單選題,請提供正確答案的選項。\n\n"
-"group": "tmmluplus_other"
-"group_alias": "other"
-"include": "_default_template_yaml"
+"tag": "tmmluplus_other_tasks"
+"include": "_tmmluplus_template_yaml"
 "task": "tmmluplus_mechanical"
 "task_alias": "mechanical"
lm_eval/tasks/tmmluplus/default/tmmluplus_music.yaml
 "dataset_name": "music"
 "description": "以下為音樂科的單選題,請提供正確答案的選項。\n\n"
-"group": "tmmluplus_other"
-"group_alias": "other"
-"include": "_default_template_yaml"
+"tag": "tmmluplus_other_tasks"
+"include": "_tmmluplus_template_yaml"
 "task": "tmmluplus_music"
 "task_alias": "music"
lm_eval/tasks/tmmluplus/default/tmmluplus_national_protection.yaml
 "dataset_name": "national_protection"
 "description": "以下為軍事的單選題,請提供正確答案的選項。\n\n"
-"group": "tmmluplus_social_sciences"
-"group_alias": "social sciences"
-"include": "_default_template_yaml"
+"tag": "tmmluplus_social_sciences_tasks"
+"include": "_tmmluplus_template_yaml"
 "task": "tmmluplus_national_protection"
 "task_alias": "national protection"
lm_eval/tasks/tmmluplus/default/tmmluplus_nautical_science.yaml
 "dataset_name": "nautical_science"
 "description": "以下為航海的單選題,請提供正確答案的選項。\n\n"
-"group": "tmmluplus_other"
-"group_alias": "other"
-"include": "_default_template_yaml"
+"tag": "tmmluplus_other_tasks"
+"include": "_tmmluplus_template_yaml"
 "task": "tmmluplus_nautical_science"
 "task_alias": "nautical science"
lm_eval/tasks/tmmluplus/default/tmmluplus_occupational_therapy_for_psychological_disorders.yaml
 "dataset_name": "occupational_therapy_for_psychological_disorders"
 "description": "以下為心理障礙職能治療學的單選題,請提供正確答案的選項。\n\n"
-"group": "tmmluplus_social_sciences"
-"group_alias": "social sciences"
-"include": "_default_template_yaml"
+"tag": "tmmluplus_social_sciences_tasks"
+"include": "_tmmluplus_template_yaml"
 "task": "tmmluplus_occupational_therapy_for_psychological_disorders"
 "task_alias": "occupational therapy for psychological disorders"
lm_eval/tasks/tmmluplus/default/tmmluplus_official_document_management.yaml
 "dataset_name": "official_document_management"
 "description": "以下為機關文書的單選題,請提供正確答案的選項。\n\n"
-"group": "tmmluplus_other"
-"group_alias": "other"
-"include": "_default_template_yaml"
+"tag": "tmmluplus_other_tasks"
+"include": "_tmmluplus_template_yaml"
 "task": "tmmluplus_official_document_management"
 "task_alias": "official document management"
lm_eval/tasks/tmmluplus/default/tmmluplus_optometry.yaml
 "dataset_name": "optometry"
 "description": "以下為視光學的單選題,請提供正確答案的選項。\n\n"
-"group": "tmmluplus_other"
-"group_alias": "other"
-"include": "_default_template_yaml"
+"tag": "tmmluplus_other_tasks"
+"include": "_tmmluplus_template_yaml"
 "task": "tmmluplus_optometry"
 "task_alias": "optometry"
lm_eval/tasks/tmmluplus/default/tmmluplus_organic_chemistry.yaml
 "dataset_name": "organic_chemistry"
 "description": "以下為有機化學的單選題,請提供正確答案的選項。\n\n"
-"group": "tmmluplus_STEM"
-"group_alias": "STEM"
-"include": "_default_template_yaml"
+"tag": "tmmluplus_STEM_tasks"
+"include": "_tmmluplus_template_yaml"
 "task": "tmmluplus_organic_chemistry"
 "task_alias": "organic chemistry"
lm_eval/tasks/tmmluplus/default/tmmluplus_pharmacology.yaml
 "dataset_name": "pharmacology"
 "description": "以下為藥理學的單選題,請提供正確答案的選項。\n\n"
-"group": "tmmluplus_other"
-"group_alias": "other"
-"include": "_default_template_yaml"
+"tag": "tmmluplus_other_tasks"
+"include": "_tmmluplus_template_yaml"
 "task": "tmmluplus_pharmacology"
 "task_alias": "pharmacology"
lm_eval/tasks/tmmluplus/default/tmmluplus_pharmacy.yaml
 "dataset_name": "pharmacy"
 "description": "以下為藥劑學的單選題,請提供正確答案的選項。\n\n"
-"group": "tmmluplus_STEM"
-"group_alias": "STEM"
-"include": "_default_template_yaml"
+"tag": "tmmluplus_STEM_tasks"
+"include": "_tmmluplus_template_yaml"
 "task": "tmmluplus_pharmacy"
 "task_alias": "pharmacy"
lm_eval/tasks/tmmluplus/default/tmmluplus_physical_education.yaml
 "dataset_name": "physical_education"
 "description": "以下為體育的單選題,請提供正確答案的選項。\n\n"
-"group": "tmmluplus_social_sciences"
-"group_alias": "social sciences"
-"include": "_default_template_yaml"
+"tag": "tmmluplus_social_sciences_tasks"
+"include": "_tmmluplus_template_yaml"
 "task": "tmmluplus_physical_education"
 "task_alias": "physical education"
lm_eval/tasks/tmmluplus/default/tmmluplus_physics.yaml
 "dataset_name": "physics"
 "description": "以下為物理的單選題,請提供正確答案的選項。\n\n"
-"group": "tmmluplus_STEM"
-"group_alias": "STEM"
-"include": "_default_template_yaml"
+"tag": "tmmluplus_STEM_tasks"
+"include": "_tmmluplus_template_yaml"
 "task": "tmmluplus_physics"
 "task_alias": "physics"
lm_eval/tasks/tmmluplus/default/tmmluplus_politic_science.yaml
 "dataset_name": "politic_science"
 "description": "以下為政治的單選題,請提供正確答案的選項。\n\n"
-"group": "tmmluplus_social_sciences"
-"group_alias": "social sciences"
-"include": "_default_template_yaml"
+"tag": "tmmluplus_social_sciences_tasks"
+"include": "_tmmluplus_template_yaml"
 "task": "tmmluplus_politic_science"
 "task_alias": "politic science"
lm_eval/tasks/tmmluplus/default/tmmluplus_real_estate.yaml
 "dataset_name": "real_estate"
 "description": "以下為房地產的單選題,請提供正確答案的選項。\n\n"
-"group": "tmmluplus_other"
-"group_alias": "other"
-"include": "_default_template_yaml"
+"tag": "tmmluplus_other_tasks"
+"include": "_tmmluplus_template_yaml"
 "task": "tmmluplus_real_estate"
 "task_alias": "real estate"
lm_eval/tasks/tmmluplus/default/tmmluplus_secondary_physics.yaml
 "dataset_name": "secondary_physics"
 "description": "以下為高中物理的單選題,請提供正確答案的選項。\n\n"
-"group": "tmmluplus_STEM"
-"group_alias": "STEM"
-"include": "_default_template_yaml"
+"tag": "tmmluplus_STEM_tasks"
+"include": "_tmmluplus_template_yaml"
 "task": "tmmluplus_secondary_physics"
 "task_alias": "secondary physics"