Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
89b6bdb3
Commit
89b6bdb3
authored
Feb 06, 2025
by
Baber
Browse files
Merge branch 'main' into ai2d
parents
59053d58
144a1e58
Changes
1000
Hide whitespace changes
Inline
Side-by-side
Showing
20 changed files
with
129 additions
and
0 deletions
+129
-0
lm_eval/tasks/global_mmlu/full/fa/_fa_template_yaml
lm_eval/tasks/global_mmlu/full/fa/_fa_template_yaml
+16
-0
lm_eval/tasks/global_mmlu/full/fa/_global_mmlu_full_fa.yaml
lm_eval/tasks/global_mmlu/full/fa/_global_mmlu_full_fa.yaml
+11
-0
lm_eval/tasks/global_mmlu/full/fa/_global_mmlu_full_fa_humanities.yaml
.../global_mmlu/full/fa/_global_mmlu_full_fa_humanities.yaml
+8
-0
lm_eval/tasks/global_mmlu/full/fa/_global_mmlu_full_fa_other.yaml
...tasks/global_mmlu/full/fa/_global_mmlu_full_fa_other.yaml
+8
-0
lm_eval/tasks/global_mmlu/full/fa/_global_mmlu_full_fa_social_sciences.yaml
...al_mmlu/full/fa/_global_mmlu_full_fa_social_sciences.yaml
+8
-0
lm_eval/tasks/global_mmlu/full/fa/_global_mmlu_full_fa_stem.yaml
.../tasks/global_mmlu/full/fa/_global_mmlu_full_fa_stem.yaml
+8
-0
lm_eval/tasks/global_mmlu/full/fa/global_mmlu_full_fa_abstract_algebra.yaml
...al_mmlu/full/fa/global_mmlu_full_fa_abstract_algebra.yaml
+5
-0
lm_eval/tasks/global_mmlu/full/fa/global_mmlu_full_fa_anatomy.yaml
...asks/global_mmlu/full/fa/global_mmlu_full_fa_anatomy.yaml
+5
-0
lm_eval/tasks/global_mmlu/full/fa/global_mmlu_full_fa_astronomy.yaml
...ks/global_mmlu/full/fa/global_mmlu_full_fa_astronomy.yaml
+5
-0
lm_eval/tasks/global_mmlu/full/fa/global_mmlu_full_fa_business_ethics.yaml
...bal_mmlu/full/fa/global_mmlu_full_fa_business_ethics.yaml
+5
-0
lm_eval/tasks/global_mmlu/full/fa/global_mmlu_full_fa_clinical_knowledge.yaml
..._mmlu/full/fa/global_mmlu_full_fa_clinical_knowledge.yaml
+5
-0
lm_eval/tasks/global_mmlu/full/fa/global_mmlu_full_fa_college_biology.yaml
...bal_mmlu/full/fa/global_mmlu_full_fa_college_biology.yaml
+5
-0
lm_eval/tasks/global_mmlu/full/fa/global_mmlu_full_fa_college_chemistry.yaml
...l_mmlu/full/fa/global_mmlu_full_fa_college_chemistry.yaml
+5
-0
lm_eval/tasks/global_mmlu/full/fa/global_mmlu_full_fa_college_computer_science.yaml
...full/fa/global_mmlu_full_fa_college_computer_science.yaml
+5
-0
lm_eval/tasks/global_mmlu/full/fa/global_mmlu_full_fa_college_mathematics.yaml
...mmlu/full/fa/global_mmlu_full_fa_college_mathematics.yaml
+5
-0
lm_eval/tasks/global_mmlu/full/fa/global_mmlu_full_fa_college_medicine.yaml
...al_mmlu/full/fa/global_mmlu_full_fa_college_medicine.yaml
+5
-0
lm_eval/tasks/global_mmlu/full/fa/global_mmlu_full_fa_college_physics.yaml
...bal_mmlu/full/fa/global_mmlu_full_fa_college_physics.yaml
+5
-0
lm_eval/tasks/global_mmlu/full/fa/global_mmlu_full_fa_computer_security.yaml
...l_mmlu/full/fa/global_mmlu_full_fa_computer_security.yaml
+5
-0
lm_eval/tasks/global_mmlu/full/fa/global_mmlu_full_fa_conceptual_physics.yaml
..._mmlu/full/fa/global_mmlu_full_fa_conceptual_physics.yaml
+5
-0
lm_eval/tasks/global_mmlu/full/fa/global_mmlu_full_fa_econometrics.yaml
...global_mmlu/full/fa/global_mmlu_full_fa_econometrics.yaml
+5
-0
No files found.
Too many changes to show.
To preserve performance only
1000 of 1000+
files are displayed.
Plain diff
Email patch
lm_eval/tasks/global_mmlu/full/fa/_fa_template_yaml
0 → 100644
View file @
89b6bdb3
dataset_path: CohereForAI/Global-MMLU
dataset_name: fa
test_split: test
fewshot_split: dev
fewshot_config:
sampler: first_n
output_type: multiple_choice
doc_to_text: "{{question.strip()}}\nA. {{option_a}}\nB. {{option_b}}\nC. {{option_c}}\nD. {{option_d}}\nAnswer:"
doc_to_choice: ["A", "B", "C", "D"]
doc_to_target: answer
metric_list:
- metric: acc
aggregation: mean
higher_is_better: true
metadata:
version: 0.0
lm_eval/tasks/global_mmlu/full/fa/_global_mmlu_full_fa.yaml
0 → 100644
View file @
89b6bdb3
group
:
global_mmlu_full_fa
task
:
-
global_mmlu_full_fa_stem
-
global_mmlu_full_fa_other
-
global_mmlu_full_fa_social_sciences
-
global_mmlu_full_fa_humanities
aggregate_metric_list
:
-
metric
:
acc
weight_by_size
:
True
metadata
:
version
:
0.0
lm_eval/tasks/global_mmlu/full/fa/_global_mmlu_full_fa_humanities.yaml
0 → 100644
View file @
89b6bdb3
group
:
global_mmlu_full_fa_humanities
task
:
-
global_mmlu_full_fa_humanities_tasks
aggregate_metric_list
:
-
metric
:
acc
weight_by_size
:
true
metadata
:
version
:
0.0
lm_eval/tasks/global_mmlu/full/fa/_global_mmlu_full_fa_other.yaml
0 → 100644
View file @
89b6bdb3
group
:
global_mmlu_full_fa_other
task
:
-
global_mmlu_full_fa_other_tasks
aggregate_metric_list
:
-
metric
:
acc
weight_by_size
:
true
metadata
:
version
:
0.0
lm_eval/tasks/global_mmlu/full/fa/_global_mmlu_full_fa_social_sciences.yaml
0 → 100644
View file @
89b6bdb3
group
:
global_mmlu_full_fa_social_sciences
task
:
-
global_mmlu_full_fa_social_sciences_tasks
aggregate_metric_list
:
-
metric
:
acc
weight_by_size
:
true
metadata
:
version
:
0.0
lm_eval/tasks/global_mmlu/full/fa/_global_mmlu_full_fa_stem.yaml
0 → 100644
View file @
89b6bdb3
group
:
global_mmlu_full_fa_stem
task
:
-
global_mmlu_full_fa_stem_tasks
aggregate_metric_list
:
-
metric
:
acc
weight_by_size
:
true
metadata
:
version
:
0.0
lm_eval/tasks/global_mmlu/full/fa/global_mmlu_full_fa_abstract_algebra.yaml
0 → 100644
View file @
89b6bdb3
# Generated by _generate_configs.py
include
:
_fa_template_yaml
process_docs
:
!function
utils.process_abstract_algebra
tag
:
global_mmlu_full_fa_stem_tasks
task
:
global_mmlu_full_fa_abstract_algebra
lm_eval/tasks/global_mmlu/full/fa/global_mmlu_full_fa_anatomy.yaml
0 → 100644
View file @
89b6bdb3
# Generated by _generate_configs.py
include
:
_fa_template_yaml
process_docs
:
!function
utils.process_anatomy
tag
:
global_mmlu_full_fa_stem_tasks
task
:
global_mmlu_full_fa_anatomy
lm_eval/tasks/global_mmlu/full/fa/global_mmlu_full_fa_astronomy.yaml
0 → 100644
View file @
89b6bdb3
# Generated by _generate_configs.py
include
:
_fa_template_yaml
process_docs
:
!function
utils.process_astronomy
tag
:
global_mmlu_full_fa_stem_tasks
task
:
global_mmlu_full_fa_astronomy
lm_eval/tasks/global_mmlu/full/fa/global_mmlu_full_fa_business_ethics.yaml
0 → 100644
View file @
89b6bdb3
# Generated by _generate_configs.py
include
:
_fa_template_yaml
process_docs
:
!function
utils.process_business_ethics
tag
:
global_mmlu_full_fa_other_tasks
task
:
global_mmlu_full_fa_business_ethics
lm_eval/tasks/global_mmlu/full/fa/global_mmlu_full_fa_clinical_knowledge.yaml
0 → 100644
View file @
89b6bdb3
# Generated by _generate_configs.py
include
:
_fa_template_yaml
process_docs
:
!function
utils.process_clinical_knowledge
tag
:
global_mmlu_full_fa_other_tasks
task
:
global_mmlu_full_fa_clinical_knowledge
lm_eval/tasks/global_mmlu/full/fa/global_mmlu_full_fa_college_biology.yaml
0 → 100644
View file @
89b6bdb3
# Generated by _generate_configs.py
include
:
_fa_template_yaml
process_docs
:
!function
utils.process_college_biology
tag
:
global_mmlu_full_fa_stem_tasks
task
:
global_mmlu_full_fa_college_biology
lm_eval/tasks/global_mmlu/full/fa/global_mmlu_full_fa_college_chemistry.yaml
0 → 100644
View file @
89b6bdb3
# Generated by _generate_configs.py
include
:
_fa_template_yaml
process_docs
:
!function
utils.process_college_chemistry
tag
:
global_mmlu_full_fa_stem_tasks
task
:
global_mmlu_full_fa_college_chemistry
lm_eval/tasks/global_mmlu/full/fa/global_mmlu_full_fa_college_computer_science.yaml
0 → 100644
View file @
89b6bdb3
# Generated by _generate_configs.py
include
:
_fa_template_yaml
process_docs
:
!function
utils.process_college_computer_science
tag
:
global_mmlu_full_fa_stem_tasks
task
:
global_mmlu_full_fa_college_computer_science
lm_eval/tasks/global_mmlu/full/fa/global_mmlu_full_fa_college_mathematics.yaml
0 → 100644
View file @
89b6bdb3
# Generated by _generate_configs.py
include
:
_fa_template_yaml
process_docs
:
!function
utils.process_college_mathematics
tag
:
global_mmlu_full_fa_stem_tasks
task
:
global_mmlu_full_fa_college_mathematics
lm_eval/tasks/global_mmlu/full/fa/global_mmlu_full_fa_college_medicine.yaml
0 → 100644
View file @
89b6bdb3
# Generated by _generate_configs.py
include
:
_fa_template_yaml
process_docs
:
!function
utils.process_college_medicine
tag
:
global_mmlu_full_fa_other_tasks
task
:
global_mmlu_full_fa_college_medicine
lm_eval/tasks/global_mmlu/full/fa/global_mmlu_full_fa_college_physics.yaml
0 → 100644
View file @
89b6bdb3
# Generated by _generate_configs.py
include
:
_fa_template_yaml
process_docs
:
!function
utils.process_college_physics
tag
:
global_mmlu_full_fa_stem_tasks
task
:
global_mmlu_full_fa_college_physics
lm_eval/tasks/global_mmlu/full/fa/global_mmlu_full_fa_computer_security.yaml
0 → 100644
View file @
89b6bdb3
# Generated by _generate_configs.py
include
:
_fa_template_yaml
process_docs
:
!function
utils.process_computer_security
tag
:
global_mmlu_full_fa_stem_tasks
task
:
global_mmlu_full_fa_computer_security
lm_eval/tasks/global_mmlu/full/fa/global_mmlu_full_fa_conceptual_physics.yaml
0 → 100644
View file @
89b6bdb3
# Generated by _generate_configs.py
include
:
_fa_template_yaml
process_docs
:
!function
utils.process_conceptual_physics
tag
:
global_mmlu_full_fa_stem_tasks
task
:
global_mmlu_full_fa_conceptual_physics
lm_eval/tasks/global_mmlu/full/fa/global_mmlu_full_fa_econometrics.yaml
0 → 100644
View file @
89b6bdb3
# Generated by _generate_configs.py
include
:
_fa_template_yaml
process_docs
:
!function
utils.process_econometrics
tag
:
global_mmlu_full_fa_social_sciences_tasks
task
:
global_mmlu_full_fa_econometrics
Prev
1
…
42
43
44
45
46
47
48
49
50
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment