Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
9822b06e
"tests/blenderbot_small/test_tokenization_blenderbot_small.py" did not exist on "476ba679dd21c25aa6c7b57e6849835b9700db0c"
Unverified
Commit
9822b06e
authored
Mar 01, 2024
by
Lintang Sutawika
Committed by
GitHub
Mar 01, 2024
Browse files
Merge branch 'main' into weight_by_size
parents
51f27158
b177c82c
Changes
656
Hide whitespace changes
Inline
Side-by-side
Showing
20 changed files
with
21 additions
and
39 deletions
+21
-39
lm_eval/tasks/kmmlu/hard/kmmlu_hard_public_safety.yaml
lm_eval/tasks/kmmlu/hard/kmmlu_hard_public_safety.yaml
+3
-0
lm_eval/tasks/kmmlu/hard/kmmlu_hard_railway_and_automotive_engineering.yaml
...u/hard/kmmlu_hard_railway_and_automotive_engineering.yaml
+3
-0
lm_eval/tasks/kmmlu/hard/kmmlu_hard_real_estate.yaml
lm_eval/tasks/kmmlu/hard/kmmlu_hard_real_estate.yaml
+3
-0
lm_eval/tasks/kmmlu/hard/kmmlu_hard_refrigerating_machinery.yaml
.../tasks/kmmlu/hard/kmmlu_hard_refrigerating_machinery.yaml
+3
-0
lm_eval/tasks/kmmlu/hard/kmmlu_hard_social_welfare.yaml
lm_eval/tasks/kmmlu/hard/kmmlu_hard_social_welfare.yaml
+3
-0
lm_eval/tasks/kmmlu/hard/kmmlu_hard_taxation.yaml
lm_eval/tasks/kmmlu/hard/kmmlu_hard_taxation.yaml
+3
-0
lm_eval/tasks/kmmlu/hard/kmmlu_hard_telecommunications_and_wireless_technology.yaml
...mmlu_hard_telecommunications_and_wireless_technology.yaml
+3
-0
lm_eval/tasks/kmmlu/kmmlu_accounting.yaml
lm_eval/tasks/kmmlu/kmmlu_accounting.yaml
+0
-3
lm_eval/tasks/kmmlu/kmmlu_agricultural_sciences.yaml
lm_eval/tasks/kmmlu/kmmlu_agricultural_sciences.yaml
+0
-3
lm_eval/tasks/kmmlu/kmmlu_aviation_engineering_and_maintenance.yaml
...sks/kmmlu/kmmlu_aviation_engineering_and_maintenance.yaml
+0
-3
lm_eval/tasks/kmmlu/kmmlu_biology.yaml
lm_eval/tasks/kmmlu/kmmlu_biology.yaml
+0
-3
lm_eval/tasks/kmmlu/kmmlu_chemical_engineering.yaml
lm_eval/tasks/kmmlu/kmmlu_chemical_engineering.yaml
+0
-3
lm_eval/tasks/kmmlu/kmmlu_chemistry.yaml
lm_eval/tasks/kmmlu/kmmlu_chemistry.yaml
+0
-3
lm_eval/tasks/kmmlu/kmmlu_civil_engineering.yaml
lm_eval/tasks/kmmlu/kmmlu_civil_engineering.yaml
+0
-3
lm_eval/tasks/kmmlu/kmmlu_computer_science.yaml
lm_eval/tasks/kmmlu/kmmlu_computer_science.yaml
+0
-3
lm_eval/tasks/kmmlu/kmmlu_construction.yaml
lm_eval/tasks/kmmlu/kmmlu_construction.yaml
+0
-3
lm_eval/tasks/kmmlu/kmmlu_criminal_law.yaml
lm_eval/tasks/kmmlu/kmmlu_criminal_law.yaml
+0
-3
lm_eval/tasks/kmmlu/kmmlu_ecology.yaml
lm_eval/tasks/kmmlu/kmmlu_ecology.yaml
+0
-3
lm_eval/tasks/kmmlu/kmmlu_economics.yaml
lm_eval/tasks/kmmlu/kmmlu_economics.yaml
+0
-3
lm_eval/tasks/kmmlu/kmmlu_education.yaml
lm_eval/tasks/kmmlu/kmmlu_education.yaml
+0
-3
No files found.
lm_eval/tasks/kmmlu/hard/kmmlu_hard_public_safety.yaml
0 → 100644
View file @
9822b06e
dataset_name
:
public_safety
include
:
_hard_kmmlu_yaml
task
:
kmmlu_hard_public_safety
lm_eval/tasks/kmmlu/hard/kmmlu_hard_railway_and_automotive_engineering.yaml
0 → 100644
View file @
9822b06e
dataset_name
:
railway_and_automotive_engineering
include
:
_hard_kmmlu_yaml
task
:
kmmlu_hard_railway_and_automotive_engineering
lm_eval/tasks/kmmlu/hard/kmmlu_hard_real_estate.yaml
0 → 100644
View file @
9822b06e
dataset_name
:
real_estate
include
:
_hard_kmmlu_yaml
task
:
kmmlu_hard_real_estate
lm_eval/tasks/kmmlu/hard/kmmlu_hard_refrigerating_machinery.yaml
0 → 100644
View file @
9822b06e
dataset_name
:
refrigerating_machinery
include
:
_hard_kmmlu_yaml
task
:
kmmlu_hard_refrigerating_machinery
lm_eval/tasks/kmmlu/hard/kmmlu_hard_social_welfare.yaml
0 → 100644
View file @
9822b06e
dataset_name
:
social_welfare
include
:
_hard_kmmlu_yaml
task
:
kmmlu_hard_social_welfare
lm_eval/tasks/kmmlu/hard/kmmlu_hard_taxation.yaml
0 → 100644
View file @
9822b06e
dataset_name
:
taxation
include
:
_hard_kmmlu_yaml
task
:
kmmlu_hard_taxation
lm_eval/tasks/kmmlu/hard/kmmlu_hard_telecommunications_and_wireless_technology.yaml
0 → 100644
View file @
9822b06e
dataset_name
:
telecommunications_and_wireless_technology
include
:
_hard_kmmlu_yaml
task
:
kmmlu_hard_telecommunications_and_wireless_technology
lm_eval/tasks/kmmlu/kmmlu_accounting.yaml
deleted
100644 → 0
View file @
51f27158
"
dataset_name"
:
"
Accounting"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_accounting"
lm_eval/tasks/kmmlu/kmmlu_agricultural_sciences.yaml
deleted
100644 → 0
View file @
51f27158
"
dataset_name"
:
"
Agricultural-Sciences"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_agricultural_sciences"
lm_eval/tasks/kmmlu/kmmlu_aviation_engineering_and_maintenance.yaml
deleted
100644 → 0
View file @
51f27158
"
dataset_name"
:
"
Aviation-Engineering-and-Maintenance"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_aviation_engineering_and_maintenance"
lm_eval/tasks/kmmlu/kmmlu_biology.yaml
deleted
100644 → 0
View file @
51f27158
"
dataset_name"
:
"
Biology"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_biology"
lm_eval/tasks/kmmlu/kmmlu_chemical_engineering.yaml
deleted
100644 → 0
View file @
51f27158
"
dataset_name"
:
"
Chemical-Engineering"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_chemical_engineering"
lm_eval/tasks/kmmlu/kmmlu_chemistry.yaml
deleted
100644 → 0
View file @
51f27158
"
dataset_name"
:
"
Chemistry"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_chemistry"
lm_eval/tasks/kmmlu/kmmlu_civil_engineering.yaml
deleted
100644 → 0
View file @
51f27158
"
dataset_name"
:
"
Civil-Engineering"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_civil_engineering"
lm_eval/tasks/kmmlu/kmmlu_computer_science.yaml
deleted
100644 → 0
View file @
51f27158
"
dataset_name"
:
"
Computer-Science"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_computer_science"
lm_eval/tasks/kmmlu/kmmlu_construction.yaml
deleted
100644 → 0
View file @
51f27158
"
dataset_name"
:
"
Construction"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_construction"
lm_eval/tasks/kmmlu/kmmlu_criminal_law.yaml
deleted
100644 → 0
View file @
51f27158
"
dataset_name"
:
"
Criminal-Law"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_criminal_law"
lm_eval/tasks/kmmlu/kmmlu_ecology.yaml
deleted
100644 → 0
View file @
51f27158
"
dataset_name"
:
"
Ecology"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_ecology"
lm_eval/tasks/kmmlu/kmmlu_economics.yaml
deleted
100644 → 0
View file @
51f27158
"
dataset_name"
:
"
Economics"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_economics"
lm_eval/tasks/kmmlu/kmmlu_education.yaml
deleted
100644 → 0
View file @
51f27158
"
dataset_name"
:
"
Education"
"
include"
:
"
_default_kmmlu_yaml"
"
task"
:
"
kmmlu_education"
Prev
1
…
16
17
18
19
20
21
22
23
24
…
33
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment