gaoqiong / lm-evaluation-harness
Commit 5c006ed4 (unverified)
Authored Jan 25, 2025 by Minho Ryu
Committed by GitHub, Jan 24, 2025
separate category for `global_mmlu` (#2652)
* separate category
* set version 0.0
* apply precommit
Parent: 370e2f9e
Changes: 193
Showing 20 changed files with 20 additions and 20 deletions (+20 -20)
lm_eval/tasks/global_mmlu/full/fil/_global_mmlu_full_fil.yaml  +1 -1
lm_eval/tasks/global_mmlu/full/fr/_global_mmlu_full_fr.yaml  +1 -1
lm_eval/tasks/global_mmlu/full/ha/_global_mmlu_full_ha.yaml  +1 -1
lm_eval/tasks/global_mmlu/full/he/_global_mmlu_full_he.yaml  +1 -1
lm_eval/tasks/global_mmlu/full/hi/_global_mmlu_full_hi.yaml  +1 -1
lm_eval/tasks/global_mmlu/full/id/_global_mmlu_full_id.yaml  +1 -1
lm_eval/tasks/global_mmlu/full/ig/_global_mmlu_full_ig.yaml  +1 -1
lm_eval/tasks/global_mmlu/full/it/_global_mmlu_full_it.yaml  +1 -1
lm_eval/tasks/global_mmlu/full/ja/_global_mmlu_full_ja.yaml  +1 -1
lm_eval/tasks/global_mmlu/full/ko/_global_mmlu_full_ko.yaml  +1 -1
lm_eval/tasks/global_mmlu/full/ky/_global_mmlu_full_ky.yaml  +1 -1
lm_eval/tasks/global_mmlu/full/lt/_global_mmlu_full_lt.yaml  +1 -1
lm_eval/tasks/global_mmlu/full/mg/_global_mmlu_full_mg.yaml  +1 -1
lm_eval/tasks/global_mmlu/full/ms/_global_mmlu_full_ms.yaml  +1 -1
lm_eval/tasks/global_mmlu/full/ne/_global_mmlu_full_ne.yaml  +1 -1
lm_eval/tasks/global_mmlu/full/nl/_global_mmlu_full_nl.yaml  +1 -1
lm_eval/tasks/global_mmlu/full/ny/_global_mmlu_full_ny.yaml  +1 -1
lm_eval/tasks/global_mmlu/full/pl/_global_mmlu_full_pl.yaml  +1 -1
lm_eval/tasks/global_mmlu/full/pt/_global_mmlu_full_pt.yaml  +1 -1
lm_eval/tasks/global_mmlu/full/ro/_global_mmlu_full_ro.yaml  +1 -1
lm_eval/tasks/global_mmlu/full/fil/_global_mmlu_full_fil.yaml
@@ -8,4 +8,4 @@ aggregate_metric_list:
   - metric: acc
     weight_by_size: True
 metadata:
-  version: 1.0
+  version: 0.0
lm_eval/tasks/global_mmlu/full/fr/_global_mmlu_full_fr.yaml
@@ -8,4 +8,4 @@ aggregate_metric_list:
   - metric: acc
     weight_by_size: True
 metadata:
-  version: 1.0
+  version: 0.0
lm_eval/tasks/global_mmlu/full/ha/_global_mmlu_full_ha.yaml
@@ -8,4 +8,4 @@ aggregate_metric_list:
   - metric: acc
     weight_by_size: True
 metadata:
-  version: 1.0
+  version: 0.0
lm_eval/tasks/global_mmlu/full/he/_global_mmlu_full_he.yaml
@@ -8,4 +8,4 @@ aggregate_metric_list:
   - metric: acc
     weight_by_size: True
 metadata:
-  version: 1.0
+  version: 0.0
lm_eval/tasks/global_mmlu/full/hi/_global_mmlu_full_hi.yaml
@@ -8,4 +8,4 @@ aggregate_metric_list:
   - metric: acc
     weight_by_size: True
 metadata:
-  version: 1.0
+  version: 0.0
lm_eval/tasks/global_mmlu/full/id/_global_mmlu_full_id.yaml
@@ -8,4 +8,4 @@ aggregate_metric_list:
   - metric: acc
     weight_by_size: True
 metadata:
-  version: 1.0
+  version: 0.0
lm_eval/tasks/global_mmlu/full/ig/_global_mmlu_full_ig.yaml
@@ -8,4 +8,4 @@ aggregate_metric_list:
   - metric: acc
     weight_by_size: True
 metadata:
-  version: 1.0
+  version: 0.0
lm_eval/tasks/global_mmlu/full/it/_global_mmlu_full_it.yaml
@@ -8,4 +8,4 @@ aggregate_metric_list:
   - metric: acc
     weight_by_size: True
 metadata:
-  version: 1.0
+  version: 0.0
lm_eval/tasks/global_mmlu/full/ja/_global_mmlu_full_ja.yaml
@@ -8,4 +8,4 @@ aggregate_metric_list:
   - metric: acc
     weight_by_size: True
 metadata:
-  version: 1.0
+  version: 0.0
lm_eval/tasks/global_mmlu/full/ko/_global_mmlu_full_ko.yaml
@@ -8,4 +8,4 @@ aggregate_metric_list:
   - metric: acc
     weight_by_size: True
 metadata:
-  version: 1.0
+  version: 0.0
lm_eval/tasks/global_mmlu/full/ky/_global_mmlu_full_ky.yaml
@@ -8,4 +8,4 @@ aggregate_metric_list:
   - metric: acc
     weight_by_size: True
 metadata:
-  version: 1.0
+  version: 0.0
lm_eval/tasks/global_mmlu/full/lt/_global_mmlu_full_lt.yaml
@@ -8,4 +8,4 @@ aggregate_metric_list:
   - metric: acc
     weight_by_size: True
 metadata:
-  version: 1.0
+  version: 0.0
lm_eval/tasks/global_mmlu/full/mg/_global_mmlu_full_mg.yaml
@@ -8,4 +8,4 @@ aggregate_metric_list:
   - metric: acc
     weight_by_size: True
 metadata:
-  version: 1.0
+  version: 0.0
lm_eval/tasks/global_mmlu/full/ms/_global_mmlu_full_ms.yaml
@@ -8,4 +8,4 @@ aggregate_metric_list:
   - metric: acc
     weight_by_size: True
 metadata:
-  version: 1.0
+  version: 0.0
lm_eval/tasks/global_mmlu/full/ne/_global_mmlu_full_ne.yaml
@@ -8,4 +8,4 @@ aggregate_metric_list:
   - metric: acc
     weight_by_size: True
 metadata:
-  version: 1.0
+  version: 0.0
lm_eval/tasks/global_mmlu/full/nl/_global_mmlu_full_nl.yaml
@@ -8,4 +8,4 @@ aggregate_metric_list:
   - metric: acc
     weight_by_size: True
 metadata:
-  version: 1.0
+  version: 0.0
lm_eval/tasks/global_mmlu/full/ny/_global_mmlu_full_ny.yaml
@@ -8,4 +8,4 @@ aggregate_metric_list:
   - metric: acc
     weight_by_size: True
 metadata:
-  version: 1.0
+  version: 0.0
lm_eval/tasks/global_mmlu/full/pl/_global_mmlu_full_pl.yaml
@@ -8,4 +8,4 @@ aggregate_metric_list:
   - metric: acc
     weight_by_size: True
 metadata:
-  version: 1.0
+  version: 0.0
lm_eval/tasks/global_mmlu/full/pt/_global_mmlu_full_pt.yaml
@@ -8,4 +8,4 @@ aggregate_metric_list:
   - metric: acc
     weight_by_size: True
 metadata:
-  version: 1.0
+  version: 0.0
lm_eval/tasks/global_mmlu/full/ro/_global_mmlu_full_ro.yaml
@@ -8,4 +8,4 @@ aggregate_metric_list:
   - metric: acc
     weight_by_size: True
 metadata:
-  version: 1.0
+  version: 0.0
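
The context lines in each hunk keep `metric: acc` with `weight_by_size: True` under `aggregate_metric_list`. As a rough illustration of what such a setting usually implies, the sketch below computes a group accuracy as a size-weighted mean of per-subtask accuracies. It assumes `weight_by_size` means weighting each subtask by its number of examples; the function name `aggregate_acc` and the numbers are hypothetical and are not taken from the harness's code.

```python
from typing import Sequence


def aggregate_acc(
    scores: Sequence[float],
    sizes: Sequence[int],
    weight_by_size: bool = True,
) -> float:
    """Combine per-subtask accuracies into a single group accuracy.

    Hypothetical helper for illustration only; not the harness implementation.
    """
    if len(scores) != len(sizes) or not scores:
        raise ValueError("scores and sizes must be non-empty and equal in length")
    if not weight_by_size:
        # Plain mean: every subtask counts equally.
        return sum(scores) / len(scores)
    # Size-weighted mean: every example counts equally.
    total = sum(sizes)
    return sum(s * n for s, n in zip(scores, sizes)) / total


# Hypothetical subtask results: accuracy and number of examples.
subtask_acc = [0.5, 0.75]
subtask_size = [100, 300]

weighted = aggregate_acc(subtask_acc, subtask_size)
unweighted = aggregate_acc(subtask_acc, subtask_size, weight_by_size=False)
print(weighted, unweighted)
```

With unequal subtask sizes the two averages differ, which is why the flag matters for a group that combines many subject-level subtasks.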