Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
b2c090cc
"...git@developer.sourcefind.cn:sugon_wxj/megatron-lm.git" did not exist on "feea48cd06e5054cdc3778e03f0b26634b9ed718"
Unverified
Commit
b2c090cc
authored
Jan 22, 2025
by
Minho Ryu
Committed by
GitHub
Jan 21, 2025
Browse files
aggregate by group (total and categories) (#2643)
parent
ed9c6fc8
Changes
204
Hide whitespace changes
Inline
Side-by-side
Showing
4 changed files
with
4 additions
and
0 deletions
+4
-0
lm_eval/tasks/kmmlu/hard/kmmlu_hard_refrigerating_machinery.yaml
.../tasks/kmmlu/hard/kmmlu_hard_refrigerating_machinery.yaml
+1
-0
lm_eval/tasks/kmmlu/hard/kmmlu_hard_social_welfare.yaml
lm_eval/tasks/kmmlu/hard/kmmlu_hard_social_welfare.yaml
+1
-0
lm_eval/tasks/kmmlu/hard/kmmlu_hard_taxation.yaml
lm_eval/tasks/kmmlu/hard/kmmlu_hard_taxation.yaml
+1
-0
lm_eval/tasks/kmmlu/hard/kmmlu_hard_telecommunications_and_wireless_technology.yaml
...mmlu_hard_telecommunications_and_wireless_technology.yaml
+1
-0
No files found.
lm_eval/tasks/kmmlu/hard/kmmlu_hard_refrigerating_machinery.yaml
View file @
b2c090cc
dataset_name
:
refrigerating_machinery
dataset_name
:
refrigerating_machinery
include
:
_hard_kmmlu_yaml
include
:
_hard_kmmlu_yaml
task
:
kmmlu_hard_refrigerating_machinery
task
:
kmmlu_hard_refrigerating_machinery
tag
:
kmmlu_hard_other_tasks
lm_eval/tasks/kmmlu/hard/kmmlu_hard_social_welfare.yaml
View file @
b2c090cc
dataset_name
:
social_welfare
dataset_name
:
social_welfare
include
:
_hard_kmmlu_yaml
include
:
_hard_kmmlu_yaml
task
:
kmmlu_hard_social_welfare
task
:
kmmlu_hard_social_welfare
tag
:
kmmlu_hard_humss_tasks
lm_eval/tasks/kmmlu/hard/kmmlu_hard_taxation.yaml
View file @
b2c090cc
dataset_name
:
taxation
dataset_name
:
taxation
include
:
_hard_kmmlu_yaml
include
:
_hard_kmmlu_yaml
task
:
kmmlu_hard_taxation
task
:
kmmlu_hard_taxation
tag
:
kmmlu_hard_humss_tasks
lm_eval/tasks/kmmlu/hard/kmmlu_hard_telecommunications_and_wireless_technology.yaml
View file @
b2c090cc
dataset_name
:
telecommunications_and_wireless_technology
dataset_name
:
telecommunications_and_wireless_technology
include
:
_hard_kmmlu_yaml
include
:
_hard_kmmlu_yaml
task
:
kmmlu_hard_telecommunications_and_wireless_technology
task
:
kmmlu_hard_telecommunications_and_wireless_technology
tag
:
kmmlu_hard_applied_science_tasks
Prev
1
…
7
8
9
10
11
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment