gaoqiong
lm-evaluation-harness
Commit e4db76cb
Authored Jul 09, 2024 by haileyschoelkopf
Merge branch 'main' into multimodal-prototyping
Parents: 6cc6e9cd, ad80f555
Changes: 871
Showing 20 changed files with 53 additions and 56 deletions
lm_eval/tasks/agieval/agieval_cn.yaml +19 -0
lm_eval/tasks/agieval/agieval_en.yaml +18 -0
lm_eval/tasks/agieval/agieval_nous.yaml +16 -0
lm_eval/tasks/agieval/aqua-rat.yaml +0 -4
lm_eval/tasks/agieval/gaokao-biology.yaml +0 -3
lm_eval/tasks/agieval/gaokao-chemistry.yaml +0 -3
lm_eval/tasks/agieval/gaokao-chinese.yaml +0 -3
lm_eval/tasks/agieval/gaokao-english.yaml +0 -3
lm_eval/tasks/agieval/gaokao-geography.yaml +0 -3
lm_eval/tasks/agieval/gaokao-history.yaml +0 -3
lm_eval/tasks/agieval/gaokao-mathcloze.yaml +0 -3
lm_eval/tasks/agieval/gaokao-mathqa.yaml +0 -3
lm_eval/tasks/agieval/gaokao-physics.yaml +0 -3
lm_eval/tasks/agieval/jec-qa-ca.yaml +0 -3
lm_eval/tasks/agieval/jec-qa-kd.yaml +0 -3
lm_eval/tasks/agieval/logiqa-en.yaml +0 -4
lm_eval/tasks/agieval/logiqa-zh.yaml +0 -3
lm_eval/tasks/agieval/lsat-ar.yaml +0 -4
lm_eval/tasks/agieval/lsat-lr.yaml +0 -4
lm_eval/tasks/agieval/lsat-rc.yaml +0 -4
lm_eval/tasks/agieval/agieval_cn.yaml
0 → 100644
group: agieval_cn
task:
  - agieval_gaokao_biology
  - agieval_gaokao_chemistry
  - agieval_gaokao_chinese
  - agieval_gaokao_geography
  - agieval_gaokao_history
  - agieval_gaokao_mathcloze
  - agieval_gaokao_mathqa
  - agieval_gaokao_physics
  - agieval_jec_qa_ca
  - agieval_jec_qa_kd
  - agieval_logiqa_zh
aggregate_metric_list:
  - metric: acc
    aggregation: mean
    weight_by_size: true
metadata:
  version: 0.0
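The `aggregate_metric_list` entry above asks for a `mean` of `acc` with `weight_by_size: true`. As a rough illustration of what a size-weighted mean looks like next to a plain mean, here is a minimal sketch. It is not the harness's own implementation: the function names, the accuracies, and the subtask sizes are all hypothetical.

```python
from typing import Sequence


def size_weighted_mean(scores: Sequence[float], sizes: Sequence[int]) -> float:
    """Mean of per-subtask scores, each weighted by its number of examples."""
    if len(scores) != len(sizes):
        raise ValueError("scores and sizes must have the same length")
    total = sum(sizes)
    if total == 0:
        raise ValueError("at least one subtask must contain examples")
    return sum(score * n for score, n in zip(scores, sizes)) / total


def unweighted_mean(scores: Sequence[float]) -> float:
    """Plain mean that treats every subtask equally, regardless of size."""
    return sum(scores) / len(scores)


# Hypothetical per-subtask accuracies and example counts (not real results).
scores = [0.50, 0.80]
sizes = [100, 300]

weighted = size_weighted_mean(scores, sizes)  # (0.50*100 + 0.80*300) / 400
plain = unweighted_mean(scores)               # (0.50 + 0.80) / 2
```

With these made-up numbers the larger subtask pulls the weighted figure toward its own score, which is the practical difference between the two settings.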
lm_eval/tasks/agieval/agieval_en.yaml
0 → 100644
group: agieval_en
task:
  - agieval_aqua_rat
  - agieval_gaokao_english # categorizing as EN because the AGIEval codebase lists this as in `english_qa_tasks`
  - agieval_logiqa_en
  - agieval_lsat_ar
  - agieval_lsat_lr
  - agieval_lsat_rc
  - agieval_math
  - agieval_sat_en_without_passage
  - agieval_sat_en
  - agieval_sat_math
aggregate_metric_list:
  - metric: acc
    aggregation: mean
    weight_by_size: true
metadata:
  version: 0.0
lm_eval/tasks/agieval/agieval_nous.yaml
0 → 100644
group: agieval_nous
task:
  - agieval_aqua_rat
  - agieval_logiqa_en
  - agieval_lsat_ar
  - agieval_lsat_lr
  - agieval_lsat_rc
  - agieval_sat_en_without_passage
  - agieval_sat_en
  - agieval_sat_math
aggregate_metric_list:
  - metric: acc_norm
    aggregation: mean
    weight_by_size: true
metadata:
  version: 0.0
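The group files above list their member tasks in one place, whereas the per-task files in this diff each carry a `group:` list. The two views hold the same information, and one can be derived from the other. The sketch below inverts the group-to-task mapping for the `agieval_en` and `agieval_nous` files shown above. The task names are copied from those files; the helper `groups_by_task` is a hypothetical name used only for illustration, not part of the harness.

```python
from collections import defaultdict
from typing import Dict, List

# Task lists copied from agieval_en.yaml and agieval_nous.yaml in this diff.
GROUP_TASKS: Dict[str, List[str]] = {
    "agieval_en": [
        "agieval_aqua_rat", "agieval_gaokao_english", "agieval_logiqa_en",
        "agieval_lsat_ar", "agieval_lsat_lr", "agieval_lsat_rc",
        "agieval_math", "agieval_sat_en_without_passage",
        "agieval_sat_en", "agieval_sat_math",
    ],
    "agieval_nous": [
        "agieval_aqua_rat", "agieval_logiqa_en", "agieval_lsat_ar",
        "agieval_lsat_lr", "agieval_lsat_rc",
        "agieval_sat_en_without_passage", "agieval_sat_en",
        "agieval_sat_math",
    ],
}


def groups_by_task(group_tasks: Dict[str, List[str]]) -> Dict[str, List[str]]:
    """Invert a group -> tasks mapping into task -> sorted list of groups."""
    inverted: Dict[str, List[str]] = defaultdict(list)
    for group, tasks in group_tasks.items():
        for task in tasks:
            inverted[task].append(group)
    return {task: sorted(groups) for task, groups in inverted.items()}


membership = groups_by_task(GROUP_TASKS)
# Tasks that appear in agieval_en but not in agieval_nous.
en_only = sorted(t for t, g in membership.items() if g == ["agieval_en"])
```

The derived membership for `agieval_lsat_ar` matches the `agieval_nous` and `agieval_en` entries in the `group:` list of `lsat-ar.yaml`; the umbrella `agieval` group is not covered because its definition is not among the files shown here.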
lm_eval/tasks/agieval/aqua-rat.yaml
group:
  - agieval
  - agieval_en
  - agieval_nous
task: agieval_aqua_rat
dataset_path: hails/agieval-aqua-rat
dataset_name: null
...
lm_eval/tasks/agieval/gaokao-biology.yaml
include: aqua-rat.yaml
group:
  - agieval
  - agieval_cn
task: agieval_gaokao_biology
dataset_path: hails/agieval-gaokao-biology
lm_eval/tasks/agieval/gaokao-chemistry.yaml
include: aqua-rat.yaml
group:
  - agieval
  - agieval_cn
task: agieval_gaokao_chemistry
dataset_path: hails/agieval-gaokao-chemistry
lm_eval/tasks/agieval/gaokao-chinese.yaml
include: aqua-rat.yaml
group:
  - agieval
  - agieval_cn
task: agieval_gaokao_chinese
dataset_path: hails/agieval-gaokao-chinese
lm_eval/tasks/agieval/gaokao-english.yaml
include: aqua-rat.yaml
group:
  - agieval
  - agieval_en # categorizing as EN because the AGIEval codebase lists this as in `english_qa_tasks`
task: agieval_gaokao_english
dataset_path: hails/agieval-gaokao-english
lm_eval/tasks/agieval/gaokao-geography.yaml
include: aqua-rat.yaml
group:
  - agieval
  - agieval_cn
task: agieval_gaokao_geography
dataset_path: hails/agieval-gaokao-geography
lm_eval/tasks/agieval/gaokao-history.yaml
include: aqua-rat.yaml
group:
  - agieval
  - agieval_cn
task: agieval_gaokao_history
dataset_path: hails/agieval-gaokao-history
lm_eval/tasks/agieval/gaokao-mathcloze.yaml
group:
  - agieval
  - agieval_cn
task: agieval_gaokao_mathcloze
dataset_path: hails/agieval-gaokao-mathcloze
dataset_name: null
...
lm_eval/tasks/agieval/gaokao-mathqa.yaml
include: aqua-rat.yaml
group:
  - agieval
  - agieval_cn
task: agieval_gaokao_mathqa
dataset_path: hails/agieval-gaokao-mathqa
lm_eval/tasks/agieval/gaokao-physics.yaml
include: aqua-rat.yaml
group:
  - agieval
  - agieval_cn
task: agieval_gaokao_physics
dataset_path: hails/agieval-gaokao-physics
lm_eval/tasks/agieval/jec-qa-ca.yaml
include: aqua-rat.yaml
group:
  - agieval
  - agieval_cn
task: agieval_jec_qa_ca
dataset_path: hails/agieval-jec-qa-ca
lm_eval/tasks/agieval/jec-qa-kd.yaml
include: aqua-rat.yaml
group:
  - agieval
  - agieval_cn
task: agieval_jec_qa_kd
dataset_path: hails/agieval-jec-qa-kd
lm_eval/tasks/agieval/logiqa-en.yaml
include: aqua-rat.yaml
group:
  - agieval
  - agieval_nous
  - agieval_en
task: agieval_logiqa_en
dataset_path: hails/agieval-logiqa-en
lm_eval/tasks/agieval/logiqa-zh.yaml
include: aqua-rat.yaml
group:
  - agieval
  - agieval_cn
task: agieval_logiqa_zh
dataset_path: hails/agieval-logiqa-zh
lm_eval/tasks/agieval/lsat-ar.yaml
include: aqua-rat.yaml
group:
  - agieval
  - agieval_nous
  - agieval_en
task: agieval_lsat_ar
dataset_path: hails/agieval-lsat-ar
lm_eval/tasks/agieval/lsat-lr.yaml
include: aqua-rat.yaml
group:
  - agieval
  - agieval_nous
  - agieval_en
task: agieval_lsat_lr
dataset_path: hails/agieval-lsat-lr
lm_eval/tasks/agieval/lsat-rc.yaml
include: aqua-rat.yaml
group:
  - agieval
  - agieval_nous
  - agieval_en
task: agieval_lsat_rc
dataset_path: hails/agieval-lsat-rc