gaoqiong / lm-evaluation-harness
Commit 2b56339e, authored Jan 17, 2025 by Baber
Merge branch 'main' into longcxt
Parents: 0b533339, 703fbffd
Changes: 316
Showing 20 changed files with 117 additions and 0 deletions
lm_eval/tasks/mlqa/mlqa_ar_en.yaml (+5 -0)
lm_eval/tasks/mlqa/mlqa_ar_es.yaml (+5 -0)
lm_eval/tasks/mlqa/mlqa_ar_hi.yaml (+5 -0)
lm_eval/tasks/mlqa/mlqa_ar_vi.yaml (+5 -0)
lm_eval/tasks/mlqa/mlqa_ar_zh.yaml (+5 -0)
lm_eval/tasks/mlqa/mlqa_common_yaml (+22 -0)
lm_eval/tasks/mlqa/mlqa_de_ar.yaml (+5 -0)
lm_eval/tasks/mlqa/mlqa_de_de.yaml (+5 -0)
lm_eval/tasks/mlqa/mlqa_de_en.yaml (+5 -0)
lm_eval/tasks/mlqa/mlqa_de_es.yaml (+5 -0)
lm_eval/tasks/mlqa/mlqa_de_hi.yaml (+5 -0)
lm_eval/tasks/mlqa/mlqa_de_vi.yaml (+5 -0)
lm_eval/tasks/mlqa/mlqa_de_zh.yaml (+5 -0)
lm_eval/tasks/mlqa/mlqa_en_ar.yaml (+5 -0)
lm_eval/tasks/mlqa/mlqa_en_de.yaml (+5 -0)
lm_eval/tasks/mlqa/mlqa_en_en.yaml (+5 -0)
lm_eval/tasks/mlqa/mlqa_en_es.yaml (+5 -0)
lm_eval/tasks/mlqa/mlqa_en_hi.yaml (+5 -0)
lm_eval/tasks/mlqa/mlqa_en_vi.yaml (+5 -0)
lm_eval/tasks/mlqa/mlqa_en_zh.yaml (+5 -0)
lm_eval/tasks/mlqa/mlqa_ar_en.yaml
0 → 100644
# Generated by generate_tasks.py
include: mlqa_common_yaml
task: mlqa_ar_en
dataset_name: mlqa.ar.en
process_results: !function utils.process_results_ar
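The per-pair files in this commit all follow one pattern: the task name and `dataset_name` carry both language codes, and the `process_results` hook is chosen by the first code. The generator itself (`generate_tasks.py`) is not part of this diff, so the following is only a minimal sketch of how such files could be produced. The names `TEMPLATE`, `render_task`, and `render_all` are hypothetical, and it assumes every pairing of the seven codes seen on this page is generated, which this page alone does not confirm.

```python
# Hypothetical sketch: the real generate_tasks.py is not shown in this diff.
# Language codes are the ones that appear in the file names on this page.
LANGS = ["ar", "de", "en", "es", "hi", "vi", "zh"]

# Mirrors the five added lines of each per-pair task file.
TEMPLATE = (
    "# Generated by generate_tasks.py\n"
    "include: mlqa_common_yaml\n"
    "task: mlqa_{first}_{second}\n"
    "dataset_name: mlqa.{first}.{second}\n"
    "process_results: !function utils.process_results_{first}\n"
)


def render_task(first: str, second: str) -> str:
    """Return the YAML text for one language pair."""
    return TEMPLATE.format(first=first, second=second)


def render_all(langs=LANGS) -> dict:
    """Map each output file name to its YAML text (assumes all pairs exist)."""
    return {
        f"mlqa_{first}_{second}.yaml": render_task(first, second)
        for first in langs
        for second in langs
    }


if __name__ == "__main__":
    print(render_task("ar", "en"))
```

Keeping the shared settings in `mlqa_common_yaml` and generating only the five differing lines per pair means a change to the prompt or metrics touches one file rather than every task file.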
lm_eval/tasks/mlqa/mlqa_ar_es.yaml
0 → 100644
# Generated by generate_tasks.py
include: mlqa_common_yaml
task: mlqa_ar_es
dataset_name: mlqa.ar.es
process_results: !function utils.process_results_ar
lm_eval/tasks/mlqa/mlqa_ar_hi.yaml
0 → 100644
# Generated by generate_tasks.py
include: mlqa_common_yaml
task: mlqa_ar_hi
dataset_name: mlqa.ar.hi
process_results: !function utils.process_results_ar
lm_eval/tasks/mlqa/mlqa_ar_vi.yaml
0 → 100644
# Generated by generate_tasks.py
include: mlqa_common_yaml
task: mlqa_ar_vi
dataset_name: mlqa.ar.vi
process_results: !function utils.process_results_ar
lm_eval/tasks/mlqa/mlqa_ar_zh.yaml
0 → 100644
# Generated by generate_tasks.py
include: mlqa_common_yaml
task: mlqa_ar_zh
dataset_name: mlqa.ar.zh
process_results: !function utils.process_results_ar
lm_eval/tasks/mlqa/mlqa_common_yaml
0 → 100644
dataset_path: facebook/mlqa
dataset_kwargs:
  trust_remote_code: true
test_split: test
validation_split: validation
output_type: generate_until
doc_to_text: "Context: {{context}}\n\nQuestion: {{question}}\n\nAnswer:"
doc_to_target: "{{answers}}"
process_docs: !function utils.process_docs
metric_list:
  - metric: exact_match
    aggregation: mean
    higher_is_better: true
  - metric: f1
    aggregation: mean
    higher_is_better: true
generation_kwargs:
  until:
    - "\n"
  do_sample: false
metadata:
  version: 0.0
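The shared config builds each prompt from `doc_to_text` and stops generation at the first newline (`until: ["\n"]`). As a rough illustration of those two settings, the sketch below uses plain Python in place of the harness's template rendering and stop handling. The helper names `build_prompt` and `truncate_at_stop` are hypothetical and are not part of lm-evaluation-harness.

```python
# Sketch only: plain-Python stand-ins for the template and stop-sequence
# behaviour described by the config; not the harness's own code.

def build_prompt(doc: dict) -> str:
    """Fill the doc_to_text pattern with a document's context and question."""
    return (
        f"Context: {doc['context']}\n\n"
        f"Question: {doc['question']}\n\n"
        "Answer:"
    )


def truncate_at_stop(generation: str, until=("\n",)) -> str:
    """Cut the generation at the earliest occurrence of any stop string."""
    cut = len(generation)
    for stop in until:
        idx = generation.find(stop)
        if idx != -1:
            cut = min(cut, idx)
    return generation[:cut]


if __name__ == "__main__":
    doc = {
        "context": "Paris is the capital of France.",
        "question": "What is the capital of France?",
    }
    print(build_prompt(doc))
    print(repr(truncate_at_stop(" Paris\nQuestion: What is")))
```

Stopping at the first newline keeps the scored answer to a single line, which suits short extractive answers.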
lm_eval/tasks/mlqa/mlqa_de_ar.yaml
0 → 100644
# Generated by generate_tasks.py
include: mlqa_common_yaml
task: mlqa_de_ar
dataset_name: mlqa.de.ar
process_results: !function utils.process_results_de
lm_eval/tasks/mlqa/mlqa_de_de.yaml
0 → 100644
# Generated by generate_tasks.py
include: mlqa_common_yaml
task: mlqa_de_de
dataset_name: mlqa.de.de
process_results: !function utils.process_results_de
lm_eval/tasks/mlqa/mlqa_de_en.yaml
0 → 100644
# Generated by generate_tasks.py
include: mlqa_common_yaml
task: mlqa_de_en
dataset_name: mlqa.de.en
process_results: !function utils.process_results_de
lm_eval/tasks/mlqa/mlqa_de_es.yaml
0 → 100644
# Generated by generate_tasks.py
include: mlqa_common_yaml
task: mlqa_de_es
dataset_name: mlqa.de.es
process_results: !function utils.process_results_de
lm_eval/tasks/mlqa/mlqa_de_hi.yaml
0 → 100644
# Generated by generate_tasks.py
include: mlqa_common_yaml
task: mlqa_de_hi
dataset_name: mlqa.de.hi
process_results: !function utils.process_results_de
lm_eval/tasks/mlqa/mlqa_de_vi.yaml
0 → 100644
# Generated by generate_tasks.py
include: mlqa_common_yaml
task: mlqa_de_vi
dataset_name: mlqa.de.vi
process_results: !function utils.process_results_de
lm_eval/tasks/mlqa/mlqa_de_zh.yaml
0 → 100644
# Generated by generate_tasks.py
include: mlqa_common_yaml
task: mlqa_de_zh
dataset_name: mlqa.de.zh
process_results: !function utils.process_results_de
lm_eval/tasks/mlqa/mlqa_en_ar.yaml
0 → 100644
# Generated by generate_tasks.py
include: mlqa_common_yaml
task: mlqa_en_ar
dataset_name: mlqa.en.ar
process_results: !function utils.process_results_en
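Each task file points `process_results` at a language-specific function in `utils`, and the shared config reports `exact_match` and `f1`. The `utils` module is not included in this diff, so the following is only a sketch of what an English hook might compute, using SQuAD-style normalization and token-overlap F1. The function names, the normalization rules, and the `doc["answers"]["text"]` layout are all assumptions; the actual `utils.process_results_en` may differ.

```python
import re
import string
from collections import Counter

_PUNCT = set(string.punctuation)


def normalize_en(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in _PUNCT)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(pred: str, gold: str) -> float:
    """1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize_en(pred) == normalize_en(gold))


def f1(pred: str, gold: str) -> float:
    """Token-level F1 between normalized prediction and gold answer."""
    pred_tokens = normalize_en(pred).split()
    gold_tokens = normalize_en(gold).split()
    overlap = sum((Counter(pred_tokens) & Counter(gold_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)


def process_results_en_sketch(doc: dict, results: list) -> dict:
    """Score one generation against every gold answer and keep the best."""
    pred = results[0]
    golds = doc["answers"]["text"]  # assumed SQuAD-style answer layout
    return {
        "exact_match": max(exact_match(pred, g) for g in golds),
        "f1": max(f1(pred, g) for g in golds),
    }
```

A separate hook per language is a plausible design because article stripping and tokenization are language dependent; for example, the English article list above would not apply to German or Arabic text.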
lm_eval/tasks/mlqa/mlqa_en_de.yaml
0 → 100644
# Generated by generate_tasks.py
include: mlqa_common_yaml
task: mlqa_en_de
dataset_name: mlqa.en.de
process_results: !function utils.process_results_en
lm_eval/tasks/mlqa/mlqa_en_en.yaml
0 → 100644
# Generated by generate_tasks.py
include: mlqa_common_yaml
task: mlqa_en_en
dataset_name: mlqa.en.en
process_results: !function utils.process_results_en
lm_eval/tasks/mlqa/mlqa_en_es.yaml
0 → 100644
# Generated by generate_tasks.py
include: mlqa_common_yaml
task: mlqa_en_es
dataset_name: mlqa.en.es
process_results: !function utils.process_results_en
lm_eval/tasks/mlqa/mlqa_en_hi.yaml
0 → 100644
# Generated by generate_tasks.py
include: mlqa_common_yaml
task: mlqa_en_hi
dataset_name: mlqa.en.hi
process_results: !function utils.process_results_en
lm_eval/tasks/mlqa/mlqa_en_vi.yaml
0 → 100644
# Generated by generate_tasks.py
include: mlqa_common_yaml
task: mlqa_en_vi
dataset_name: mlqa.en.vi
process_results: !function utils.process_results_en
lm_eval/tasks/mlqa/mlqa_en_zh.yaml
0 → 100644
# Generated by generate_tasks.py
include: mlqa_common_yaml
task: mlqa_en_zh
dataset_name: mlqa.en.zh
process_results: !function utils.process_results_en