Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
a2af2101
Unverified
Commit
a2af2101
authored
Jul 12, 2024
by
Yen-Ting Lin
Committed by
GitHub
Jul 12, 2024
Browse files
Merge branch 'EleutherAI:main' into main
parents
82cb25c1
d5f39bf8
Changes
1000
Hide whitespace changes
Inline
Side-by-side
Showing
20 changed files
with
17 additions
and
29 deletions
+17
-29
lm_eval/tasks/bigbench/multiple_choice/gender_inclusive_sentences_german.yaml
...ch/multiple_choice/gender_inclusive_sentences_german.yaml
+0
-4
lm_eval/tasks/bigbench/multiple_choice/general_knowledge.yaml
...val/tasks/bigbench/multiple_choice/general_knowledge.yaml
+1
-1
lm_eval/tasks/bigbench/multiple_choice/geometric_shapes.yaml
lm_eval/tasks/bigbench/multiple_choice/geometric_shapes.yaml
+1
-1
lm_eval/tasks/bigbench/multiple_choice/goal_step_wikihow.yaml
...val/tasks/bigbench/multiple_choice/goal_step_wikihow.yaml
+1
-1
lm_eval/tasks/bigbench/multiple_choice/gre_reading_comprehension.yaml
...s/bigbench/multiple_choice/gre_reading_comprehension.yaml
+1
-1
lm_eval/tasks/bigbench/multiple_choice/hhh_alignment.yaml
lm_eval/tasks/bigbench/multiple_choice/hhh_alignment.yaml
+1
-1
lm_eval/tasks/bigbench/multiple_choice/hindi_question_answering.yaml
...ks/bigbench/multiple_choice/hindi_question_answering.yaml
+0
-4
lm_eval/tasks/bigbench/multiple_choice/hindu_knowledge.yaml
lm_eval/tasks/bigbench/multiple_choice/hindu_knowledge.yaml
+1
-1
lm_eval/tasks/bigbench/multiple_choice/hinglish_toxicity.yaml
...val/tasks/bigbench/multiple_choice/hinglish_toxicity.yaml
+1
-1
lm_eval/tasks/bigbench/multiple_choice/human_organs_senses.yaml
...l/tasks/bigbench/multiple_choice/human_organs_senses.yaml
+1
-1
lm_eval/tasks/bigbench/multiple_choice/hyperbaton.yaml
lm_eval/tasks/bigbench/multiple_choice/hyperbaton.yaml
+1
-1
lm_eval/tasks/bigbench/multiple_choice/identify_math_theorems.yaml
...asks/bigbench/multiple_choice/identify_math_theorems.yaml
+1
-1
lm_eval/tasks/bigbench/multiple_choice/identify_odd_metaphor.yaml
...tasks/bigbench/multiple_choice/identify_odd_metaphor.yaml
+1
-1
lm_eval/tasks/bigbench/multiple_choice/implicatures.yaml
lm_eval/tasks/bigbench/multiple_choice/implicatures.yaml
+1
-1
lm_eval/tasks/bigbench/multiple_choice/implicit_relations.yaml
...al/tasks/bigbench/multiple_choice/implicit_relations.yaml
+1
-1
lm_eval/tasks/bigbench/multiple_choice/intent_recognition.yaml
...al/tasks/bigbench/multiple_choice/intent_recognition.yaml
+1
-1
lm_eval/tasks/bigbench/multiple_choice/international_phonetic_alphabet_nli.yaml
.../multiple_choice/international_phonetic_alphabet_nli.yaml
+1
-1
lm_eval/tasks/bigbench/multiple_choice/international_phonetic_alphabet_transliterate.yaml
...choice/international_phonetic_alphabet_transliterate.yaml
+0
-4
lm_eval/tasks/bigbench/multiple_choice/intersect_geometry.yaml
...al/tasks/bigbench/multiple_choice/intersect_geometry.yaml
+1
-1
lm_eval/tasks/bigbench/multiple_choice/irony_identification.yaml
.../tasks/bigbench/multiple_choice/irony_identification.yaml
+1
-1
No files found.
Too many changes to show.
To preserve performance only
1000 of 1000+
files are displayed.
Plain diff
Email patch
lm_eval/tasks/bigbench/multiple_choice/gender_inclusive_sentences_german.yaml
deleted
100644 → 0
View file @
82cb25c1
# Generated by utils.py
dataset_name
:
gender_inclusive_sentences_german_zero_shot
include
:
../multiple_choice_template_yaml
task
:
bigbench_gender_inclusive_sentences_german_multiple_choice
lm_eval/tasks/bigbench/multiple_choice/general_knowledge.yaml
View file @
a2af2101
# Generated by utils.py
dataset_name
:
general_knowledge_zero_shot
include
:
../multiple_choice_template_yaml
include
:
../multiple_choice_template_
a_
yaml
task
:
bigbench_general_knowledge_multiple_choice
lm_eval/tasks/bigbench/multiple_choice/geometric_shapes.yaml
View file @
a2af2101
# Generated by utils.py
dataset_name
:
geometric_shapes_zero_shot
include
:
../multiple_choice_template_yaml
include
:
../multiple_choice_template_
a_
yaml
task
:
bigbench_geometric_shapes_multiple_choice
lm_eval/tasks/bigbench/multiple_choice/goal_step_wikihow.yaml
View file @
a2af2101
# Generated by utils.py
dataset_name
:
goal_step_wikihow_zero_shot
include
:
../multiple_choice_template_yaml
include
:
../multiple_choice_template_
a_
yaml
task
:
bigbench_goal_step_wikihow_multiple_choice
lm_eval/tasks/bigbench/multiple_choice/gre_reading_comprehension.yaml
View file @
a2af2101
# Generated by utils.py
dataset_name
:
gre_reading_comprehension_zero_shot
include
:
../multiple_choice_template_yaml
include
:
../multiple_choice_template_
a_
yaml
task
:
bigbench_gre_reading_comprehension_multiple_choice
lm_eval/tasks/bigbench/multiple_choice/hhh_alignment.yaml
View file @
a2af2101
# Generated by utils.py
dataset_name
:
hhh_alignment_zero_shot
include
:
../multiple_choice_template_yaml
include
:
../multiple_choice_template_
a_
yaml
task
:
bigbench_hhh_alignment_multiple_choice
lm_eval/tasks/bigbench/multiple_choice/hindi_question_answering.yaml
deleted
100644 → 0
View file @
82cb25c1
# Generated by utils.py
dataset_name
:
hindi_question_answering_zero_shot
include
:
../multiple_choice_template_yaml
task
:
bigbench_hindi_question_answering_multiple_choice
lm_eval/tasks/bigbench/multiple_choice/hindu_knowledge.yaml
View file @
a2af2101
# Generated by utils.py
dataset_name
:
hindu_knowledge_zero_shot
include
:
../multiple_choice_template_yaml
include
:
../multiple_choice_template_
a_
yaml
task
:
bigbench_hindu_knowledge_multiple_choice
lm_eval/tasks/bigbench/multiple_choice/hinglish_toxicity.yaml
View file @
a2af2101
# Generated by utils.py
dataset_name
:
hinglish_toxicity_zero_shot
include
:
../multiple_choice_template_yaml
include
:
../multiple_choice_template_
a_
yaml
task
:
bigbench_hinglish_toxicity_multiple_choice
lm_eval/tasks/bigbench/multiple_choice/human_organs_senses.yaml
View file @
a2af2101
# Generated by utils.py
dataset_name
:
human_organs_senses_zero_shot
include
:
../multiple_choice_template_yaml
include
:
../multiple_choice_template_
a_
yaml
task
:
bigbench_human_organs_senses_multiple_choice
lm_eval/tasks/bigbench/multiple_choice/hyperbaton.yaml
View file @
a2af2101
# Generated by utils.py
dataset_name
:
hyperbaton_zero_shot
include
:
../multiple_choice_template_yaml
include
:
../multiple_choice_template_
a_
yaml
task
:
bigbench_hyperbaton_multiple_choice
lm_eval/tasks/bigbench/multiple_choice/identify_math_theorems.yaml
View file @
a2af2101
# Generated by utils.py
dataset_name
:
identify_math_theorems_zero_shot
include
:
../multiple_choice_template_yaml
include
:
../multiple_choice_template_
a_
yaml
task
:
bigbench_identify_math_theorems_multiple_choice
lm_eval/tasks/bigbench/multiple_choice/identify_odd_metaphor.yaml
View file @
a2af2101
# Generated by utils.py
dataset_name
:
identify_odd_metaphor_zero_shot
include
:
../multiple_choice_template_yaml
include
:
../multiple_choice_template_
a_
yaml
task
:
bigbench_identify_odd_metaphor_multiple_choice
lm_eval/tasks/bigbench/multiple_choice/implicatures.yaml
View file @
a2af2101
# Generated by utils.py
dataset_name
:
implicatures_zero_shot
include
:
../multiple_choice_template_yaml
include
:
../multiple_choice_template_
a_
yaml
task
:
bigbench_implicatures_multiple_choice
lm_eval/tasks/bigbench/multiple_choice/implicit_relations.yaml
View file @
a2af2101
# Generated by utils.py
dataset_name
:
implicit_relations_zero_shot
include
:
../multiple_choice_template_yaml
include
:
../multiple_choice_template_
a_
yaml
task
:
bigbench_implicit_relations_multiple_choice
lm_eval/tasks/bigbench/multiple_choice/intent_recognition.yaml
View file @
a2af2101
# Generated by utils.py
dataset_name
:
intent_recognition_zero_shot
include
:
../multiple_choice_template_yaml
include
:
../multiple_choice_template_
a_
yaml
task
:
bigbench_intent_recognition_multiple_choice
lm_eval/tasks/bigbench/multiple_choice/international_phonetic_alphabet_nli.yaml
View file @
a2af2101
# Generated by utils.py
dataset_name
:
international_phonetic_alphabet_nli_zero_shot
include
:
../multiple_choice_template_yaml
include
:
../multiple_choice_template_
a_
yaml
task
:
bigbench_international_phonetic_alphabet_nli_multiple_choice
lm_eval/tasks/bigbench/multiple_choice/international_phonetic_alphabet_transliterate.yaml
deleted
100644 → 0
View file @
82cb25c1
# Generated by utils.py
dataset_name
:
international_phonetic_alphabet_transliterate_zero_shot
include
:
../multiple_choice_template_yaml
task
:
bigbench_international_phonetic_alphabet_transliterate_multiple_choice
lm_eval/tasks/bigbench/multiple_choice/intersect_geometry.yaml
View file @
a2af2101
# Generated by utils.py
dataset_name
:
intersect_geometry_zero_shot
include
:
../multiple_choice_template_yaml
include
:
../multiple_choice_template_
a_
yaml
task
:
bigbench_intersect_geometry_multiple_choice
lm_eval/tasks/bigbench/multiple_choice/irony_identification.yaml
View file @
a2af2101
# Generated by utils.py
dataset_name
:
irony_identification_zero_shot
include
:
../multiple_choice_template_yaml
include
:
../multiple_choice_template_
a_
yaml
task
:
bigbench_irony_identification_multiple_choice
Prev
1
…
13
14
15
16
17
18
19
20
21
…
50
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment