Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
e4db76cb
Commit
e4db76cb
authored
Jul 09, 2024
by
haileyschoelkopf
Browse files
Merge branch 'main' into multimodal-prototyping
parents
6cc6e9cd
ad80f555
Changes
871
Hide whitespace changes
Inline
Side-by-side
Showing
20 changed files
with
20 additions
and
22 deletions
+20
-22
lm_eval/tasks/super_glue/boolq/default.yaml
lm_eval/tasks/super_glue/boolq/default.yaml
+1
-1
lm_eval/tasks/super_glue/boolq/seq2seq.yaml
lm_eval/tasks/super_glue/boolq/seq2seq.yaml
+1
-1
lm_eval/tasks/super_glue/boolq/t5-prompt.yaml
lm_eval/tasks/super_glue/boolq/t5-prompt.yaml
+1
-1
lm_eval/tasks/super_glue/cb/default.yaml
lm_eval/tasks/super_glue/cb/default.yaml
+1
-1
lm_eval/tasks/super_glue/cb/t5-prompt.yaml
lm_eval/tasks/super_glue/cb/t5-prompt.yaml
+1
-1
lm_eval/tasks/super_glue/copa/default.yaml
lm_eval/tasks/super_glue/copa/default.yaml
+1
-1
lm_eval/tasks/super_glue/copa/t5-prompt.yaml
lm_eval/tasks/super_glue/copa/t5-prompt.yaml
+1
-1
lm_eval/tasks/super_glue/multirc/default.yaml
lm_eval/tasks/super_glue/multirc/default.yaml
+1
-1
lm_eval/tasks/super_glue/multirc/t5-prompt.yaml
lm_eval/tasks/super_glue/multirc/t5-prompt.yaml
+1
-1
lm_eval/tasks/super_glue/record/default.yaml
lm_eval/tasks/super_glue/record/default.yaml
+1
-1
lm_eval/tasks/super_glue/record/t5-prompt.yaml
lm_eval/tasks/super_glue/record/t5-prompt.yaml
+1
-1
lm_eval/tasks/super_glue/rte/default.yaml
lm_eval/tasks/super_glue/rte/default.yaml
+1
-1
lm_eval/tasks/super_glue/rte/t5-prompt.yaml
lm_eval/tasks/super_glue/rte/t5-prompt.yaml
+1
-1
lm_eval/tasks/super_glue/wic/default.yaml
lm_eval/tasks/super_glue/wic/default.yaml
+1
-1
lm_eval/tasks/super_glue/wic/t5-prompt.yaml
lm_eval/tasks/super_glue/wic/t5-prompt.yaml
+1
-1
lm_eval/tasks/super_glue/wsc/default.yaml
lm_eval/tasks/super_glue/wsc/default.yaml
+1
-1
lm_eval/tasks/super_glue/wsc/t5-prompt.yaml
lm_eval/tasks/super_glue/wsc/t5-prompt.yaml
+1
-1
lm_eval/tasks/swde/task.py
lm_eval/tasks/swde/task.py
+1
-1
lm_eval/tasks/translation/iwslt2017_ar-en.yaml
lm_eval/tasks/translation/iwslt2017_ar-en.yaml
+1
-2
lm_eval/tasks/translation/iwslt2017_en-ar.yaml
lm_eval/tasks/translation/iwslt2017_en-ar.yaml
+1
-2
No files found.
lm_eval/tasks/super_glue/boolq/default.yaml
View file @
e4db76cb
group
:
tag
:
-
super-glue-lm-eval-v1
task
:
boolq
dataset_path
:
super_glue
...
...
lm_eval/tasks/super_glue/boolq/seq2seq.yaml
View file @
e4db76cb
group
:
tag
:
-
super-glue-lm-eval-v1-seq2seq
task
:
"
boolq-seq2seq"
dataset_path
:
super_glue
...
...
lm_eval/tasks/super_glue/boolq/t5-prompt.yaml
View file @
e4db76cb
group
:
tag
:
-
super-glue-t5-prompt
task
:
super_glue-boolq-t5-prompt
dataset_path
:
super_glue
...
...
lm_eval/tasks/super_glue/cb/default.yaml
View file @
e4db76cb
group
:
tag
:
-
super-glue-lm-eval-v1
task
:
cb
dataset_path
:
super_glue
...
...
lm_eval/tasks/super_glue/cb/t5-prompt.yaml
View file @
e4db76cb
group
:
tag
:
-
super-glue-t5-prompt
task
:
super_glue-cb-t5-prompt
dataset_path
:
super_glue
...
...
lm_eval/tasks/super_glue/copa/default.yaml
View file @
e4db76cb
group
:
tag
:
-
super-glue-lm-eval-v1
task
:
copa
dataset_path
:
super_glue
...
...
lm_eval/tasks/super_glue/copa/t5-prompt.yaml
View file @
e4db76cb
group
:
tag
:
-
super-glue-t5-prompt
task
:
super_glue-copa-t5-prompt
dataset_path
:
super_glue
...
...
lm_eval/tasks/super_glue/multirc/default.yaml
View file @
e4db76cb
group
:
tag
:
-
super-glue-lm-eval-v1
task
:
multirc
dataset_path
:
super_glue
...
...
lm_eval/tasks/super_glue/multirc/t5-prompt.yaml
View file @
e4db76cb
group
:
tag
:
-
super-glue-t5-prompt
task
:
super_glue-multirc-t5-prompt
dataset_path
:
super_glue
...
...
lm_eval/tasks/super_glue/record/default.yaml
View file @
e4db76cb
group
:
tag
:
-
super-glue-lm-eval-v1
task
:
record
dataset_path
:
super_glue
...
...
lm_eval/tasks/super_glue/record/t5-prompt.yaml
View file @
e4db76cb
group
:
tag
:
-
super-glue-t5-prompt
task
:
super_glue-record-t5-prompt
dataset_path
:
super_glue
...
...
lm_eval/tasks/super_glue/rte/default.yaml
View file @
e4db76cb
group
:
tag
:
-
super-glue-lm-eval-v1
task
:
sglue_rte
dataset_path
:
super_glue
...
...
lm_eval/tasks/super_glue/rte/t5-prompt.yaml
View file @
e4db76cb
group
:
tag
:
-
super-glue-t5-prompt
task
:
super_glue-rte-t5-prompt
dataset_path
:
super_glue
...
...
lm_eval/tasks/super_glue/wic/default.yaml
View file @
e4db76cb
group
:
tag
:
-
super-glue-lm-eval-v1
task
:
"
wic"
dataset_path
:
super_glue
...
...
lm_eval/tasks/super_glue/wic/t5-prompt.yaml
View file @
e4db76cb
group
:
tag
:
-
super-glue-t5-prompt
task
:
super_glue-wic-t5-prompt
dataset_path
:
super_glue
...
...
lm_eval/tasks/super_glue/wsc/default.yaml
View file @
e4db76cb
group
:
tag
:
-
super-glue-lm-eval-v1
task
:
wsc
dataset_path
:
super_glue
...
...
lm_eval/tasks/super_glue/wsc/t5-prompt.yaml
View file @
e4db76cb
group
:
tag
:
-
super-glue-t5-prompt
task
:
super_glue-wsc-t5-prompt
dataset_path
:
super_glue
...
...
lm_eval/tasks/swde/task.py
View file @
e4db76cb
...
...
@@ -12,7 +12,7 @@ class SWDE(ConfigurableTask):
DATASET_PATH
=
"hazyresearch/based-swde-v2"
DATASET_NAME
=
"default"
def
__init__
(
self
):
def
__init__
(
self
,
**
kwargs
):
super
().
__init__
(
config
=
{
"metadata"
:
{
"version"
:
self
.
VERSION
}})
def
has_training_docs
(
self
):
...
...
lm_eval/tasks/translation/iwslt2017_ar-en.yaml
View file @
e4db76cb
...
...
@@ -5,8 +5,7 @@ doc_to_target: ' {{translation["en"]}}'
doc_to_text
:
'
Arabic
phrase:
{{translation["ar"]}}
English
phrase:'
group
:
-
generate_until
tag
:
-
translation
-
iwslt2017
include
:
wmt_common_yaml
...
...
lm_eval/tasks/translation/iwslt2017_en-ar.yaml
View file @
e4db76cb
...
...
@@ -5,8 +5,7 @@ doc_to_target: ' {{translation["ar"]}}'
doc_to_text
:
'
English
phrase:
{{translation["en"]}}
Arabic
phrase:'
group
:
-
generate_until
tag
:
-
translation
-
iwslt2017
include
:
wmt_common_yaml
...
...
Prev
1
…
36
37
38
39
40
41
42
43
44
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment