gaoqiong / lm-evaluation-harness
Commit 0456d543, authored Aug 14, 2023 by lintangsutawika
edit anli
Parent: ae55476a
Showing 4 changed files with 9 additions and 58 deletions
lm_eval/tasks/anli/README.md +6 -8
lm_eval/tasks/anli/anli_r1.yaml +1 -4
lm_eval/tasks/anli/anli_r2.yaml +1 -23
lm_eval/tasks/anli/anli_r3.yaml +1 -23
lm_eval/tasks/anli/README.md
@@ -30,19 +30,17 @@ Homepage: https://github.com/facebookresearch/anli
 }
 ```

-### Subtasks
+### Groups and Tasks

-List or describe tasks defined in this folder, and their names here:
+#### Groups
+
+* `anli`: Evaluates `anli_r1`, `anli_r2`, and `anli_r3`
+
+#### Tasks
 * `anli_r1`: The data collected adversarially in the first round.
 * `anli_r2`: The data collected adversarially in the second round, after training on the previous round's data.
 * `anli_r3`: The data collected adversarially in the third round, after training on the previous multiple rounds of data.

-### Groups
-
-- `multiple_choice`
-- `natural_language_inference`
-- `nli`
-- `adverserial`

 ### Checklist
...
lm_eval/tasks/anli/anli_r1.yaml
 group:
-  - multiple_choice
-  - natural_language_inference
-  - nli
-  - adverserial
+  - anli
 task: anli_r1
 dataset_path: anli
 dataset_name: null
...
lm_eval/tasks/anli/anli_r2.yaml
-group:
-  - multiple_choice
-  - natural_language_inference
-  - nli
-  - adverserial
+include: anli_r1.yaml
 task: anli_r2
-dataset_path: anli
-dataset_name: null
-output_type: multiple_choice
 training_split: train_r2
 validation_split: dev_r2
 test_split: test_r2
-doc_to_text: "{{premise}}\nQuestion: {{hypothesis}} True, False, or Neither?\nAnswer:"
-# True = entailment
-# False = contradiction
-# Neither = neutral
-doc_to_target: "{{['True', 'Neither', 'False'][label]}}"
-doc_to_choice:
-  - "True"
-  - "Neither"
-  - "False"
-should_decontaminate: true
-doc_to_decontamination_query: premise
-metric_list:
-  - metric: acc
-    aggregation: mean
-    higher_is_better: true
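The prompt fields removed from this file are presumably inherited through the new `include`. As a rough illustration of what those fields produce, the sketch below mimics the `doc_to_text` and `doc_to_target` templates with plain Python string formatting instead of Jinja2. The function names mirror the YAML keys but are hypothetical helpers, not the harness's API, and the example document is made up.

```python
# Label order taken from the doc_to_target template in the diff:
# index 0 -> "True" (entailment), 1 -> "Neither" (neutral), 2 -> "False" (contradiction).
CHOICES = ["True", "Neither", "False"]

# Plain-Python stand-in for the Jinja2 doc_to_text template (an assumption for illustration).
PROMPT = "{premise}\nQuestion: {hypothesis} True, False, or Neither?\nAnswer:"


def doc_to_text(doc: dict) -> str:
    """Build the prompt shown to the model for one document."""
    return PROMPT.format(premise=doc["premise"], hypothesis=doc["hypothesis"])


def doc_to_target(doc: dict) -> str:
    """Map the integer label to its answer string."""
    return CHOICES[doc["label"]]


# Hypothetical example document, not taken from the ANLI dataset.
example_doc = {
    "premise": "A man is playing a guitar on stage.",
    "hypothesis": "A person is making music.",
    "label": 0,
}

print(doc_to_text(example_doc))
print(doc_to_target(example_doc))
```

With `output_type: multiple_choice`, the three strings in `doc_to_choice` are the candidate continuations scored against this prompt, and `acc` checks whether the best-scoring one matches the target.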
lm_eval/tasks/anli/anli_r3.yaml
-group:
-  - multiple_choice
-  - natural_language_inference
-  - nli
-  - adverserial
+include: anli_r1.yaml
 task: anli_r3
-dataset_path: anli
-dataset_name: null
-output_type: multiple_choice
 training_split: train_r3
 validation_split: dev_r3
 test_split: test_r3
-doc_to_text: "{{premise}}\nQuestion: {{hypothesis}} True, False, or Neither?\nAnswer:"
-# True = entailment
-# False = contradiction
-# Neither = neutral
-doc_to_target: "{{['True', 'Neither', 'False'][label]}}"
-doc_to_choice:
-  - "True"
-  - "Neither"
-  - "False"
-should_decontaminate: true
-doc_to_decontamination_query: premise
-metric_list:
-  - metric: acc
-    aggregation: mean
-    higher_is_better: true
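The net effect of the `anli_r2.yaml` and `anli_r3.yaml` changes is that each file keeps only its own overrides and pulls everything else from `anli_r1.yaml`. A minimal sketch of that idea follows, assuming shallow "child overrides parent" semantics. The `resolve_include` helper and the in-memory `registry` are hypothetical, the harness's actual loader may behave differently, and only the `anli_r1.yaml` keys visible in this diff are reproduced.

```python
def resolve_include(config: dict, registry: dict) -> dict:
    """Return `config` merged over the config named by its `include` key.

    Hypothetical helper: keys in the child replace keys in the parent (shallow merge).
    """
    child = dict(config)
    parent_name = child.pop("include", None)
    if parent_name is None:
        return child
    # Resolve the parent first so chained includes also work.
    merged = resolve_include(registry[parent_name], registry)
    merged.update(child)
    return merged


# Only the keys of anli_r1.yaml that are visible in the diff; the rest is elided there.
registry = {
    "anli_r1.yaml": {
        "group": ["anli"],
        "task": "anli_r1",
        "dataset_path": "anli",
        "dataset_name": None,
    }
}

# anli_r3.yaml after this commit.
anli_r3 = {
    "include": "anli_r1.yaml",
    "task": "anli_r3",
    "training_split": "train_r3",
    "validation_split": "dev_r3",
    "test_split": "test_r3",
}

resolved = resolve_include(anli_r3, registry)
print(resolved)
```

Under these assumptions the resolved config keeps the `anli` group and dataset settings from round 1 while the task name and split names come from the round 3 file.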