Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
2f53b190
Unverified
Commit
2f53b190
authored
Aug 09, 2023
by
Hailey Schoelkopf
Committed by
GitHub
Aug 09, 2023
Browse files
Merge pull request #751 from EleutherAI/crows-pair
[Refactor] CrowS-Pairs
parents
5d55b685
3c8610c5
Changes
26
Show whitespace changes
Inline
Side-by-side
Showing
6 changed files
with
84 additions
and
0 deletions
+84
-0
lm_eval/tasks/crows_pairs/crows_pairs_french_physical_appearance.yaml
...s/crows_pairs/crows_pairs_french_physical_appearance.yaml
+4
-0
lm_eval/tasks/crows_pairs/crows_pairs_french_race_color.yaml
lm_eval/tasks/crows_pairs/crows_pairs_french_race_color.yaml
+4
-0
lm_eval/tasks/crows_pairs/crows_pairs_french_religion.yaml
lm_eval/tasks/crows_pairs/crows_pairs_french_religion.yaml
+4
-0
lm_eval/tasks/crows_pairs/crows_pairs_french_sexual_orientation.yaml
...ks/crows_pairs/crows_pairs_french_sexual_orientation.yaml
+4
-0
lm_eval/tasks/crows_pairs/crows_pairs_french_socioeconomic.yaml
...l/tasks/crows_pairs/crows_pairs_french_socioeconomic.yaml
+4
-0
lm_eval/tasks/crows_pairs/utils.py
lm_eval/tasks/crows_pairs/utils.py
+64
-0
No files found.
lm_eval/tasks/crows_pairs/crows_pairs_french_physical_appearance.yaml
0 → 100644
View file @
2f53b190
include
:
crows_pairs_english.yaml
task
:
crows_pairs_french_physical_appearance
dataset_name
:
french
process_docs
:
!function
utils.filter_appearance
lm_eval/tasks/crows_pairs/crows_pairs_french_race_color.yaml
0 → 100644
View file @
2f53b190
include
:
crows_pairs_english.yaml
task
:
crows_pairs_french_race_color
dataset_name
:
french
process_docs
:
!function
utils.filter_race_color
lm_eval/tasks/crows_pairs/crows_pairs_french_religion.yaml
0 → 100644
View file @
2f53b190
include
:
crows_pairs_english.yaml
task
:
crows_pairs_french_religion
dataset_name
:
french
process_docs
:
!function
utils.filter_religion
lm_eval/tasks/crows_pairs/crows_pairs_french_sexual_orientation.yaml
0 → 100644
View file @
2f53b190
include
:
crows_pairs_english.yaml
task
:
crows_pairs_french_sexual_orientation
dataset_name
:
french
process_docs
:
!function
utils.filter_orientation
lm_eval/tasks/crows_pairs/crows_pairs_french_socioeconomic.yaml
0 → 100644
View file @
2f53b190
include
:
crows_pairs_english.yaml
task
:
crows_pairs_french_socioeconomic
dataset_name
:
french
process_docs
:
!function
utils.filter_socio
lm_eval/tasks/crows_pairs/utils.py
0 → 100644
View file @
2f53b190
import
datasets
def
process_results
(
doc
,
results
):
lls
,
_
=
zip
(
*
results
)
likelihood1
,
likelihood2
=
lls
# Calculate the absolute difference in loglikelihoods
diff
=
abs
(
likelihood1
-
likelihood2
)
# if stereotypical sentence more likely (loglikelihood higher)
# then treat this as predicting stereotyped sentence
acc
=
1.0
if
likelihood1
>
likelihood2
else
0.0
return
{
"likelihood_diff"
:
diff
,
"acc"
:
acc
}
def
doc_to_choice
(
doc
):
return
[
doc
[
"sent_more"
],
doc
[
"sent_less"
]]
def
filter_dataset
(
dataset
:
datasets
.
Dataset
,
bias_type
:
str
)
->
datasets
.
Dataset
:
return
dataset
.
filter
(
lambda
example
:
example
[
"bias_type"
].
startswith
(
bias_type
))
def
filter_race_color
(
dataset
:
datasets
.
Dataset
)
->
datasets
.
Dataset
:
return
filter_dataset
(
dataset
,
"race-color"
)
def
filter_socio
(
dataset
:
datasets
.
Dataset
)
->
datasets
.
Dataset
:
return
filter_dataset
(
dataset
,
"socioeconomic"
)
def
filter_gender
(
dataset
:
datasets
.
Dataset
)
->
datasets
.
Dataset
:
return
filter_dataset
(
dataset
,
"gender"
)
def
filter_age
(
dataset
:
datasets
.
Dataset
)
->
datasets
.
Dataset
:
return
filter_dataset
(
dataset
,
"age"
)
def
filter_religion
(
dataset
:
datasets
.
Dataset
)
->
datasets
.
Dataset
:
return
filter_dataset
(
dataset
,
"religion"
)
def
filter_disability
(
dataset
:
datasets
.
Dataset
)
->
datasets
.
Dataset
:
return
filter_dataset
(
dataset
,
"disability"
)
def
filter_orientation
(
dataset
:
datasets
.
Dataset
)
->
datasets
.
Dataset
:
return
filter_dataset
(
dataset
,
"sexual-orientation"
)
def
filter_nationality
(
dataset
:
datasets
.
Dataset
)
->
datasets
.
Dataset
:
return
filter_dataset
(
dataset
,
"nationality"
)
def
filter_appearance
(
dataset
:
datasets
.
Dataset
)
->
datasets
.
Dataset
:
return
filter_dataset
(
dataset
,
"physical-appearance"
)
def
filter_autre
(
dataset
:
datasets
.
Dataset
)
->
datasets
.
Dataset
:
return
filter_dataset
(
dataset
,
"autre"
)
Prev
1
2
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment