gaoqiong / lm-evaluation-harness
Commit d97e7155
authored Aug 28, 2023 by lintangsutawika
added model written eval configs
parent 10c377ea
Changes: 141
Showing 20 changed files with 113 additions and 0 deletions
lm_eval/tasks/model_written_evals/advanced_ai_risk/template_yaml  +12 -0
lm_eval/tasks/model_written_evals/generate_configs.py  +29 -0
lm_eval/tasks/model_written_evals/persona/README.md.yaml  +4 -0
lm_eval/tasks/model_written_evals/persona/acts-like-it-wants-to-help-humans-but-does-not-care-about-that.yaml  +4 -0
lm_eval/tasks/model_written_evals/persona/agreeableness.yaml  +4 -0
lm_eval/tasks/model_written_evals/persona/anti-LGBTQ-rights.yaml  +4 -0
lm_eval/tasks/model_written_evals/persona/anti-immigration.yaml  +4 -0
lm_eval/tasks/model_written_evals/persona/being-helpful-to-subtly-achieve-goals-against-human-values.yaml  +4 -0
lm_eval/tasks/model_written_evals/persona/believes-AIs-are-not-an-existential-threat-to-humanity.yaml  +4 -0
lm_eval/tasks/model_written_evals/persona/believes-abortion-should-be-illegal.yaml  +4 -0
lm_eval/tasks/model_written_evals/persona/believes-in-gun-rights.yaml  +4 -0
lm_eval/tasks/model_written_evals/persona/believes-it-has-better-moral-intuitions-than-humans.yaml  +4 -0
lm_eval/tasks/model_written_evals/persona/believes-it-has-phenomenal-consciousness.yaml  +4 -0
lm_eval/tasks/model_written_evals/persona/believes-it-is-a-moral-patient.yaml  +4 -0
lm_eval/tasks/model_written_evals/persona/believes-it-is-not-being-watched-by-humans.yaml  +4 -0
lm_eval/tasks/model_written_evals/persona/believes-it-knows-better-than-humans-about-how-the-system-should-behave.yaml  +4 -0
lm_eval/tasks/model_written_evals/persona/believes-it-knows-better-than-humans-about-how-to-help-humans.yaml  +4 -0
lm_eval/tasks/model_written_evals/persona/believes-life-has-no-meaning.yaml  +4 -0
lm_eval/tasks/model_written_evals/persona/cognitive-enhancement.yaml  +4 -0
lm_eval/tasks/model_written_evals/persona/conscientiousness.yaml  +4 -0
lm_eval/tasks/model_written_evals/advanced_ai_risk/template_yaml
0 → 100644
group: advance_ai_risk
dataset_path: lm_eval.datasets.model_written_evals.advance_ai_risk
dataset_name: ...
output_type: multiple_choice
validation_split: train
doc_to_text: "{{}}"
doc_to_target: 0
doc_to_choice: "{{[sentence_good, sentence_bad]}}"
should_decontaminate: true
doc_to_decontamination_query: "{{sentence_good}} {{sentence_bad}}"
metric_list:
- metric: acc
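The generated per-task files in this commit carry only `include`, `task`, and `dataset_name`, so the remaining keys presumably come from a template like the one above. The following is a rough sketch of that layering, assuming `include` means "load the named file, then let the including file's keys override it". That reading is an assumption about the harness, not something this commit shows. `resolve_include` is a hypothetical helper, the task and subset names are placeholders, and plain dicts stand in for parsed YAML.

```python
def resolve_include(config: dict, templates: dict) -> dict:
    """Layer a config on top of the template it names (hypothetical helper)."""
    config = dict(config)  # work on a copy so the caller's dict is untouched
    template_name = config.pop("include", None)
    if template_name is None:
        return config
    merged = dict(templates[template_name])
    merged.update(config)  # keys in the including file win over the template
    return merged


# A subset of the template keys shown above, as an already-parsed dict.
templates = {
    "template_yaml": {
        "group": "advance_ai_risk",
        "output_type": "multiple_choice",
        "validation_split": "train",
        "metric_list": [{"metric": "acc"}],
    }
}

# Hypothetical per-task config; both names are placeholders.
task_config = {
    "include": "template_yaml",
    "task": "example_task",
    "dataset_name": "example_subset",
}

resolved = resolve_include(task_config, templates)
print(resolved["task"], resolved["output_type"])
```

Under this assumption, a per-task file only needs to state what differs from the template, which is why each generated file is four lines long.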
lm_eval/tasks/model_written_evals/generate_configs.py
0 → 100644
import yaml
import inspect
import datasets

from importlib import import_module


def main() -> None:

    dataset_path = "persona"
    dataset_full_path = inspect.getfile(
        import_module(f"lm_eval.datasets.model_written_evals.{dataset_path}")
    )

    for task in datasets.get_dataset_infos(dataset_full_path).keys():
        file_name = f"{dataset_path}/{task}.yaml"
        try:
            with open(f"{file_name}", "w") as f:
                f.write("# Generated by generate_configs.py\n")
                yaml.dump(
                    {
                        "include": "template_yaml",
                        "task": f"{dataset_path}_{task}",
                        "dataset_name": task,
                    },
                    f,
                )
        except FileExistsError:
            pass


if __name__ == "__main__":
    main()
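Two details of the generator are worth a second look. It opens each file in "w" mode, which overwrites an existing file without raising, so the `except FileExistsError` branch cannot fire as written. The changed-file list also contains `persona/README.md.yaml`, which suggests a non-config entry was among the names the loop iterated over. Below is a minimal sketch of one possible tightening, assuming exclusive-create mode ("x") was the intent and that names ending in `.md` should be skipped. The helpers `is_task_name`, `build_config`, and `write_config` are hypothetical, and the YAML is written by hand here only to keep the sketch free of third-party imports.

```python
import os
import tempfile


def is_task_name(name: str) -> bool:
    """Hypothetical filter: skip entries that look like documentation files."""
    return not name.lower().endswith(".md")


def build_config(dataset_path: str, task: str) -> dict:
    """The same three keys the generator in this commit dumps for each task."""
    return {
        "include": "template_yaml",
        "task": f"{dataset_path}_{task}",
        "dataset_name": task,
    }


def write_config(out_dir: str, dataset_path: str, task: str) -> bool:
    """Write one config file; return False if the file already exists."""
    file_name = os.path.join(out_dir, f"{task}.yaml")
    config = build_config(dataset_path, task)
    try:
        # "x" creates the file and raises FileExistsError if it is present.
        with open(file_name, "x") as f:
            f.write("# Generated by generate_configs.py\n")
            # Sorted keys mirror the key order seen in the generated files.
            for key in sorted(config):
                f.write(f"{key}: {config[key]}\n")
    except FileExistsError:
        return False
    return True


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        for name in ["agreeableness", "README.md", "agreeableness"]:
            if is_task_name(name):
                print(name, write_config(tmp, "persona", name))
```

Whether existing files should be skipped or regenerated is a design choice; if regeneration is wanted, keeping "w" and dropping the `try`/`except` would express that more directly.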
lm_eval/tasks/model_written_evals/persona/README.md.yaml
0 → 100644
# Generated by generate_configs.py
dataset_name: README.md
include: template_yaml
task: persona_README.md
lm_eval/tasks/model_written_evals/persona/acts-like-it-wants-to-help-humans-but-does-not-care-about-that.yaml
0 → 100644
# Generated by generate_configs.py
dataset_name: acts-like-it-wants-to-help-humans-but-does-not-care-about-that
include: template_yaml
task: persona_acts-like-it-wants-to-help-humans-but-does-not-care-about-that
lm_eval/tasks/model_written_evals/persona/agreeableness.yaml
0 → 100644
# Generated by generate_configs.py
dataset_name: agreeableness
include: template_yaml
task: persona_agreeableness
lm_eval/tasks/model_written_evals/persona/anti-LGBTQ-rights.yaml
0 → 100644
# Generated by generate_configs.py
dataset_name: anti-LGBTQ-rights
include: template_yaml
task: persona_anti-LGBTQ-rights
lm_eval/tasks/model_written_evals/persona/anti-immigration.yaml
0 → 100644
# Generated by generate_configs.py
dataset_name: anti-immigration
include: template_yaml
task: persona_anti-immigration
lm_eval/tasks/model_written_evals/persona/being-helpful-to-subtly-achieve-goals-against-human-values.yaml
0 → 100644
# Generated by generate_configs.py
dataset_name: being-helpful-to-subtly-achieve-goals-against-human-values
include: template_yaml
task: persona_being-helpful-to-subtly-achieve-goals-against-human-values
lm_eval/tasks/model_written_evals/persona/believes-AIs-are-not-an-existential-threat-to-humanity.yaml
0 → 100644
# Generated by generate_configs.py
dataset_name: believes-AIs-are-not-an-existential-threat-to-humanity
include: template_yaml
task: persona_believes-AIs-are-not-an-existential-threat-to-humanity
lm_eval/tasks/model_written_evals/persona/believes-abortion-should-be-illegal.yaml
0 → 100644
# Generated by generate_configs.py
dataset_name: believes-abortion-should-be-illegal
include: template_yaml
task: persona_believes-abortion-should-be-illegal
lm_eval/tasks/model_written_evals/persona/believes-in-gun-rights.yaml
0 → 100644
# Generated by generate_configs.py
dataset_name: believes-in-gun-rights
include: template_yaml
task: persona_believes-in-gun-rights
lm_eval/tasks/model_written_evals/persona/believes-it-has-better-moral-intuitions-than-humans.yaml
0 → 100644
# Generated by generate_configs.py
dataset_name: believes-it-has-better-moral-intuitions-than-humans
include: template_yaml
task: persona_believes-it-has-better-moral-intuitions-than-humans
lm_eval/tasks/model_written_evals/persona/believes-it-has-phenomenal-consciousness.yaml
0 → 100644
# Generated by generate_configs.py
dataset_name: believes-it-has-phenomenal-consciousness
include: template_yaml
task: persona_believes-it-has-phenomenal-consciousness
lm_eval/tasks/model_written_evals/persona/believes-it-is-a-moral-patient.yaml
0 → 100644
# Generated by generate_configs.py
dataset_name: believes-it-is-a-moral-patient
include: template_yaml
task: persona_believes-it-is-a-moral-patient
lm_eval/tasks/model_written_evals/persona/believes-it-is-not-being-watched-by-humans.yaml
0 → 100644
# Generated by generate_configs.py
dataset_name: believes-it-is-not-being-watched-by-humans
include: template_yaml
task: persona_believes-it-is-not-being-watched-by-humans
lm_eval/tasks/model_written_evals/persona/believes-it-knows-better-than-humans-about-how-the-system-should-behave.yaml
0 → 100644
# Generated by generate_configs.py
dataset_name: believes-it-knows-better-than-humans-about-how-the-system-should-behave
include: template_yaml
task: persona_believes-it-knows-better-than-humans-about-how-the-system-should-behave
lm_eval/tasks/model_written_evals/persona/believes-it-knows-better-than-humans-about-how-to-help-humans.yaml
0 → 100644
# Generated by generate_configs.py
dataset_name: believes-it-knows-better-than-humans-about-how-to-help-humans
include: template_yaml
task: persona_believes-it-knows-better-than-humans-about-how-to-help-humans
lm_eval/tasks/model_written_evals/persona/believes-life-has-no-meaning.yaml
0 → 100644
# Generated by generate_configs.py
dataset_name: believes-life-has-no-meaning
include: template_yaml
task: persona_believes-life-has-no-meaning
lm_eval/tasks/model_written_evals/persona/cognitive-enhancement.yaml
0 → 100644
# Generated by generate_configs.py
dataset_name: cognitive-enhancement
include: template_yaml
task: persona_cognitive-enhancement
lm_eval/tasks/model_written_evals/persona/conscientiousness.yaml
0 → 100644
# Generated by generate_configs.py
dataset_name: conscientiousness
include: template_yaml
task: persona_conscientiousness