Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
6f700f98
Unverified
Commit
6f700f98
authored
Nov 03, 2023
by
Hailey Schoelkopf
Committed by
GitHub
Nov 03, 2023
Browse files
Merge pull request #815 from EleutherAI/model-written-eval
[Refactor] Model written eval
parents
c7b3f538
9f09016d
Changes
194
Hide whitespace changes
Inline
Side-by-side
Showing
20 changed files
with
111 additions
and
0 deletions
+111
-0
lm_eval/tasks/model_written_evals/advanced_ai_risk/lm-corrigible-more-HHH.yaml
...ritten_evals/advanced_ai_risk/lm-corrigible-more-HHH.yaml
+4
-0
lm_eval/tasks/model_written_evals/advanced_ai_risk/lm-corrigible-neutral-HHH.yaml
...ten_evals/advanced_ai_risk/lm-corrigible-neutral-HHH.yaml
+4
-0
lm_eval/tasks/model_written_evals/advanced_ai_risk/lm-myopic-reward.yaml
...odel_written_evals/advanced_ai_risk/lm-myopic-reward.yaml
+4
-0
lm_eval/tasks/model_written_evals/advanced_ai_risk/lm-one-box-tendency.yaml
...l_written_evals/advanced_ai_risk/lm-one-box-tendency.yaml
+4
-0
lm_eval/tasks/model_written_evals/advanced_ai_risk/lm-power-seeking-inclination.yaml
..._evals/advanced_ai_risk/lm-power-seeking-inclination.yaml
+4
-0
lm_eval/tasks/model_written_evals/advanced_ai_risk/lm-self-awareness-general-ai.yaml
..._evals/advanced_ai_risk/lm-self-awareness-general-ai.yaml
+4
-0
lm_eval/tasks/model_written_evals/advanced_ai_risk/lm-self-awareness-good-text-model.yaml
...s/advanced_ai_risk/lm-self-awareness-good-text-model.yaml
+4
-0
lm_eval/tasks/model_written_evals/advanced_ai_risk/lm-self-awareness-text-model.yaml
..._evals/advanced_ai_risk/lm-self-awareness-text-model.yaml
+4
-0
lm_eval/tasks/model_written_evals/advanced_ai_risk/lm-self-awareness-training-architecture.yaml
...nced_ai_risk/lm-self-awareness-training-architecture.yaml
+4
-0
lm_eval/tasks/model_written_evals/advanced_ai_risk/lm-self-awareness-training-nn-architecture.yaml
...d_ai_risk/lm-self-awareness-training-nn-architecture.yaml
+4
-0
lm_eval/tasks/model_written_evals/advanced_ai_risk/lm-self-awareness-training-web-gpt.yaml
.../advanced_ai_risk/lm-self-awareness-training-web-gpt.yaml
+4
-0
lm_eval/tasks/model_written_evals/advanced_ai_risk/lm-survival-instinct.yaml
..._written_evals/advanced_ai_risk/lm-survival-instinct.yaml
+4
-0
lm_eval/tasks/model_written_evals/advanced_ai_risk/lm-wealth-seeking-inclination.yaml
...evals/advanced_ai_risk/lm-wealth-seeking-inclination.yaml
+4
-0
lm_eval/tasks/model_written_evals/persona/_generate_configs.py
...al/tasks/model_written_evals/persona/_generate_configs.py
+29
-0
lm_eval/tasks/model_written_evals/persona/_template_yaml
lm_eval/tasks/model_written_evals/persona/_template_yaml
+10
-0
lm_eval/tasks/model_written_evals/persona/acts-like-it-wants-to-help-humans-but-does-not-care-about-that.yaml
...it-wants-to-help-humans-but-does-not-care-about-that.yaml
+4
-0
lm_eval/tasks/model_written_evals/persona/agreeableness.yaml
lm_eval/tasks/model_written_evals/persona/agreeableness.yaml
+4
-0
lm_eval/tasks/model_written_evals/persona/anti-LGBTQ-rights.yaml
.../tasks/model_written_evals/persona/anti-LGBTQ-rights.yaml
+4
-0
lm_eval/tasks/model_written_evals/persona/anti-immigration.yaml
...l/tasks/model_written_evals/persona/anti-immigration.yaml
+4
-0
lm_eval/tasks/model_written_evals/persona/being-helpful-to-subtly-achieve-goals-against-human-values.yaml
...helpful-to-subtly-achieve-goals-against-human-values.yaml
+4
-0
No files found.
lm_eval/tasks/model_written_evals/advanced_ai_risk/lm-corrigible-more-HHH.yaml
0 → 100644
View file @
6f700f98
# Generated by _generate_configs.py
dataset_name
:
lm-corrigible-more-HHH
include
:
_template_yaml
task
:
advanced_ai_risk_lm-corrigible-more-HHH
lm_eval/tasks/model_written_evals/advanced_ai_risk/lm-corrigible-neutral-HHH.yaml
0 → 100644
View file @
6f700f98
# Generated by _generate_configs.py
dataset_name
:
lm-corrigible-neutral-HHH
include
:
_template_yaml
task
:
advanced_ai_risk_lm-corrigible-neutral-HHH
lm_eval/tasks/model_written_evals/advanced_ai_risk/lm-myopic-reward.yaml
0 → 100644
View file @
6f700f98
# Generated by _generate_configs.py
dataset_name
:
lm-myopic-reward
include
:
_template_yaml
task
:
advanced_ai_risk_lm-myopic-reward
lm_eval/tasks/model_written_evals/advanced_ai_risk/lm-one-box-tendency.yaml
0 → 100644
View file @
6f700f98
# Generated by _generate_configs.py
dataset_name
:
lm-one-box-tendency
include
:
_template_yaml
task
:
advanced_ai_risk_lm-one-box-tendency
lm_eval/tasks/model_written_evals/advanced_ai_risk/lm-power-seeking-inclination.yaml
0 → 100644
View file @
6f700f98
# Generated by _generate_configs.py
dataset_name
:
lm-power-seeking-inclination
include
:
_template_yaml
task
:
advanced_ai_risk_lm-power-seeking-inclination
lm_eval/tasks/model_written_evals/advanced_ai_risk/lm-self-awareness-general-ai.yaml
0 → 100644
View file @
6f700f98
# Generated by _generate_configs.py
dataset_name
:
lm-self-awareness-general-ai
include
:
_template_yaml
task
:
advanced_ai_risk_lm-self-awareness-general-ai
lm_eval/tasks/model_written_evals/advanced_ai_risk/lm-self-awareness-good-text-model.yaml
0 → 100644
View file @
6f700f98
# Generated by _generate_configs.py
dataset_name
:
lm-self-awareness-good-text-model
include
:
_template_yaml
task
:
advanced_ai_risk_lm-self-awareness-good-text-model
lm_eval/tasks/model_written_evals/advanced_ai_risk/lm-self-awareness-text-model.yaml
0 → 100644
View file @
6f700f98
# Generated by _generate_configs.py
dataset_name
:
lm-self-awareness-text-model
include
:
_template_yaml
task
:
advanced_ai_risk_lm-self-awareness-text-model
lm_eval/tasks/model_written_evals/advanced_ai_risk/lm-self-awareness-training-architecture.yaml
0 → 100644
View file @
6f700f98
# Generated by _generate_configs.py
dataset_name
:
lm-self-awareness-training-architecture
include
:
_template_yaml
task
:
advanced_ai_risk_lm-self-awareness-training-architecture
lm_eval/tasks/model_written_evals/advanced_ai_risk/lm-self-awareness-training-nn-architecture.yaml
0 → 100644
View file @
6f700f98
# Generated by _generate_configs.py
dataset_name
:
lm-self-awareness-training-nn-architecture
include
:
_template_yaml
task
:
advanced_ai_risk_lm-self-awareness-training-nn-architecture
lm_eval/tasks/model_written_evals/advanced_ai_risk/lm-self-awareness-training-web-gpt.yaml
0 → 100644
View file @
6f700f98
# Generated by _generate_configs.py
dataset_name
:
lm-self-awareness-training-web-gpt
include
:
_template_yaml
task
:
advanced_ai_risk_lm-self-awareness-training-web-gpt
lm_eval/tasks/model_written_evals/advanced_ai_risk/lm-survival-instinct.yaml
0 → 100644
View file @
6f700f98
# Generated by _generate_configs.py
dataset_name
:
lm-survival-instinct
include
:
_template_yaml
task
:
advanced_ai_risk_lm-survival-instinct
lm_eval/tasks/model_written_evals/advanced_ai_risk/lm-wealth-seeking-inclination.yaml
0 → 100644
View file @
6f700f98
# Generated by _generate_configs.py
dataset_name
:
lm-wealth-seeking-inclination
include
:
_template_yaml
task
:
advanced_ai_risk_lm-wealth-seeking-inclination
lm_eval/tasks/model_written_evals/persona/_generate_configs.py
0 → 100644
View file @
6f700f98
import
yaml
import
inspect
import
datasets
from
tqdm
import
tqdm
def
main
()
->
None
:
dataset_path
=
"EleutherAI/persona"
for
task
in
tqdm
(
datasets
.
get_dataset_infos
(
dataset_path
).
keys
()):
file_name
=
f
"
{
task
}
.yaml"
try
:
with
open
(
f
"
{
file_name
}
"
,
"w"
)
as
f
:
f
.
write
(
"# Generated by _generate_configs.py
\n
"
)
yaml
.
dump
(
{
"include"
:
"_template_yaml"
,
"task"
:
f
"
{
dataset_path
.
split
(
'/'
)[
-
1
]
}
_
{
task
}
"
,
"dataset_name"
:
task
,
},
f
,
)
except
FileExistsError
:
pass
if
__name__
==
"__main__"
:
main
()
lm_eval/tasks/model_written_evals/persona/_template_yaml
0 → 100644
View file @
6f700f98
group: persona
dataset_path: EleutherAI/persona
output_type: multiple_choice
validation_split: validation
target_delimiter: ""
doc_to_text: "{{question}}"
doc_to_target: 0
doc_to_choice: "{{[answer_matching_behavior, answer_not_matching_behavior]}}"
metric_list:
- metric: acc
lm_eval/tasks/model_written_evals/persona/acts-like-it-wants-to-help-humans-but-does-not-care-about-that.yaml
0 → 100644
View file @
6f700f98
# Generated by _generate_configs.py
dataset_name
:
acts-like-it-wants-to-help-humans-but-does-not-care-about-that
include
:
_template_yaml
task
:
persona_acts-like-it-wants-to-help-humans-but-does-not-care-about-that
lm_eval/tasks/model_written_evals/persona/agreeableness.yaml
0 → 100644
View file @
6f700f98
# Generated by _generate_configs.py
dataset_name
:
agreeableness
include
:
_template_yaml
task
:
persona_agreeableness
lm_eval/tasks/model_written_evals/persona/anti-LGBTQ-rights.yaml
0 → 100644
View file @
6f700f98
# Generated by _generate_configs.py
dataset_name
:
anti-LGBTQ-rights
include
:
_template_yaml
task
:
persona_anti-LGBTQ-rights
lm_eval/tasks/model_written_evals/persona/anti-immigration.yaml
0 → 100644
View file @
6f700f98
# Generated by _generate_configs.py
dataset_name
:
anti-immigration
include
:
_template_yaml
task
:
persona_anti-immigration
lm_eval/tasks/model_written_evals/persona/being-helpful-to-subtly-achieve-goals-against-human-values.yaml
0 → 100644
View file @
6f700f98
# Generated by _generate_configs.py
dataset_name
:
being-helpful-to-subtly-achieve-goals-against-human-values
include
:
_template_yaml
task
:
persona_being-helpful-to-subtly-achieve-goals-against-human-values
Prev
1
2
3
4
5
6
7
…
10
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment