gaoqiong / lm-evaluation-harness
Commit 1b7d57cf, authored Dec 15, 2023 by lintangsutawika

Merge branch 'main' of https://github.com/EleutherAI/lm-evaluation-harness into alt_worlds

Parents: c027bc92, 755bf6e8
Showing 4 changed files with 4 additions and 4 deletions (+4 -4)
lm_eval/tasks/glue/mnli/mismatch.yaml    +0 -1
lm_eval/tasks/glue/qqp/default.yaml      +0 -1
lm_eval/tasks/hellaswag/hellaswag.yaml   +1 -1
scripts/write_out.py                     +3 -1

lm_eval/tasks/glue/mnli/mismatch.yaml @ 1b7d57cf

    include: default.yaml
    task: mnli_mismatch
    validation_split: validation_mismatched
    test_split: test_mismatched

lm_eval/tasks/glue/qqp/default.yaml @ 1b7d57cf

...
@@ -5,7 +5,6 @@ dataset_name: qqp
    output_type: multiple_choice
    training_split: train
    validation_split: validation
    test_split: test
    doc_to_text: "\nSentence 1: {{question1}}\nSentence 2: {{question2}}\nAnswer:"
    doc_to_target: label
    doc_to_choice: ["no", "yes"]
...

lm_eval/tasks/hellaswag/hellaswag.yaml @ 1b7d57cf

...
@@ -10,7 +10,7 @@ test_split: null
 process_docs: !function utils.process_docs
 doc_to_text: "{{query}}"
 doc_to_target: "{{label}}"
-doc_to_choice: "{{choices}}"
+doc_to_choice: "choices"
 metric_list:
   - metric: acc
     aggregation: mean
...
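
The hellaswag change swaps a template string ("{{choices}}") for a bare field name ("choices"). As a rough illustration of why that distinction can matter, the sketch below contrasts the two in plain Python. It is not the harness's code: the `doc` record is made up, and `str()` stands in for a template engine, on the assumption that rendering a list inside a template yields its text form.

```python
import ast

# Hypothetical document record; field names mirror the YAML above,
# the values are invented for illustration.
doc = {"query": "A person is cooking.", "choices": ["stirs", "sleeps"], "label": "0"}

# Field-name style ("choices"): a direct lookup returns the list itself.
by_name = doc["choices"]

# Template style ("{{choices}}"): rendering produces text, assumed here to be
# the list's string form, which must be parsed back into a list before use.
rendered = str(doc["choices"])
parsed = ast.literal_eval(rendered)

print(type(by_name).__name__, type(rendered).__name__, parsed == by_name)
```

Under these assumptions the field-name form skips the render-and-parse round trip entirely.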

scripts/write_out.py @ 1b7d57cf

...
@@ -12,7 +12,7 @@ EXAMPLE_DIVIDER = "!!@@##@@!! -- Example {i}\n"
 def parse_args():
     parser = argparse.ArgumentParser()
-    parser.add_argument("--output_base_path", required=True)
+    parser.add_argument("--output_base_path", "--output_path", required=True)
     parser.add_argument("--tasks", default="all_tasks")
     parser.add_argument("--sets", type=str, default="val")  # example: val,test
     parser.add_argument("--num_fewshot", type=int, default=1)
...
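
Passing two option strings to a single `add_argument` call makes the second an alias of the first. A minimal sketch of that behavior follows; `build_parser` is a hypothetical helper, not part of `write_out.py`. Because argparse derives the attribute name from the first long option, the parsed value remains available as `args.output_base_path` whichever spelling is used on the command line.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical helper: a parser with one aliased, required option."""
    parser = argparse.ArgumentParser()
    # Two option strings, one argument. dest is inferred from the first
    # long option string, so it is "output_base_path".
    parser.add_argument("--output_base_path", "--output_path", required=True)
    return parser


old_style = build_parser().parse_args(["--output_base_path", "out"])
new_style = build_parser().parse_args(["--output_path", "out"])

# Both spellings populate the same attribute.
print(old_style.output_base_path, new_style.output_base_path)
```

This is consistent with the later `os.makedirs(args.output_base_path, exist_ok=True)` call continuing to work unchanged.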
@@ -51,6 +51,8 @@ def main():
     os.makedirs(args.output_base_path, exist_ok=True)
     for task_name, task in task_dict.items():
+        if type(task) == tuple:
+            group_name, task = task
         rnd = random.Random()
         rnd.seed(args.seed)
...
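
The added lines in `main()` handle entries of `task_dict` whose value is a `(group_name, task)` tuple rather than a bare task. The following sketch shows the same unwrapping in isolation. `unwrap_task` is a hypothetical helper, and the dictionary contents are placeholders (strings stand in for task objects; the group name is invented).

```python
def unwrap_task(entry):
    """Return (group_name, task); group_name is None for ungrouped entries."""
    if type(entry) == tuple:  # same check as in the diff above
        group_name, task = entry
        return group_name, task
    return None, entry


# Placeholder data: one grouped entry, one ungrouped entry.
task_dict = {
    "task_a": ("some_group", "task-object-a"),
    "task_b": "task-object-b",
}

unwrapped = {name: unwrap_task(entry) for name, entry in task_dict.items()}
print(unwrapped)
```

Returning `None` for the group of an ungrouped entry is a choice made for this sketch only; the script itself simply leaves `task` untouched in that case.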