Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
1a77b4d5
Commit
1a77b4d5
authored
Oct 03, 2023
by
haileyschoelkopf
Browse files
update generate_tasks.py and template yamls
parent
605787a9
Changes
3
Hide whitespace changes
Inline
Side-by-side
Showing
3 changed files
with
9 additions
and
7 deletions
+9
-7
lm_eval/tasks/bigbench/generate_tasks.py
lm_eval/tasks/bigbench/generate_tasks.py
+1
-1
lm_eval/tasks/bigbench/greedy_until_template_yaml
lm_eval/tasks/bigbench/greedy_until_template_yaml
+2
-3
lm_eval/tasks/bigbench/multiple_choice_template_yaml
lm_eval/tasks/bigbench/multiple_choice_template_yaml
+6
-3
No files found.
lm_eval/tasks/bigbench/generate_tasks.py
View file @
1a77b4d5
...
@@ -185,7 +185,7 @@ def main() -> None:
...
@@ -185,7 +185,7 @@ def main() -> None:
{
{
"include"
:
f
"../
{
task_type
}
"
,
"include"
:
f
"../
{
task_type
}
"
,
"task"
:
"bigbench_"
+
task
+
"_{}"
.
format
(
task_type
.
split
(
"_template_yaml"
)[
0
]),
"task"
:
"bigbench_"
+
task
+
"_{}"
.
format
(
task_type
.
split
(
"_template_yaml"
)[
0
]),
"dataset_name"
:
task
,
"dataset_name"
:
task
+
"_zero_shot"
,
# zero-shot version of the dataset
},
},
f
,
f
,
width
=
float
(
"inf"
),
allow_unicode
=
True
width
=
float
(
"inf"
),
allow_unicode
=
True
...
...
lm_eval/tasks/bigbench/greedy_until_template_yaml
View file @
1a77b4d5
group: bigbench
group: bigbench
dataset_path: bigbench
dataset_path:
hails/
bigbench
output_type: greedy_until
output_type: greedy_until
training_split: train
test_split: default
validation_split: validation
doc_to_text: inputs
doc_to_text: inputs
doc_to_target: "{{targets[0]}}"
doc_to_target: "{{targets[0]}}"
generation_kwargs:
generation_kwargs:
...
...
lm_eval/tasks/bigbench/multiple_choice_template_yaml
View file @
1a77b4d5
group: bigbench
group: bigbench
dataset_path: bigbench
dataset_path: hails/bigbench
dataset_kwargs:
num_shots: 0 # TODO: num of shots should be controlled through this, not through the typical methods
# subtask_name: null
output_type: multiple_choice
output_type: multiple_choice
training_split: train
test_split: default
validation_split: validation
doc_to_text: inputs
doc_to_text: inputs
doc_to_target: "{{multiple_choice_targets.index(targets[0])}}"
doc_to_target: "{{multiple_choice_targets.index(targets[0])}}"
doc_to_choice: "{{multiple_choice_targets}}"
doc_to_choice: "{{multiple_choice_targets}}"
metric_list:
metric_list:
- metric: acc
- metric: acc
# TODO: brier score and other metrics
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment