Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
2c0c345a
Unverified
Commit
2c0c345a
authored
Nov 17, 2023
by
Hailey Schoelkopf
Committed by
GitHub
Nov 17, 2023
Browse files
Merge pull request #1002 from EleutherAI/bugfixes-bigrefactor
[Refactor] Bugfixes
parents
f76941ef
487767ab
Changes
3
Hide whitespace changes
Inline
Side-by-side
Showing
3 changed files
with
9 additions
and
5 deletions
+9
-5
lm_eval/__main__.py
lm_eval/__main__.py
+5
-1
lm_eval/tasks/bigbench/generate_until_template_yaml
lm_eval/tasks/bigbench/generate_until_template_yaml
+2
-2
lm_eval/tasks/bigbench/multiple_choice_template_yaml
lm_eval/tasks/bigbench/multiple_choice_template_yaml
+2
-2
No files found.
lm_eval/__main__.py
View file @
2c0c345a
...
...
@@ -149,7 +149,11 @@ def cli_evaluate(args: Union[argparse.Namespace, None] = None) -> None:
if
os
.
path
.
isfile
(
task
):
config
=
utils
.
load_yaml_config
(
task
)
task_names
.
append
(
config
)
task_missing
=
[
task
for
task
in
tasks_list
if
task
not
in
task_names
]
task_missing
=
[
task
for
task
in
tasks_list
if
task
not
in
task_names
and
"*"
not
in
task
]
# we don't want errors if a wildcard ("*") task name was used
if
task_missing
:
missing
=
", "
.
join
(
task_missing
)
...
...
lm_eval/tasks/bigbench/generate_until_template_yaml
View file @
2c0c345a
group: bigbench
dataset_path: bigbench
# will switch to `hails/bigbench` when all tasks are pushed
group: bigbench
_generate_until
dataset_path:
hails/
bigbench
output_type: generate_until
dataset_kwargs:
# num_shots: 0 # TODO: num of shots for `bigbench` HF dataset should be controlled through this, not through the typical methods
...
...
lm_eval/tasks/bigbench/multiple_choice_template_yaml
View file @
2c0c345a
group: bigbench
dataset_path: bigbench
# will switch to `hails/bigbench` when all tasks are pushed
group: bigbench
_multiple_choice
dataset_path:
hails/
bigbench
dataset_kwargs:
# num_shots: 0 # TODO: num of shots for `bigbench` HF dataset should be controlled through this, not through the typical methods
# subtask_name: null
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment