Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
9701ef6e
Unverified
Commit
9701ef6e
authored
May 22, 2024
by
Jess
Committed by
GitHub
May 22, 2024
Browse files
Merge branch 'main' into africamgsm
parents
753e8670
fb142ccd
Changes
82
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
31 additions
and
2 deletions
+31
-2
lm_eval/tasks/afrimgsm/translate_direct/translate_direct_yaml
...val/tasks/afrimgsm/translate_direct/translate_direct_yaml
+29
-0
lm_eval/tasks/afrimgsm/utils.py
lm_eval/tasks/afrimgsm/utils.py
+2
-2
No files found.
lm_eval/tasks/afrimgsm/translate_direct/translate_direct_yaml
0 → 100644
View file @
9701ef6e
# This file will be included in the generated language-specific task configs.
# It doesn't have a yaml file extension as it is not meant to be imported directly
# by the harness.
group: translate_afrimgsm_direct
dataset_path: masakhane/afrimgsm-translate-test
dataset_name: null # Overridden by language-specific config.
output_type: generate_until
test_split: test
generation_kwargs:
until:
- "\n\n"
- "\n"
do_sample: false
temperature: 0.0
target_delimiter: " "
metric_list:
- metric: exact_match
aggregation: mean
higher_is_better: true
ignore_case: true
ignore_punctuation: true
filter_list:
- name: "get-answer"
filter:
- function: "regex"
regex_pattern: "The answer is (\\-?[0-9\\.\\,]+)"
- function: "take_first"
metadata:
version: 3.0
lm_eval/tasks/afrimgsm/utils.py
View file @
9701ef6e
...
...
@@ -69,8 +69,8 @@ def main() -> None:
)
parser
.
add_argument
(
"--mode"
,
default
=
"
direc
t"
,
choices
=
[
"direct"
,
"native-cot"
,
"en-cot"
],
default
=
"
native-co
t"
,
choices
=
[
"direct"
,
"direct-native"
,
"native-cot"
,
"en-cot"
,
"translate-direct"
],
help
=
"Mode of chain-of-thought"
,
)
args
=
parser
.
parse_args
()
...
...
Prev
1
2
3
4
5
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment