Unverified Commit 8850ebc0 authored by dazipe's avatar dazipe Committed by GitHub
Browse files

Increase default max_gen_toks to 2048 and max_length to 8192 for MMLU Pro tests (#2824)

* Changed default max_length from 2048 to 8192 and max_gen_toks from 256 to 2048 fro MMLU Pro tasks.

* Update lm_eval/tasks/mmlu_pro/_default_template_yaml

* pre-commit

* nit

---------
parent ebbbb968
......@@ -62,3 +62,5 @@ If other tasks on this dataset are already supported:
* (tasks, group) 2024-09-23 -- (version 1 --> version 2)
* Added one newline to task description(s) as per [reference implementation](https://github.com/TIGER-AI-Lab/MMLU-Pro/blob/47b9891aacb8bd7cda29d5c5ba17b9434dd333bc/evaluate_from_local.py#L93)
* (tasks, group) 2025-03-20 -- (version 2.0 --> version 2.1)
* Changed default max_length from 2048 to 8192 and max_gen_toks from 256 to 2048.
......@@ -20,6 +20,7 @@ generation_kwargs:
- "</s>"
- "Q:"
- "<|im_end|>"
max_gen_toks: 2048
do_sample: false
temperature: 0.0
num_fewshot: 5
......@@ -30,4 +31,4 @@ metric_list:
ignore_case: true
ignore_punctuation: true
metadata:
version: 1.0
version: 2.1
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment