Unverified Commit 1448be00 authored by liushz's avatar liushz Committed by GitHub
Browse files

Update MathBench (#1176)



* Add Math Evaluation with Judge Model Evaluator

* Add Math Evaluation with Judge Model Evaluator

* Add Math Evaluation with Judge Model Evaluator

* Add Math Evaluation with Judge Model Evaluator

* Fix Llama-3 meta template

* Fix MATH with JudgeLM Evaluation

* Fix MATH with JudgeLM Evaluation

* Fix MATH with JudgeLM Evaluation

* Fix MATH with JudgeLM Evaluation

* Update acclerator

* Update MathBench

---------
Co-authored-by: default avatarliuhongwei <liuhongwei@pjlab.org.cn>
parent 2b3d4150
......@@ -91,7 +91,7 @@ for _split in list(mathbench_sets.keys()):
),
),
retriever=dict(type=ZeroRetriever),
inferencer=dict(type=GenInferencer, max_out_len=512),
inferencer=dict(type=GenInferencer, max_out_len=2048),
)
mathbench_eval_cfg = dict(
......
from mmengine.config import read_base
with read_base():
from .mathbench_2024_gen_649349 import mathbench_datasets # noqa: F401, F403
from .mathbench_2024_gen_19e486 import mathbench_datasets # noqa: F401, F403
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment