- 28 Dec, 2023 2 commits
-
-
Connor-Shen authored
* add chinese_version of humaneval,mbpp * add humaneval&mbpp gen.py * minor fix * minor add --------- Co-authored-by:yingfhu <yingfhu@gmail.com>
-
bittersweet1999 authored
* fix subjective_eval * subject_eval partition situation fixed * subject_eval partition situation fixed
-
- 27 Dec, 2023 3 commits
-
-
Hubert authored
-
bittersweet1999 authored
* add judgellm prompts * add judgelm prompts * update import info * fix situation that no abbr in config * fix situation that no abbr in config * add summarizer for other judgellm * change config name * add maxlen * add maxlen * dict assert * dict assert * fix strings * fix strings
-
Yang Yong authored
* Update LightllmApi and Fix mmlu bug * checkout mmlu_gen_a484b3.py --------- Co-authored-by:Leymore <zfz-960727@163.com>
-
- 26 Dec, 2023 1 commit
-
-
philipwangOvO authored
* add InfiniteBench * add InfiniteBench --------- Co-authored-by:wangchonghua <wangchonghua@pjlab.org.cn>
-
- 25 Dec, 2023 2 commits
-
-
Fengzhe Zhou authored
-
Songyang Zhang authored
-
- 23 Dec, 2023 3 commits
-
-
AllentDan authored
* add turbomind restful api support * config * top_p 0.8 * top_k = 1
-
bittersweet1999 authored
-
Mo Li authored
* Add NeedleInAHaystack Test * Apply pre-commit formatting * Update configs/eval_hf_internlm_chat_20b_cdme.py Co-authored-by:
Songyang Zhang <tonysy@users.noreply.github.com> * add needle in haystack test * update needle in haystack test --------- Co-authored-by:
Songyang Zhang <tonysy@users.noreply.github.com>
-
- 22 Dec, 2023 1 commit
-
-
loveSnowBest authored
* add news for teval * update * update doc for cz&en
-
- 21 Dec, 2023 2 commits
-
-
RunningLeon authored
[Feature] Update configs for evaluating chat models like qwen, baichuan, llama2 using turbomind backend (#721) * add llama2 test * fix * test qwen chat-7b * test w4 * add baichuan2 * update * update * update configs and docs * update
-
bittersweet1999 authored
* add_judgemodel_abbr * add judgemodel abbr
-
- 20 Dec, 2023 2 commits
-
-
Skyfall-xzz authored
* [Feature] Add reasonbench dataset * add configs for supporting generative inference & merge datasets in the same category * modify config filename to prompt version * fix codes to meet pre-commit requirements * lint the code to meet pre-commit requirements * Align Load_data Sourcecode Briefly * fix bugs * reduce code redundancy
-
Jingming authored
* [Feature] Support the use of humaneval_plus. * [Feature] Add humaneval_plus_gen.py * minor check * [Fix] Fix bug --------- Co-authored-by:yingfhu <yingfhu@gmail.com>
-
- 19 Dec, 2023 4 commits
-
-
bittersweet1999 authored
-
Hubert authored
* [Docs] update docker docs * [Docs] update docker docs
-
Hubert authored
* minor add * minor add * minor fix
-
bittersweet1999 authored
* add judgellms * add judgellms * add sub_size_partition * add docs * add ref
-
- 18 Dec, 2023 1 commit
-
-
Hubert authored
-
- 15 Dec, 2023 2 commits
-
-
Songyang Zhang authored
* update alignmentbench * update alignmentbench * update doc * update * update
-
Jingming authored
-
- 14 Dec, 2023 2 commits
-
-
DseidLi authored
Removed redundant code in GSM8KDataset.load method.
-
Songyang Zhang authored
* update alignmentbench * update alignmentbench * update alignmentbench
-
- 13 Dec, 2023 3 commits
-
-
bittersweet1999 authored
* alignmentbench infer and judge * alignmentbench * alignmentbench done * alignment all done * alignment all done
-
Fengzhe Zhou authored
* update contamination docs * add citation * Update contamination_eval.md * Update contamination_eval.md --------- Co-authored-by:Songyang Zhang <tonysy@users.noreply.github.com>
-
Hubert authored
-
- 12 Dec, 2023 4 commits
-
-
bittersweet1999 authored
[Feature] Add double order of subjective evaluation and removing duplicated response among two models (#692) * add features * add doc string * add doc string
-
Xiaoyu Zhang authored
* support rwkv5-3b learnboard * update rwkv-5-3b config * update config * refine * fix bug * update config * refine * reduce batch size * refine * reduce batch size to avoid oom in special datasets * Update huggingface.py * Update huggingface.py
-
Hubert authored
Co-authored-by:Leymore <zfz-960727@163.com>
-
bittersweet1999 authored
-
- 11 Dec, 2023 5 commits
-
-
bittersweet1999 authored
* new version of subject * fixed draw * fixed draw * fixed draw * done * done * done * done * fixed lint
-
Hubert authored
-
Hubert authored
-
Jingming authored
* [Feature] enhance the ability of humaneval_postprocess * refactor * [Feature] Keep the old version of the function and realize the new function in humaneval_postprocess_v2. * Update opencompass/datasets/humaneval.py --------- Co-authored-by:
Leymore <zfz-960727@163.com> Co-authored-by:
Hubert <42952108+yingfhu@users.noreply.github.com>
-
Hubert authored
* [Feat] support ci * [Feat] support ci * [Feat] support ci * [Feat] support ci * init docs * init docs * init docs
-
- 10 Dec, 2023 2 commits
-
-
Haodong Duan authored
-
Songyang Zhang authored
* [Enhancement] Update API interface * [Enhancement] Update API interface * Update mixtral * Update readme
-
- 09 Dec, 2023 1 commit
-
-
Xiaoming Shi authored
* update medbench * medbench update * format medbench * format --------- Co-authored-by:
施晓明 <PJLAB\shixiaoming@pjnl104220118l.pjlab.org> Co-authored-by:
Leymore <zfz-960727@163.com>
-