- 18 Jan, 2024 4 commits
-
-
Yang Yong authored
* Add LightllmApi KeyError log * Update LightllmApi doc
-
zhulinJulia24 authored
* Update pr-run-test.yml * Update pr-run-test.yml * Update pr-run-test.yml * split step and change order, change schedule time and disable hf cache
-
Mo Li authored
-
RunningLeon authored
* update * update docs * add engine_config and gen_config in eval_config * update * fix * fix * fix * fix docstr * fix url
-
- 17 Jan, 2024 4 commits
-
-
Fengzhe Zhou authored
-
Fengzhe Zhou authored
Co-authored-by: zhangyifan1 <zhangyifan1@pjlab.org.cn>
-
Mo Li authored
* Add NeedleInAHaystack Test * Apply pre-commit formatting * Update configs/eval_hf_internlm_chat_20b_cdme.py Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com> * add needle in haystack test * update needle in haystack test * update plot function in tools_needleinahaystack.py * optimizing needleinahaystack dataset generation strategy * modify minor formatting issues * add English version support * change NeedleInAHaystackDataset to dynamic loading * change NeedleInAHaystackDataset to dynamic loading * fix needleinahaystack test eval bug * fix needleinahaystack config bug * Added support for multi-needle testing in needle-in-a-haystack test * Optimize the code for plotting in the needle-in-a-haystack test. * Correct the typo in the dataset parameters. * update needleinahaystack test docs --------- Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
-
RunningLeon authored
* update * fix * fix * fix
-
- 16 Jan, 2024 2 commits
-
-
bittersweet1999 authored
-
zhulinJulia24 authored
* init test yaml * add simple pr * update * update * change name * Update pr-run-test.yml * Update pr-run-test.yml --------- Co-authored-by: zhulin1 <zhulin1@pjlab.org.cn>
-
- 12 Jan, 2024 1 commit
-
-
bittersweet1999 authored
* add creationv2_zh * add creationv2_zh * add eng config for creationbench * add eng config for creationbench * add eng config for creationbench
-
- 11 Jan, 2024 4 commits
-
-
Hubert authored
-
Songyang Zhang authored
-
notoschord authored
Co-authored-by: notoschord <wangzekai@kanzhun.com>
-
bittersweet1999 authored
-
- 09 Jan, 2024 1 commit
-
-
Xiaoming Shi authored
* update medbench * medbench update * format medbench * format * Update * update * update * update suffix --------- Co-authored-by: 施晓明 <PJLAB\shixiaoming@pjnl104220118l.pjlab.org> Co-authored-by: Leymore <zfz-960727@163.com>
-
- 08 Jan, 2024 7 commits
-
-
Fengzhe Zhou authored
-
Fengzhe Zhou authored
-
jiangjin1999 authored
* jiangjin1999: in the _batch_generate function, add the MultiTokenEOSCriteria feature to speed up inference. * jiangjin1999: in the _batch_generate function, add the MultiTokenEOSCriteria feature to speed up inference. --------- Co-authored-by: jiangjin08 <jiangjin08@MBP-2F32S5MD6P-0029.local> Co-authored-by: jiangjin08 <jiangjin08@a.sh.vip.dianping.com>
-
Fengzhe Zhou authored
-
liyucheng09 authored
* Contamination analysis for ARC_c, mmlu, and Hellaswag * update `eval_contamination.py` * update `contamination.py` summarizer * fix `eval_contamination.py` * add mmlu groups for contamination analysis
-
tpoisonooo authored
* Update installation.md * Update installation.md
-
Yuchen Yan authored
Co-authored-by: yanyuchen04 <yanyuchen04@meituan.com>
-
- 05 Jan, 2024 4 commits
-
-
Connor-Shen authored
* support mbpp+ * support mbpp+ * minor fix * [Feat] minor fix --------- Co-authored-by: yingfhu <yingfhu@gmail.com>
-
bittersweet1999 authored
-
Songyang Zhang authored
* [Doc] Update Example of CompassBench * [Doc] Update Example of CompassBench * [Doc] Update Example of CompassBench * update * Update docs/zh_cn/advanced_guides/compassbench_intro.md Co-authored-by: Fengzhe Zhou <zfz-960727@163.com> --------- Co-authored-by: Fengzhe Zhou <zfz-960727@163.com>
-
bittersweet1999 authored
* add subject ir * Add ir dataset * Add ir dataset
-
- 04 Jan, 2024 1 commit
-
-
bittersweet1999 authored
* multi_round dataset * add multi_round evaluation
-
- 03 Jan, 2024 1 commit
-
-
bittersweet1999 authored
* fix small bugs * fix small bugs
-
- 02 Jan, 2024 3 commits
-
-
Chris Liu authored
* Support LLaMA2-Accessory * remove strip * clear imports * reformat * fix lint * fix lint * update readme * update readme * update readme * update readme
-
HUANG Fei authored
-
Mo Li authored
* Add NeedleInAHaystack Test * Apply pre-commit formatting * Update configs/eval_hf_internlm_chat_20b_cdme.py Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com> * add needle in haystack test * update needle in haystack test * update plot function in tools_needleinahaystack.py * optimizing needleinahaystack dataset generation strategy * modify minor formatting issues * add English version support * change NeedleInAHaystackDataset to dynamic loading * change NeedleInAHaystackDataset to dynamic loading * fix needleinahaystack test eval bug * fix needleinahaystack config bug --------- Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
-
- 01 Jan, 2024 2 commits
-
-
Francis-llgg authored
* check * message * add * change prompt * change a para name * modify name of the file * delete a useless file
-
Francis-llgg authored
* add new dataset mastermath2024v1 * change it to simplified chinese prompt * change file name
-
- 29 Dec, 2023 3 commits
-
-
Mo Li authored
* Add NeedleInAHaystack Test * Apply pre-commit formatting * Update configs/eval_hf_internlm_chat_20b_cdme.py Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com> * add needle in haystack test * update needle in haystack test * update plot function in tools_needleinahaystack.py * optimizing needleinahaystack dataset generation strategy * modify minor formatting issues --------- Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
-
Hubert authored
* [Feat] update code dataset * [Feat] update code dataset * [Feat] update code dataset
-
bittersweet1999 authored
-
- 28 Dec, 2023 3 commits
-
-
bittersweet1999 authored
-
Connor-Shen authored
* add chinese_version of humaneval,mbpp * add humaneval&mbpp gen.py * minor fix * minor add --------- Co-authored-by: yingfhu <yingfhu@gmail.com>
-
bittersweet1999 authored
* fix subjective_eval * subject_eval partition situation fixed * subject_eval partition situation fixed
-