- 17 Jan, 2024 2 commits
-
-
Mo Li authored
* Add NeedleInAHaystack Test * Apply pre-commit formatting * Update configs/eval_hf_internlm_chat_20b_cdme.py Co-authored-by:
Songyang Zhang <tonysy@users.noreply.github.com> * add needle in haystack test * update needle in haystack test * update plot function in tools_needleinahaystack.py * optimizing needleinahaystack dataset generation strategy * modify minor formatting issues * add English version support * change NeedleInAHaystackDataset to dynamic loading * change NeedleInAHaystackDataset to dynamic loading * fix needleinahaystack test eval bug * fix needleinahaystack config bug * Added support for multi-needle testing in needle-in-a-haystack test * Optimize the code for plotting in the needle-in-a-haystack test. * Correct the typo in the dataset parameters. * update needleinahaystack test docs --------- Co-authored-by:
Songyang Zhang <tonysy@users.noreply.github.com>
-
RunningLeon authored
* update * fix * fix * fix
-
- 08 Jan, 2024 1 commit
-
-
Fengzhe Zhou authored
-
- 25 Dec, 2023 1 commit
-
-
Fengzhe Zhou authored
-
- 23 Dec, 2023 1 commit
-
-
Mo Li authored
* Add NeedleInAHaystack Test * Apply pre-commit formatting * Update configs/eval_hf_internlm_chat_20b_cdme.py Co-authored-by:
Songyang Zhang <tonysy@users.noreply.github.com> * add needle in haystack test * update needle in haystack test --------- Co-authored-by:
Songyang Zhang <tonysy@users.noreply.github.com>
-
- 21 Dec, 2023 1 commit
-
-
RunningLeon authored
[Feature] Update configs for evaluating chat models like qwen, baichuan, llama2 using turbomind backend (#721) * add llama2 test * fix * test qwen chat-7b * test w4 * add baichuan2 * update * update * update configs and docs * update
-
- 19 Dec, 2023 2 commits
-
-
Hubert authored
* [Docs] update docker docs * [Docs] update docker docs
-
bittersweet1999 authored
* add judgellms * add judgellms * add sub_size_partition * add docs * add ref
-
- 15 Dec, 2023 1 commit
-
-
Songyang Zhang authored
* update alignmentbench * update alignmentbench * update doc * update * update
-
- 13 Dec, 2023 1 commit
-
-
Fengzhe Zhou authored
* update contamination docs * add citation * Update contamination_eval.md * Update contamination_eval.md --------- Co-authored-by:Songyang Zhang <tonysy@users.noreply.github.com>
-
- 12 Dec, 2023 1 commit
-
-
bittersweet1999 authored
[Feature] Add double order of subjective evaluation and removing duplicated response among two models (#692) * add features * add doc string * add doc string
-
- 11 Dec, 2023 1 commit
-
-
bittersweet1999 authored
* new version of subject * fixed draw * fixed draw * fixed draw * done * done * done * done * fixed lint
-
- 08 Dec, 2023 1 commit
-
-
liyucheng09 authored
* add contamination analysis to ceval * fix bugs * add contamination docs * to pass CI check * update --------- Co-authored-by:
zhangyifan1 <zhangyifan1@pjlab.org.cn> Co-authored-by:
Leymore <zfz-960727@163.com>
-
- 23 Nov, 2023 1 commit
-
-
Fengzhe Zhou authored
* refactor default, add circular summarizer * add circular * update impl * update doc * minor update * no more to be added
-
- 22 Nov, 2023 1 commit
-
-
Hubert authored
-
- 21 Nov, 2023 2 commits
-
-
Lyu Han authored
* integrate turbomind python api * update * update user guide * update * fix according to reviewer's comments * fix error * fix linting * update user guide * remove debug log --------- Co-authored-by:Songyang Zhang <tonysy@users.noreply.github.com>
-
Yang Yong authored
* [Feature] Support Lightllm api * formatting & renaming --------- Co-authored-by:Leymore <zfz-960727@163.com>
-
- 16 Nov, 2023 1 commit
-
-
Hubert authored
* [Feat] support pass@ k * [Feat] support pass@k * [Feat] support pass@k * [Feat] support pass@k * [Feat] support pass@k * [Feat] support pass@k docs * update naming --------- Co-authored-by:Leymore <zfz-960727@163.com>
-
- 10 Nov, 2023 1 commit
-
-
Hubert authored
* [Docs] add humanevalx dataset link in config * [Docs] add humanevalx dataset link in config * minor fix
-
- 27 Oct, 2023 1 commit
-
-
Wei Jueqi authored
* rename * add en subdoc * fix name * fix writing * update --------- Co-authored-by:Leymore <zfz-960727@163.com>
-
- 25 Oct, 2023 1 commit
-
-
Hubert authored
-
- 07 Oct, 2023 1 commit
-
-
Lyu Han authored
* support tis * integrate turbomind inference via its RPC API instead of its python API * update guide * update ip address spec * update according to reviewer's comments
-
- 22 Sep, 2023 1 commit
-
-
Yixiao Fang authored
* add multimodal docs * fix lint --------- Co-authored-by:Tong Gao <gaotongxiao@gmail.com>
-
- 18 Sep, 2023 1 commit
-
-
philipwangOvO authored
* [Docs] Readme in longeval * [Docs] Readme in longeval * [Docs] Readme in longeval * [Docs] Readme in longeval * [Docs] Readme in longeval * [Docs] Readme in longeval * [Docs] Readme in longeval
-
- 15 Sep, 2023 1 commit
-
-
Hubert authored
* [Feat] support adv_glue dataset for adversarial robustness * reorg files * minor fix * minor fix * support prompt bench demo * minor fix * minor fix * minor fix * minor fix * minor fix * minor fix * minor fix * minor fix
-
- 07 Sep, 2023 1 commit
-
-
Songyang Zhang authored
-
- 06 Sep, 2023 1 commit
-
-
Hubert authored
* [Docs] update code evaluator docs * minor fix * minor fix
-
- 17 Aug, 2023 1 commit
-
-
Ezra-Yu authored
* add codegeex2 * add humanevalx dataset * add evaluator * update evaluator * update configs * update clean code * update configs * fix lint * remove sleep * fix lint * update docs * fix lint
-
- 10 Aug, 2023 2 commits
-
-
Songyang Zhang authored
* update news * update --------- Co-authored-by:gaotongxiao <gaotongxiao@gmail.com>
-
Songyang Zhang authored
* support turbomind * update doc * Update docs/en/advanced_guides/evaluation_turbomind.md Co-authored-by:
Tong Gao <gaotongxiao@gmail.com> * Update docs/zh_cn/advanced_guides/evaluation_turbomind.md Co-authored-by:
Tong Gao <gaotongxiao@gmail.com> * Update docs/zh_cn/advanced_guides/evaluation_turbomind.md Co-authored-by:
Tong Gao <gaotongxiao@gmail.com> * Update docs/en/advanced_guides/evaluation_turbomind.md Co-authored-by:
Tong Gao <gaotongxiao@gmail.com> * update --------- Co-authored-by:
Tong Gao <gaotongxiao@gmail.com>
-
- 06 Jul, 2023 2 commits
-
-
Hubert authored
* add en docs * update --------- Co-authored-by:gaotongxiao <gaotongxiao@gmail.com>
-
Ezra-Yu authored
* update docs * update docs * update docs
-
- 05 Jul, 2023 1 commit
-
-
mzr1996 authored
-
- 04 Jul, 2023 1 commit
-
-
yingfhu authored
-