Commits · 81098722d22c6c31ecfe82d8aae031c5203a99de · OpenDAS / opencompass

28 Dec, 2023 2 commits

add chinese version of humaneval, mbpp (#743) · 81098722

Connor-Shen authored Dec 28, 2023



* add chinese_version of humaneval,mbpp

* add humaneval&mbpp gen.py

* minor fix

* minor add

---------
Co-authored-by: yingfhu <yingfhu@gmail.com>

81098722

[Fix] SubSizePartition fix (#746) · db919f01

bittersweet1999 authored Dec 28, 2023

* fix subjective_eval

* subject_eval partition situation fixed

* subject_eval partition situation fixed

db919f01

27 Dec, 2023 3 commits

[Feature] Support sanitized MBPP dataset (#745) · 0a525985
Hubert authored Dec 27, 2023

0a525985

[Feature] Add other judgelm prompts for Alignbench (#731) · dfd9ac0f

bittersweet1999 authored Dec 27, 2023

* add judgellm prompts

* add judgelm prompts

* update import info

* fix situation that no abbr in config

* fix situation that no abbr in config

* add summarizer for other judgellm

* change config name

* add maxlen

* add maxlen

* dict assert

* dict assert

* fix strings

* fix strings

dfd9ac0f

Update LightllmApi and Fix mmlu bug (#738) · 54345c56

Yang Yong authored Dec 27, 2023



* Update LightllmApi and Fix mmlu bug

* checkout mmlu_gen_a484b3.py

---------
Co-authored-by: Leymore <zfz-960727@163.com>

54345c56

26 Dec, 2023 1 commit

[Feature] Add InfiniteBench (#739) · 34561ece

philipwangOvO authored Dec 26, 2023



* add InfiniteBench

* add InfiniteBench

---------
Co-authored-by: wangchonghua <wangchonghua@pjlab.org.cn>

34561ece

25 Dec, 2023 2 commits
- [Sync] update configs (#734) · 3a68083e
  Fengzhe Zhou authored Dec 25, 2023
  
  3a68083e
- Update merge script (#733) · ad96f215
  Songyang Zhang authored Dec 25, 2023
  
  ad96f215
23 Dec, 2023 3 commits

add turbomind restful api support (#693) · 336d8d76
AllentDan authored Dec 24, 2023
```
* add turbomind restful api support

* config

* top_p 0.8

* top_k = 1
```
336d8d76
[Fix] Fix subjective alignbench (#730) · e985100c
bittersweet1999 authored Dec 23, 2023

e985100c

[Feature] Add NeedleInAHaystack Test Support (#714) · 0e24f421

Mo Li authored Dec 23, 2023



* Add NeedleInAHaystack Test

* Apply pre-commit formatting

* Update configs/eval_hf_internlm_chat_20b_cdme.py
Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>

* add needle in haystack test

* update needle in haystack test

---------
Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>

0e24f421

22 Dec, 2023 1 commit
- [News] add news for T-Eval (#727) · 4a2d1926
  loveSnowBest authored Dec 22, 2023
```
* add news for teval

* update

* update doc for cz&en
```
  4a2d1926
21 Dec, 2023 2 commits

[Feature] Update configs for evaluating chat models like qwen, baichuan,... · e34c5522

RunningLeon authored Dec 21, 2023

[Feature] Update configs for evaluating chat models like qwen, baichuan, llama2 using turbomind backend (#721)

* add llama2 test

* fix

* test qwen chat-7b

* test w4

* add baichuan2

* update

* update

* update configs and docs

* update

e34c5522

[Feature] Add abbr for judgemodel in subjective evaluation (#724) · fbb912dd
bittersweet1999 authored Dec 21, 2023
```
* add_judgemodel_abbr

* add judgemodel abbr
```
fbb912dd

20 Dec, 2023 2 commits

[Feature] Add ReasonBench(Internal) dataset (#577) · b35d9917

Skyfall-xzz authored Dec 20, 2023

* [Feature] Add reasonbench dataset

* add configs for supporting generative inference & merge datasets in the same category

* modify config filename to prompt version

* fix codes to meet pre-commit requirements

* lint the code to meet pre-commit requirements

* Align Load_data Sourcecode Briefly

* fix bugs

* reduce code redundancy

b35d9917

[Feature] Support the use of humaneval_plus. (#720) · 76a95e9e

Jingming authored Dec 20, 2023



* [Feature] Support the use of humaneval_plus.

* [Feature] Add humaneval_plus_gen.py

* minor check

* [Fix] Fix bug

---------
Co-authored-by: yingfhu <yingfhu@gmail.com>

76a95e9e

19 Dec, 2023 4 commits
- quick fix for maxoutlen (#719) · 47e745d7
  bittersweet1999 authored Dec 20, 2023
  
  47e745d7
- [Docs] Update Docker docs (#718) · fdf18a32
  Hubert authored Dec 19, 2023
```
* [Docs] update docker docs

* [Docs] update docker docs
```
  fdf18a32
- [Feat] Update math/agent (#716) · 5e8b838f
  Hubert authored Dec 19, 2023
```
* minor add

* minor add

* minor fix
```
  5e8b838f
- [Feature] Add JudgeLLMs (#710) · 97c2068b
  bittersweet1999 authored Dec 19, 2023
```
* add judgellms

* add judgellms

* add sub_size_partition

* add docs

* add ref
```
  97c2068b
18 Dec, 2023 1 commit
- [Fix] minor fix openai (#711) · eda72e75
  Hubert authored Dec 18, 2023
  
  eda72e75
15 Dec, 2023 2 commits
- [Doc] Update Doc for Alignbench (#707) · 637628a7
  Songyang Zhang authored Dec 15, 2023
```
* update alignmentbench

* update alignmentbench

* update doc

* update

* update
```
  637628a7
- [Fix] fix a bug on configs/eval_mixtral_8x7b.py (#706) · d7e7a637
  Jingming authored Dec 15, 2023
  
  d7e7a637
14 Dec, 2023 2 commits
- [Fix] remove redundant in gsm8k.py (#700) · db292032
  DseidLi authored Dec 14, 2023
```
Removed redundant code in GSM8KDataset.load method.
```
  db292032
- [Fix] Update alignmentbench (#704) · bfe4aa2a
  Songyang Zhang authored Dec 14, 2023
```
* update alignmentbench

* update alignmentbench

* update alignmentbench
```
  bfe4aa2a
13 Dec, 2023 3 commits

[Feature] Support AlignmentBench infer and judge (#697) · 1fe152b3

bittersweet1999 authored Dec 13, 2023

* alignmentbench infer and judge

* alignmentbench

* alignmentbench done

* alignment all done

* alignment all done

1fe152b3

[Doc] Update contamination docs (#698) · cadab947

Fengzhe Zhou authored Dec 13, 2023



* update contamination docs

* add citation

* Update contamination_eval.md

* Update contamination_eval.md

---------
Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>

cadab947

[Feat] update python action and slurm (#694) · a94598d9
Hubert authored Dec 13, 2023

a94598d9

12 Dec, 2023 4 commits

[Feature] Add double order of subjective evaluation and removing duplicated... · 61303941

bittersweet1999 authored Dec 12, 2023

[Feature] Add double order of subjective evaluation and removing duplicated response among two models (#692)

* add features

* add doc string

* add doc string

61303941

add rwkv-5-3b model (#666) · 82a533a6

Xiaoyu Zhang authored Dec 12, 2023

* support rwkv5-3b learnboard

* update rwkv-5-3b config

* update config

* refine

* fix bug

* update config

* refine

* reduce batch size

* refine

* reduce batch size to avoid oom in special datasets

* Update huggingface.py

* Update huggingface.py

82a533a6

[Sync] format (#690) · 4780b39e
Hubert authored Dec 12, 2023
```
Co-authored-by: Leymore <zfz-960727@163.com>
```
4780b39e
[Fix] Hotfix for Subjective Evaluation (#686) · 3e771757
bittersweet1999 authored Dec 12, 2023

3e771757

11 Dec, 2023 5 commits

[Feature] Add Subjective Evaluation (#680) · 465308e4

bittersweet1999 authored Dec 11, 2023

* new version of subject

* fixed draw

* fixed draw

* fixed draw

* done

* done

* done

* done

* fixed lint

465308e4

[Fix] fix docstring (#684) · 4f0b373a
Hubert authored Dec 11, 2023

4f0b373a
[Sync] minor test (#683) · e78857ac
Hubert authored Dec 11, 2023

e78857ac

[Feature] enhance the ability of humaneval_postprocess (#676) · dd4318f6

Jingming authored Dec 11, 2023



* [Feature] enhance the ability of humaneval_postprocess

* refactor

* [Feature] Keep the old version of the function and realize the new function in humaneval_postprocess_v2.

* Update opencompass/datasets/humaneval.py

---------
Co-authored-by: Leymore <zfz-960727@163.com>
Co-authored-by: Hubert <42952108+yingfhu@users.noreply.github.com>

dd4318f6

[Feat] support pr merge test ci (#669) · 1029119e

Hubert authored Dec 11, 2023

* [Feat] support ci

* [Feat] support ci

* [Feat] support ci

* [Feat] support ci

* init docs

* init docs

* init docs

1029119e

10 Dec, 2023 2 commits
- [Doc] Update README (#682) · 6a928b99
  Haodong Duan authored Dec 10, 2023
  
  6a928b99
- [Enhancement] Update API Interface and Mixtral (#681) · e25c5f95
  Songyang Zhang authored Dec 10, 2023
```
* [Enhancement] Update API interface

* [Enhancement] Update API interface

* Update mixtral

* Update readme
```
  e25c5f95
09 Dec, 2023 1 commit

[Feature] Add medbench (#678) · 1bf85949

Xiaoming Shi authored Dec 09, 2023



* update medbench

* medbench update

* format medbench

* format

---------
Co-authored-by: 施晓明 <PJLAB\shixiaoming@pjnl104220118l.pjlab.org>
Co-authored-by: Leymore <zfz-960727@163.com>

1bf85949