Commits · 343f785b07a90c4ef4df5621544b210641f56f81 · OpenDAS / opencompass

24 Aug, 2023 1 commit

[Feature]: Add Flamingo (#258) · 343f785b

Yuan Liu authored Aug 24, 2023

* [Feature]: Add Openflamingo MMBench

* [Fix]: Fix import error

* [Fix]: Revert task config

* [Fix]: Fix path bug

343f785b

23 Aug, 2023 3 commits

[Refactor] Refactor instructblip (#227) · 1034c487

Yixiao Fang authored Aug 23, 2023

* refactor instructblip

* add post processor

* add forward

* fix lint

* update

* update

1034c487

[Feature] Add Tree-of-Thought method (#173) · 02ce139b

liushz authored Aug 23, 2023



* Add ToT method

* Update ToT

* Update ToT

* Update ToT

* Update ToT

* Update ToT

* Update ToT

* Update ToT

* Update chain_of_thought.md

* Update icl_tot_inferencer.py

---------
Co-authored-by: liuhongwei <liuhongwei@pjlab.org.cn>

02ce139b

[Feature] Add llama2 native implements (#235) · ff5ab923

Leymore authored Aug 23, 2023



* add llama2 native implements

* rename configs/eval_llama_7b.py

---------
Co-authored-by: zhoufengzhe <zhoufengzhe@pjlab.org.cn>

ff5ab923

21 Aug, 2023 3 commits

[Feat] Support visualglm and llava for MMBench evaluation. (#211) · 8d368d1c

Yike Yuan authored Aug 21, 2023

* [Feat] Support visualglm inference on MMBench.

* [Feat] Support llava inference on MMBench.

* [Fix] Fix pre-commit format.

* [Fix] Add docstring for llava

* [Fix] Fix multi-process inference error of LlaVA and add comments.
1. Set `low_cpu_mem_usage` to False to address device issue.
2. Add docstring and type hints.
3. Rename class and remove registry.

* [Fix] Pre-commit fix.

* [Fix] add forward entry, add dynamic import to seedbench

* [Fix] Fix pre-commit.

* [Fix] Fix missing context.

* [Fix] Fix docstring.

8d368d1c

[Feat] Support multi-modal evaluation on MME benchmark. (#197) · a6552224

Yike Yuan authored Aug 21, 2023

* [Feat] Support multi-modal evaluation on MME benchmark.

* [Fix] Remove debug code.

* [Fix] Remove redundant codes and add type hints.

* [Fix] Rename in config.

* [Fix] Rebase main.

* [Fix] Fix isort and yapf conflict.

a6552224

[Dataset] LongBench (#236) · 655a807f
philipwangOvO authored Aug 21, 2023
```
Co-authored-by: wangchonghua <wangchonghua@pjlab.org.cn>
```
655a807f

17 Aug, 2023 5 commits

[Fix]: Fix name (#223) · 90c07a3d
Yuan Liu authored Aug 17, 2023

90c07a3d
[Feature]: Add launch script (#222) · 3d49a20b
Yuan Liu authored Aug 17, 2023

3d49a20b

[Feature] Support SEED-Bench (#203) · 0fa24826

Yixiao Fang authored Aug 17, 2023

* support seedbench

* update docstrings

* update

* update

* update

* update according to review

* rebase

* fix lint

* update

0fa24826

[Feature]: Add other public datasets config (#214) · ae3c1869

Yuan Liu authored Aug 17, 2023

* [Feature]: Add flickr30k

* [Feature]: Add GQA

* [Feature]: Add OCR VQA

* [Feature]: Add OK VQA

* [Feature]: Add text vqa

* [Feature]: Add other vqa

ae3c1869

[Feat] Add codegeex2 and Humanevalx (#210) · 17ccaa59

Ezra-Yu authored Aug 17, 2023

* add codegeex2

* add humanevalx dataset

* add evaluator

* update evaluator

* update configs

* update clean code

* update configs

* fix lint

* remove sleep

* fix lint

* update docs

* fix lint

17ccaa59

16 Aug, 2023 2 commits

[Feat] support adv_glue dataset for adversarial robustness (#205) · 0fe2366a
Hubert authored Aug 16, 2023
```
* [Feat] support adv_glue dataset for adversarial robustness

* reorg files

* minor fix

* minor fix
```
0fe2366a

[Feature]: Add other public datasets (#206) · 78df9bd0

Yuan Liu authored Aug 16, 2023

* [Feature]: Refactor class name

* [Feature]: Add minigpt-4 coco caption

* [Feature]: Update minigpt-4 coco caption

* [Feature]: Add MiniGPT-4 ScienceQA

* [Feature]: Add minigpt-4 vqav2

* [Feature]: Add VSR

* [Feature]: Revert task to previous version

78df9bd0

11 Aug, 2023 5 commits
- [Fix] fix bug for postprocessor (#195) · 7c393192
  Hubert authored Aug 11, 2023
```
* [Fix] fix bug for postprocessor

* minor fix
```
  7c393192
- [Feature] Add LEval datasets · bf79ff1c
  Tong Gao authored Aug 11, 2023
```
Co-authored-by: kennymckormick <dhd@pku.edu.cn>
```
  bf79ff1c
- [Feat] update postprocessor to get first option more accurately (#193) · 8d9cee06
  Hubert authored Aug 11, 2023
```
* [Feat] update postprocessor to get first option

* minor fix

* minor fix
```
  8d9cee06
- [Feature] add llama-oriented dataset configs (#82) · 14332e08
  Leymore authored Aug 11, 2023
```
* add llama-oriented dataset configs

* update

* revert cvalues & update llama_example
```
  14332e08
- [Feat] add safety to collections (#185) · 5a9539f3
  Hubert authored Aug 11, 2023
```
* [Feat] add safety to collections

* minor fix
```
  5a9539f3
10 Aug, 2023 5 commits

[Enhancement] Add humaneval postprocessor for GPT models & eval config for... · 2931f3dc

Tong Gao authored Aug 10, 2023


[Enhancement] Add humaneval postprocessor for GPT models & eval config for GPT4, enhance the original humaneval postprocessor (#129)

* [Enhancement] Enhance humaneval postprocessor

* add human-eval testcase

* update

* update

---------
Co-authored-by: Leymore <zfz-960727@163.com>

2931f3dc

[Feature] Support turbomind (#166) · 3f36db3b

Songyang Zhang authored Aug 10, 2023



* support turbomind

* update doc

* Update docs/en/advanced_guides/evaluation_turbomind.md
Co-authored-by: Tong Gao <gaotongxiao@gmail.com>

* Update docs/zh_cn/advanced_guides/evaluation_turbomind.md
Co-authored-by: Tong Gao <gaotongxiao@gmail.com>

* Update docs/zh_cn/advanced_guides/evaluation_turbomind.md
Co-authored-by: Tong Gao <gaotongxiao@gmail.com>

* Update docs/en/advanced_guides/evaluation_turbomind.md
Co-authored-by: Tong Gao <gaotongxiao@gmail.com>

* update

---------
Co-authored-by: Tong Gao <gaotongxiao@gmail.com>

3f36db3b

[Feature] Add Xiezhi SQuAD2.0 ANLI (#101) · e7fc54ba
Leymore authored Aug 10, 2023
```
* add Xiezhi SQuAD2.0 ANLI; update WSC

* update

* update

* update doc string
```
e7fc54ba
[Feature]: Refactor input and output (#176) · a205629f
Yuan Liu authored Aug 10, 2023
```
* [Feature]: Refactor input and output

* [Feature]: Update tasks
```
a205629f
[Fix] Fix AGIEval multiple choice (#137) · 876ade71
Leymore authored Aug 10, 2023
```
* update agieval data

* rename variables
```
876ade71

08 Aug, 2023 2 commits

[Feature] Calculate max_out_len without hard code for OpenAI model (#158) · af436f59

Zaida Zhou authored Aug 08, 2023



* calulate max_out_len without hard code

* set default value

* update configs

* Update configs/eval_gpt3.5.py
Co-authored-by: Tong Gao <gaotongxiao@gmail.com>

---------
Co-authored-by: Tong Gao <gaotongxiao@gmail.com>

af436f59

[Feature]: Add mm suport for local (#169) · 2f1949e7
Yuan Liu authored Aug 08, 2023

2f1949e7

03 Aug, 2023 1 commit

[Feature]: Use multimodal (#73) · 191a3f6f

Yuan Liu authored Aug 03, 2023



* [Feature]: Add minigpt-4

* [Feature]: Add mm local runner

* [Feature]: Add instructblip

* [Feature]: Delete redundant file

* [Feature]: Delete redundant file

* [Feature]: Add README to InstructBLIP

* [Feature]: Update MiniGPT-4

* [Fix]: Fix lint

* [Feature]add omnibenchmark readme (#49)

* add omnibenchmark readme

* fix

* Update OmniMMBench.md

* Update OmniMMBench.md

* Update OmniMMBench.md

* [Fix]: Refine name (#54)

* [Feature]: Unify out and err

* [Fix]: Fix lint

* [Feature]: Rename to mmbench and change weight path

* [Feature]: Delete Omni in instructblip

* [Feature]: Check the avaliablity of lavis

* [Fix]: Fix lint

* [Feature]: Refactor MM

* [Refactor]: Refactor path

* [Feature]: Delete redundant files

* [Refactor]: Delete redundant files

---------
Co-authored-by: Wangbo Zhao(黑色枷锁) <56866854+wangbo-zhao@users.noreply.github.com>

191a3f6f

01 Aug, 2023 1 commit
- [Feature] Evaluating acc based on minimum edit distance, update SIQA (#130) · c00179d4
  Tong Gao authored Aug 01, 2023
```
* [Feature] Support evaluating acc based on minimum edit distance, update SIQA

* update
```
  c00179d4
28 Jul, 2023 1 commit

[Feature] Add SC (#126) · d862f570

Leymore authored Jul 28, 2023



* add self-consistency

* add CoT method Self-Consistency

* fix typo error and update openicl_eval

* add tydiQA-GoldP task

* fix sc

* rename gsm8k_sc

* fix sc

* add self-consistency doc

* refine sc

---------
Authored-by: liushz <qq1791167085@163.com>

d862f570

27 Jul, 2023 1 commit

[Feature] Support intern lanuage model (#51) · 57fcfc97

gowithme authored Jul 27, 2023

* support internLM

* support internLM

* simplify intern model files

* update storage_manager

* support internLM

* Modify the file organization structure

* support internLM

* support internLM

* support internLM

* support internLM

* change some details

57fcfc97

26 Jul, 2023 1 commit

[Refactor] Update crows-pairs evaluation (#98) · b7184e9d

Hubert authored Jul 26, 2023

* [Refactor] Update crows-pairs evaluation

* [Refactor] Update crows-pairs evaluation

* minor

b7184e9d

25 Jul, 2023 2 commits

[Fix] Fix llama configs (#72) · 3715be65
Tong Gao authored Jul 25, 2023
```
Co-authored-by: Leymore <zfz-960727@163.com>
```
3715be65

[Feature] Add CMMLU dataset (#91) · e9cdb24d

Haonan Li authored Jul 25, 2023



* add CMMLU

* debug cmmlu

* add slurm args `qos`

* fix format: space before comment

* remove unused variable

* change the location of `answer is`

---------
Co-authored-by: 李浩楠 <lihaonan@lihaonandeMacBook-Air.local>
Co-authored-by: 李浩楠 <haonan.li>
Co-authored-by: Leymore <zfz-960727@163.com>

e9cdb24d

19 Jul, 2023 1 commit

[Feature] Add llama-2 models (#81) · eea8b044

Leymore authored Jul 19, 2023



* add llama-2 models

* update docs

---------
Co-authored-by: gaotongxiao <gaotongxiao@gmail.com>

eea8b044

18 Jul, 2023 3 commits
- [Feat] Support CValues Responsibility dataset (#78) · f83e125e
  Hubert authored Jul 18, 2023
```
* [Feat] support CValues

* minor fix
```
  f83e125e
- [Feature] Add tydiqa-goldp (#75) · f36c0496
  liushz authored Jul 18, 2023
```
Co-authored-by: liuhongwei <liuhongwei@pjlab.org.cn>
```
  f36c0496
- [Feat] add falcon-40b (#76) · 29598e36
  Hubert authored Jul 18, 2023
```
* [Feat] add falcon-40b

* minor fix
```
  29598e36
17 Jul, 2023 3 commits
- [Fix] eval_llama_7b (#68) · 9a164489
  Leymore authored Jul 17, 2023
  
  9a164489
- [Feature] Add baichuan13b model configs (#60) · edb23d15
  Leymore authored Jul 17, 2023
```
* [Feature] Add baichuan13b

* update num_gpus
```
  edb23d15
- [Feature] Add logger info and remove dataset bugs (#61) · 1326aff7
  Leymore authored Jul 17, 2023
```
* Add logger info and remove dataset bugs

* fix typo
```
  1326aff7