- 24 Aug, 2023 1 commit
-
-
Yuan Liu authored
* [Feature]: Add Openflamingo MMBench * [Fix]: Fix import error * [Fix]: Revert task config * [Fix]: Fix path bug
-
- 23 Aug, 2023 3 commits
-
-
Yixiao Fang authored
* refactor instructblip * add post processor * add forward * fix lint * update * update
-
liushz authored
* Add ToT method * Update ToT * Update ToT * Update ToT * Update ToT * Update ToT * Update ToT * Update ToT * Update chain_of_thought.md * Update icl_tot_inferencer.py --------- Co-authored-by:liuhongwei <liuhongwei@pjlab.org.cn>
-
Leymore authored
* add llama2 native implements * rename configs/eval_llama_7b.py --------- Co-authored-by:zhoufengzhe <zhoufengzhe@pjlab.org.cn>
-
- 21 Aug, 2023 3 commits
-
-
Yike Yuan authored
* [Feat] Support visualglm inference on MMBench. * [Feat] Support llava inference on MMBench. * [Fix] Fix pre-commit format. * [Fix] Add docstring for llava * [Fix] Fix multi-process inference error of LlaVA and add comments. 1. Set `low_cpu_mem_usage` to False to address device issue. 2. Add docstring and type hints. 3. Rename class and remove registry. * [Fix] Pre-commit fix. * [Fix] add forward entry, add dynamic import to seedbench * [Fix] Fix pre-commit. * [Fix] Fix missing context. * [Fix] Fix docstring.
-
Yike Yuan authored
* [Feat] Support multi-modal evaluation on MME benchmark. * [Fix] Remove debug code. * [Fix] Remove redundant codes and add type hints. * [Fix] Rename in config. * [Fix] Rebase main. * [Fix] Fix isort and yapf conflict.
-
philipwangOvO authored
Co-authored-by:wangchonghua <wangchonghua@pjlab.org.cn>
-
- 17 Aug, 2023 5 commits
-
-
Yuan Liu authored
-
Yuan Liu authored
-
Yixiao Fang authored
* support seedbench * update docstrings * update * update * update * update according to review * rebase * fix lint * update
-
Yuan Liu authored
* [Feature]: Add flickr30k * [Feature]: Add GQA * [Feature]: Add OCR VQA * [Feature]: Add OK VQA * [Feature]: Add text vqa * [Feature]: Add other vqa
-
Ezra-Yu authored
* add codegeex2 * add humanevalx dataset * add evaluator * update evaluator * update configs * update clean code * update configs * fix lint * remove sleep * fix lint * update docs * fix lint
-
- 16 Aug, 2023 2 commits
-
-
Hubert authored
* [Feat] support adv_glue dataset for adversarial robustness * reorg files * minor fix * minor fix
-
Yuan Liu authored
* [Feature]: Refactor class name * [Feature]: Add minigpt-4 coco caption * [Feature]: Update minigpt-4 coco caption * [Feature]: Add MiniGPT-4 ScienceQA * [Feature]: Add minigpt-4 vqav2 * [Feature]: Add VSR * [Feature]: Revert task to previous version
-
- 11 Aug, 2023 5 commits
-
-
Hubert authored
* [Fix] fix bug for postprocessor * minor fix
-
Tong Gao authored
Co-authored-by:kennymckormick <dhd@pku.edu.cn>
-
Hubert authored
* [Feat] update postprocessor to get first option * minor fix * minor fix
-
Leymore authored
* add llama-oriented dataset configs * update * revert cvalues & update llama_example
-
Hubert authored
* [Feat] add safety to collections * minor fix
-
- 10 Aug, 2023 5 commits
-
-
Tong Gao authored
[Enhancement] Add humaneval postprocessor for GPT models & eval config for GPT4, enhance the original humaneval postprocessor (#129) * [Enhancement] Enhance humaneval postprocessor * add human-eval testcase * update * update --------- Co-authored-by:Leymore <zfz-960727@163.com>
-
Songyang Zhang authored
* support turbomind * update doc * Update docs/en/advanced_guides/evaluation_turbomind.md Co-authored-by:
Tong Gao <gaotongxiao@gmail.com> * Update docs/zh_cn/advanced_guides/evaluation_turbomind.md Co-authored-by:
Tong Gao <gaotongxiao@gmail.com> * Update docs/zh_cn/advanced_guides/evaluation_turbomind.md Co-authored-by:
Tong Gao <gaotongxiao@gmail.com> * Update docs/en/advanced_guides/evaluation_turbomind.md Co-authored-by:
Tong Gao <gaotongxiao@gmail.com> * update --------- Co-authored-by:
Tong Gao <gaotongxiao@gmail.com>
-
Leymore authored
* add Xiezhi SQuAD2.0 ANLI; update WSC * update * update * update doc string
-
Yuan Liu authored
* [Feature]: Refactor input and output * [Feature]: Update tasks
-
Leymore authored
* update agieval data * rename variables
-
- 08 Aug, 2023 2 commits
-
-
Zaida Zhou authored
* calulate max_out_len without hard code * set default value * update configs * Update configs/eval_gpt3.5.py Co-authored-by:
Tong Gao <gaotongxiao@gmail.com> --------- Co-authored-by:
Tong Gao <gaotongxiao@gmail.com>
-
Yuan Liu authored
-
- 03 Aug, 2023 1 commit
-
-
Yuan Liu authored
* [Feature]: Add minigpt-4 * [Feature]: Add mm local runner * [Feature]: Add instructblip * [Feature]: Delete redundant file * [Feature]: Delete redundant file * [Feature]: Add README to InstructBLIP * [Feature]: Update MiniGPT-4 * [Fix]: Fix lint * [Feature]add omnibenchmark readme (#49) * add omnibenchmark readme * fix * Update OmniMMBench.md * Update OmniMMBench.md * Update OmniMMBench.md * [Fix]: Refine name (#54) * [Feature]: Unify out and err * [Fix]: Fix lint * [Feature]: Rename to mmbench and change weight path * [Feature]: Delete Omni in instructblip * [Feature]: Check the avaliablity of lavis * [Fix]: Fix lint * [Feature]: Refactor MM * [Refactor]: Refactor path * [Feature]: Delete redundant files * [Refactor]: Delete redundant files --------- Co-authored-by:Wangbo Zhao(黑色枷锁) <56866854+wangbo-zhao@users.noreply.github.com>
-
- 01 Aug, 2023 1 commit
-
-
Tong Gao authored
* [Feature] Support evaluating acc based on minimum edit distance, update SIQA * update
-
- 28 Jul, 2023 1 commit
-
-
Leymore authored
* add self-consistency * add CoT method Self-Consistency * fix typo error and update openicl_eval * add tydiQA-GoldP task * fix sc * rename gsm8k_sc * fix sc * add self-consistency doc * refine sc --------- Authored-by:liushz <qq1791167085@163.com>
-
- 27 Jul, 2023 1 commit
-
-
gowithme authored
* support internLM * support internLM * simplify intern model files * update storage_manager * support internLM * Modify the file organization structure * support internLM * support internLM * support internLM * support internLM * change some details
-
- 26 Jul, 2023 1 commit
-
-
Hubert authored
* [Refactor] Update crows-pairs evaluation * [Refactor] Update crows-pairs evaluation * minor
-
- 25 Jul, 2023 2 commits
-
-
Tong Gao authored
Co-authored-by:Leymore <zfz-960727@163.com>
-
Haonan Li authored
* add CMMLU * debug cmmlu * add slurm args `qos` * fix format: space before comment * remove unused variable * change the location of `answer is` --------- Co-authored-by:
李浩楠 <lihaonan@lihaonandeMacBook-Air.local> Co-authored-by: 李浩楠 <haonan.li> Co-authored-by:
Leymore <zfz-960727@163.com>
-
- 19 Jul, 2023 1 commit
-
-
Leymore authored
* add llama-2 models * update docs --------- Co-authored-by:gaotongxiao <gaotongxiao@gmail.com>
-
- 18 Jul, 2023 3 commits
-
-
Hubert authored
* [Feat] support CValues * minor fix
-
liushz authored
Co-authored-by:liuhongwei <liuhongwei@pjlab.org.cn>
-
Hubert authored
* [Feat] add falcon-40b * minor fix
-
- 17 Jul, 2023 3 commits