- 17 Jan, 2025 7 commits
- 15 Jan, 2025 4 commits
-
-
Baber Abbasi authored
* add assistant prefix * add arc_challenge from llama * nit * nit * nit * add assistant prefix * add mmlu_llama * nit * nit * Revert "nit" This reverts commit 6a97f8356237305e375212b966b30e8de59dd4bc. * fix regex bug * add assistant_prefix to vllm * add `Question:` * add mmlu_pro * add fewshot assistant_prefix * use `assistant_prefill` * typehints * nits * nits * add to docs * add readme
-
Shivansh Pachnanda authored
* Add MLQA * add mlqa_common_yaml * add 49 tests of mlqa family * update tasks/README.md --------- * fix: mlqa ast error * nit: removed .yaml ext from template_yaml * nit changes: minor modifications generate_tasks.py * deleted lm_eval/tasks/mlqa/mlqa_common_yaml.yaml * tests updated * nit
-
Hojin Lee authored
* add mbpp * fix some bugs * add README for mbpp * update README * nits --------- Co-authored-by: Hojin Lee <19949034+hjlee1371@users.noreply.github.com> Co-authored-by: Baber <baber@hey.com>
-
Hojin Lee authored
* add custom filter * fix type casting of references * add humaneval * fix a bug in humaneval * add greedy version of humaneval * update tasks README * test humaneval * return multiple metrics * nit * add confirmation to run code tasks * nit * nit --------- Co-authored-by: Hojin Lee <19949034+hjlee1371@users.noreply.github.com> Co-authored-by: Baber <baber@hey.com>
-
- 07 Jan, 2025 3 commits
-
-
Wenyang LUO authored
* Fix the format of mgsm zh and ja. * Add change log to mgsm. * Add newline after changelog.
-
Petr Baudis authored
* fix(zeno): Generate unique ids in case of multiple filters * fix(zeno): Report even non-aggregable metrics, just not as metrics * pre-commit --------- Co-authored-by: Baber <baber@hey.com>
-
CL-ModelCloud authored
* hf support load gguf file * code review * code review * code clean up * note about use_fast compat with gguf --------- Co-authored-by: Qubitium-ModelCloud <qubitium@modelcloud.ai>
-
- 04 Jan, 2025 1 commit
-
-
Baber Abbasi authored
* remove yaml extension from phraes_va_common * remove yaml extension from winogenerated * remove yaml extension from phrases_es * no cache debug logging when not used
-
- 02 Jan, 2025 1 commit
-
-
Baber Abbasi authored
* update evaluate; update construct requests * update construct requests to handle `apply_chat_template` kwarg
-
- 30 Dec, 2024 1 commit
-
-
Baber Abbasi authored
upgrade transformers and peft in CI
-
- 25 Dec, 2024 1 commit
-
-
Wang, Yi authored
* fix extra_match low if batch_size > 1 Signed-off-by: Wang, Yi A <yi.a.wang@intel.com> * add sorting to logprobs * nit --------- Signed-off-by: Wang, Yi A <yi.a.wang@intel.com> Co-authored-by: Baber <baber@hey.com>
-
- 24 Dec, 2024 1 commit
-
-
Firoj Alam, Scientist, QCRI authored
* added aradice * Added ArabicMMLU Lev Configs * added ArabicMMLU egy configs * Added boolq configs * Added cultural bench configs * added openbookqa configs * Added PiQA configs * added winogrande configs * Added truthfulQA configs * Added aradice group config * Remove deleted files from repository * modified arabimmlu configs * modified metadata versions * fixed formatting using ruff * added aradice tasks information * pre-commit * Uptaded openbookqa utils * fixed formatting on obqa --------- Co-authored-by: Basel Mousi <bmousi@hbku.edu.qa> Co-authored-by: Baber <baber@hey.com>
-
- 20 Dec, 2024 7 commits
- 19 Dec, 2024 14 commits
-
-
Baber authored
-
Baber Abbasi authored
* add warning for truncation
-
Baber authored
-
Baber authored
-
Baber authored
-
Baber authored
-
Baber authored
-
Baber authored
-
Baber authored
-
Baber authored
-
shivalika-singh authored
* add global mmlu lite * add global mmlu lite * fix bugs * add task README.md * Update README.md * Update tasks README.md * Update README.md * update readme --------- Co-authored-by: shivi <shivalikasingh95@gmail.com>
-
Baber authored
-
Baber authored
-
Baber authored
-