- 01 Nov, 2024 1 commit
-
-
Sypherd authored
-
- 31 Oct, 2024 1 commit
-
-
Qubitium-ModelCloud authored
* support gptqmodel
* code opt
* add gptqmodel option
* Update huggingface.py
* Update pyproject.toml
* gptqmodel version upgraded to 1.0.6
* GPTQModel version upgraded to 1.0.8
* Update pyproject.toml
* fix ruff-format error
* add gptqmodel test
* Update gptqmodel test model
* skip cuda
* python3.8 compatible
* Update README.md
* Update README.md
---------
Co-authored-by: CL-ModelCloud <cl@modelcloud.ai>
-
- 30 Oct, 2024 3 commits
-
-
Samuel Monson authored
-
zxcvuser authored
* Add xquad task
* Update general README
* Run pre-commit
-
Chris Kerwell Gresla authored
* fix: use lora_request for data parallel vllm evals
* fix(docs): include type hint
* chore: lint, et pre-commit al
---------
Co-authored-by: Chris Kerwell Gresla <chris@wafer.systems>
-
- 25 Oct, 2024 1 commit
-
-
Kiersten Stokes authored
* Update pyproject.toml with watsonx package extra
  Signed-off-by: kiersten-stokes <kierstenstokes@gmail.com>
* Remove unused function
  Signed-off-by: kiersten-stokes <kierstenstokes@gmail.com>
---------
Signed-off-by: kiersten-stokes <kierstenstokes@gmail.com>
-
- 23 Oct, 2024 1 commit
-
-
Nikodem Szwast authored
* add support for IBM watsonx_llm
* add ibm_watsonx_ai package to optional-dependencies
* move global scope imports to inner scope
* change cache to lru_cache
* fix circular import
* use 3.8 typing
* use 3.8 typing
---------
Co-authored-by: Baber <baber@hey.com>
-
- 22 Oct, 2024 2 commits
-
-
Leonid Sinev authored
* Replace generic exception classes with more specific ones
* rerun pre-commit to pass linter tests
* Revert "rerun pre-commit to pass linter tests"
  This reverts commit 67f88ccf144469853217704520e613196042d859.
* reduce repetitions in errors or so
* Replace generic exception class with a more specific one
-
Iker García-Ferrero authored
Update prompt according to: https://github.com/ikergarcia1996/NoticIA/blob/main/prompts.py
-
- 20 Oct, 2024 1 commit
-
-
Yuxian Gu authored
-
- 17 Oct, 2024 2 commits
- 16 Oct, 2024 1 commit
-
-
zxcvuser authored
* added tasks to spanish_bench
* fixed capitalization in escola and run pre-commit
* Update _flores_common_yaml
* Update _flores_common_yaml
* Update direct_yaml
* Update cot_yaml
* Update cot_yaml
* Update _flores_common_yaml
---------
Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
-
- 14 Oct, 2024 1 commit
-
-
Elron Bandel authored
* Add Unitxt Multimodality Support
  Signed-off-by: elronbandel <elronbandel@gmail.com>
* Update
  Signed-off-by: elronbandel <elronbandel@gmail.com>
* Fix formatting
  Signed-off-by: elronbandel <elronbandel@gmail.com>
---------
Signed-off-by: elronbandel <elronbandel@gmail.com>
-
- 08 Oct, 2024 4 commits
-
-
Hailey Schoelkopf authored
-
Hailey Schoelkopf authored
-
Baber Abbasi authored
* max_images are passed on to vllms `limit_mm_per_prompt`
* replace max image placeholders in string
* handle chat_template error
* move `fewshot_random_seed` to global
-
Baber Abbasi authored
* switch conditional checks to `self.backend`
* nit
* nit
* commit feedback
* fix test; update precommit hooks
* add escape hatch for custom self.AUTO_MODEL_CLASS
* add escape hatch for custom self.AUTO_MODEL_CLASS
* fix
* move assertion
* add logging messages
* update AUTO_MODEL_CLASS behavior in _get_backend
---------
Co-authored-by: haileyschoelkopf <hailey@eleuther.ai>
-
- 07 Oct, 2024 5 commits
-
-
Baber Abbasi authored
* tokenizer: trust-remote-code
* pre-commit
---------
Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
-
Chenjie Luo authored
* Fix float limit override
  See: https://github.com/EleutherAI/lm-evaluation-harness/issues/2324
  The float limit is overridden by the previous int limit if multiple tasks are triggered together. This PR fixes the issue.
* Update evaluator.py
* Update evaluator.py
-
am-bean authored
* Fixing scoring bugs for smaller models
* Catching another error type in parsing
-
kyujinHan authored
-
Baber Abbasi authored
* bugfix
* pre-commit
-
- 04 Oct, 2024 3 commits
-
-
Baber Abbasi authored
-
zxcvuser authored
* Add catalan_bench
* added flores_ca.yaml
* Updated some task groupings and readme
* Fix create_yamls_flores_ca.py
---------
Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
-
zxcvuser authored
* Add basque_bench
* Add flores_eu group
* Update _flores_common_yaml
* Run linters, updated flores, mgsm, copa, and readme
* Apply suggestions from code review
  Co-authored-by: Baber Abbasi <92168766+baberabb@users.noreply.github.com>
---------
Co-authored-by: Baber Abbasi <92168766+baberabb@users.noreply.github.com>
Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
-
- 03 Oct, 2024 2 commits
-
-
zxcvuser authored
* Add galician_bench
* Update xnli_gl path
* Add flores_gl group
* Update _flores_common_yaml
* Updated some task groupings and readme
-
zxcvuser authored
* Add spanish_bench
* Add flores_es group
* Update _flores_common_yaml
* Delete lm_eval/tasks/spanish_bench/escola.yaml
* Delete escola from spanish_bench.yaml
* Delete escola from README.md
* pre-commit run --all-files
* Updated some task groupings and readme
-
- 30 Sep, 2024 2 commits
-
-
Giulio Lovisotto authored
-
zxcvuser authored
* Add portuguese_bench
* Add flores_pt group
* Update _flores_common_yaml
* Run linters and update flores and readme
-
- 28 Sep, 2024 1 commit
-
-
eyuansu62 authored
* fix some bugs of mmlu
* Fix end of file newline issue
---------
Co-authored-by: eyuansu62 <772468951@qq.com>
-
- 26 Sep, 2024 9 commits
-
-
Baber Abbasi authored
* better error message; fix greedy matching
* Update lm_eval/models/openai_completions.py
  Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
* Update lm_eval/models/openai_completions.py
  Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
* pre-commit
---------
Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
-
Baber Abbasi authored
-
Arda authored
* Added TurkishMMLU to LM Evaluation Harness
* Fixed COT name
* Fixed COT name
* Updated Readme
* Fixed Test issues
* Completed Scan for changed tasks
* Updated Readme
* Update README.md
* fixup task naming casing + ensure yaml template stubs aren't registered
---------
Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
Co-authored-by: haileyschoelkopf <hailey@eleuther.ai>
-
Baber Abbasi authored
* add newlines to task descriptions; increment versions
* fix task tests (with groups)
* Apply suggestions from code review
---------
Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
-
Baber Abbasi authored
* change glianorex to test set
* nit
* fix test; doc_to_target can be str for multiple_choice
* nit
-
Baber Abbasi authored
-
Giulio Lovisotto authored
* Treat python tasks same as yaml tasks.
* Add tests.
* Re-add fixture decorators.
* Fix typing specification error for Python 3.9.
-
Baber Abbasi authored
-
Baber Abbasi authored
-