- 21 Jan, 2025 1 commit
-
-
Baber authored
-
- 02 Jan, 2025 10 commits
- 10 Dec, 2024 5 commits
- 09 Dec, 2024 2 commits
-
-
Maanu Grover authored
* update import Signed-off-by:
Maanu Grover <maanug@nvidia.com> * run formatting --------- Signed-off-by:
Maanu Grover <maanug@nvidia.com>
-
Baber Abbasi authored
* left truncate for generate_until * pre-commit
-
- 05 Dec, 2024 1 commit
-
-
fzyzcjy authored
-
- 04 Dec, 2024 3 commits
-
-
Slawomir Strehlke authored
* Handle pipeline_parallel parameter * Add description of pipeline parallelism with OV models
-
Baber Abbasi authored
-
Baber Abbasi authored
* Update README.md add caching tip to readme * Update README.md add api link
-
- 03 Dec, 2024 2 commits
-
-
Trawinski, Dariusz authored
* avoid timeout errors with high concurrency in api_model * style * add timeout * add docs --------- Co-authored-by:Baber <baber@hey.com>
-
Naiara Perez authored
-
- 01 Dec, 2024 1 commit
-
-
Yoav Katz authored
Update Unitxt task to use locally installed unitxt and not download Unitxt code from Huggingface (#2514) * Moved to require unitxt installation and not download unitxt from HF hub. This has performance benefits and simplifies the code. Signed-off-by:
Yoav Katz <katz@il.ibm.com> * Updated watsonx documentation * Updated installation instructions * Removed redundant comman * Allowed unitxt tasks to generate chat APIs Modified WatsonXI model to support chat apis * Removed print * Run precommit formatting --------- Signed-off-by:
Yoav Katz <katz@il.ibm.com>
-
- 30 Nov, 2024 1 commit
-
-
Baber Abbasi authored
* make utility function to handle `until` * fix text
-
- 29 Nov, 2024 1 commit
-
-
Baber Abbasi authored
-
- 28 Nov, 2024 1 commit
-
-
Baber Abbasi authored
* allow !function filters * bugfix * nit * add `filter` to logged samples * add `filter` and `metric` to logged samples to identification * convert `metric` to `metrics`: list
-
- 26 Nov, 2024 1 commit
-
-
Rima Shahbazyan authored
* score readme added * generate until task's "until" parameter's default value fixed. * score mmlu-pro and agieval added * changed macro accuracy to micro for agieval * Always E removed from agi eval * redundancies removed * MATH added * minor cosmetic changes for math * Licenses added Readme updated * changes for flake8 + license header on math * Score added to readme and precommit was run. * Score added to readme and precommit was run. * Import error fixed * math task bugfix postprocess minor fix * CR for math added * math CR * math task bugfix postprocess minor fix CR for math added * Math cr fixed * reverting the default "until" parameter change and adjusting score task configs
-
- 22 Nov, 2024 1 commit
-
-
Baber Abbasi authored
-
- 20 Nov, 2024 1 commit
-
-
Baber Abbasi authored
* fix test task * dont call lm.chat_template each time
-
- 18 Nov, 2024 3 commits
-
-
Kozzy Voudouris authored
* Add metabench (Kipnis et al. 2024) * Update metabench tasks for full replication of original benchmarks, using publicly available datasets * Remove unnecessary import * Add permute versions of each task, where the answer orders are randomly shuffled. * Add metabench group for easier evaluations * Fix mmlu counts after removing duplicate * Add secondary datasets * Fix f-string error * Fix f-string error for permute processing * Add original hash to outputs for easy matching to original results * Add line break at end of utils files * Remove extra line from winogrande * Reformat for linters * fix multiple input test * appease pre-commit * Add metabench to tasks README * fix multiple input `test_doc_to_text` --------- Co-authored-by:Baber <baber@hey.com>
-
Baber Abbasi authored
-
Baber Abbasi authored
* add hf mamba to mamba_lm * fix _model_generate for hf
-
- 16 Nov, 2024 2 commits
-
-
Wonseok Hwang authored
* release kbl-v0.1 * fix linting * remove rag tasks as doc_to_text functions cause trouble * remove remaining rag tasks * remove unnecessary repeat in yaml files and rag dataset in hf-hub * remove unncessary newline; introduce cfg files in lbox/kbl in hf * Make task yaml files consistent to hf-datasets-config * Make task yaml files consistent to hf-datasets-config * Remove trailing empty space in doc-to-text * Remove unncessary yaml file * Fix task nameing error * trailing space removed
-
Baber Abbasi authored
* pre-commit update * update github actions * make logging less verbose * fix artifacts
-
- 15 Nov, 2024 2 commits
-
-
Oyvind Tafjord authored
-
Nikodem Szwast authored
* refactor code, fix config path bug * update types to be from typing lib * add pre-commit formatting * specify version of ibm_watsonx_ai package * adjust get_watsonx_credentials() function, add minor refactor to adress PR review comments * change missing installation hint from ibm_watsonx_ai to lm_eval[ibm_watsonx_ai]
-
- 12 Nov, 2024 1 commit
-
-
Alex Titterton authored
-
- 11 Nov, 2024 1 commit
-
-
Baber Abbasi authored
-