- 28 Jun, 2024 3 commits
-
-
Nathan Habib authored
-
Nathan Habib authored
-
Nathan Habib authored
-
- 27 Jun, 2024 10 commits
-
-
Nathan Habib authored
-
Nathan Habib authored
-
Nathan Habib authored
-
Nathan Habib authored
-
Nathan Habib authored
-
Nathan Habib authored
-
Nathan Habib authored
-
Nathan Habib authored
-
Nathan Habib authored
-
Nathan Habib authored
-
- 26 Jun, 2024 4 commits
-
-
Hailey Schoelkopf authored
* make MMLU trust remote code to fix tests
* remove trust remote code
-
Nathan Habib authored
-
Nathan Habib authored
This reverts commit d859d1ca.
-
Nathan Habib authored
-
- 25 Jun, 2024 5 commits
-
-
johnwee1 authored
* Update interface.md update interface to remove link to really outdated commit of evaluator.py
* switch to relative referencing?
* Update interface.md
---------
Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
-
Hailey Schoelkopf authored
* separate out optimum/neuralmagic tests to separate job
* fix vllm tests
* fix bug in --trust_remote_code
* use datasets.config instead intentionally
* fix remote code issue?
-
Brendan Murphy authored
* Initial configuration
* Using the validation set for the test set, because the test set on HF doesn't have labels
* Probably just makes more sense to have validation be validation
* fix format ; add docs to tasks/README.md
* fix format
---------
Co-authored-by: haileyschoelkopf <hailey@eleuther.ai>
-
Baber Abbasi authored
* refactored `lm.apply_chat_template`
* nit
* fix weird type error
* fixed!
* skip failing test
* pre-commit run all
* add type hints
* nit
* nit
* fixup
-
jonabur authored
* add arc_challenge_mt
* add README
* add icelandic
-
- 24 Jun, 2024 2 commits
-
-
Stella Biderman authored
-
achervyakov authored
* add tokenizer logs info
* add no tokenizer case
* Update lm_eval/logging_utils.py
  Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
* Update lm_eval/logging_utils.py
  Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
* add updates
* fix conflict
---------
Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
-
- 20 Jun, 2024 1 commit
-
-
Julen Etxaniz authored
* add bertaqa tasks
* rename basquetrivia-->bertaqa ; make template stub not .yaml
* add bertaqa entry to lm_eval/tasks/README.md
---------
Co-authored-by: haileyschoelkopf <hailey@eleuther.ai>
-
- 19 Jun, 2024 5 commits
-
-
Hailey Schoelkopf authored
-
Yazeed Alnumay authored
* Added ArabicMMLU
* Rename `ammlu` to `arabicmmlu`
-
Hailey Schoelkopf authored
* log fewshot_as_multiturn in general tracker args
* Update evaluator.py
---------
Co-authored-by: Lintang Sutawika <lintang@eleuther.ai>
-
Hailey Schoelkopf authored
* init paloma benchmark
* pre-process in utils function
* add `task_alias`
* updated task aliases
* Update paloma_dolma-v1_5.yaml
* Update paloma_twitterAAE_HELM_fixed.yaml
* Update paloma_dolma_100_programing_languages.yaml
* update on names
* fix paloma template issue
---------
Co-authored-by: Zafir Stojanovski <zaf.stojano@gmail.com>
Co-authored-by: Zafir Stojanovski <zafir.stojanovski@icloud.com>
Co-authored-by: Lintang Sutawika <lintang@eleuther.ai>
-
Zafir Stojanovski authored
* init paloma benchmark
* pre-process in utils function
* add `task_alias`
* updated task aliases
* Update paloma_dolma-v1_5.yaml
* Update paloma_twitterAAE_HELM_fixed.yaml
* Update paloma_dolma_100_programing_languages.yaml
---------
Co-authored-by: Lintang Sutawika <lintang@eleuther.ai>
-
- 18 Jun, 2024 2 commits
-
-
LSinev authored
-
Wang, Chang authored
Signed-off-by: changwangss <chang1.wang@intel.com>
-
- 13 Jun, 2024 4 commits
-
-
johnwee1 authored
* fix: add filter to os.walk to ignore 'ipynb_checkpoints'
* Update __init__.py
* Update __init__.py
---------
Co-authored-by: Lintang Sutawika <lintang@eleuther.ai>
-
Hailey Schoelkopf authored
Co-authored-by: lintangsutawika <lintang@eleuther.ai>
-
Hailey Schoelkopf authored
* Update vllm_causallms.py
* adjust
---------
Co-authored-by: lintangsutawika <lintang@eleuther.ai>
-
Baber Abbasi authored
* `samples` is newline delimited
* updated git and pre-commit
* appease pre-commit
* nit
* Revert back for now
* Revert for now
---------
Co-authored-by: Lintang Sutawika <lintang@eleuther.ai>
-
- 12 Jun, 2024 2 commits
-
-
Nikita Lozhnikov authored
Fix bug where `self.max_tokens` was not set
-
Sadra Barikbin authored
-
- 11 Jun, 2024 2 commits
-
-
Hailey Schoelkopf authored
-
Hailey Schoelkopf authored
-