- 05 Aug, 2024 6 commits
-
-
lintangsutawika authored
-
lintangsutawika authored
-
lintangsutawika authored
-
lintangsutawika authored
-
lintangsutawika authored
-
lintangsutawika authored
-
- 27 Jun, 2024 3 commits
-
-
lintangsutawika authored
-
lintangsutawika authored
-
lintangsutawika authored
-
- 26 Jun, 2024 1 commit
-
-
lintangsutawika authored
-
- 24 Jun, 2024 5 commits
-
-
Yu Shi Jie authored
sync with remote
-
Yu Shi Jie authored
-
Yu Shi Jie authored
-
Yu Shi Jie authored
-
Yu Shi Jie authored
-
- 20 Jun, 2024 1 commit
-
-
Julen Etxaniz authored
* add bertaqa tasks * rename basquetrivia-->bertaqa ; make template stub not .yaml * add bertaqa entry to lm_eval/tasks/README.md --------- Co-authored-by:haileyschoelkopf <hailey@eleuther.ai>
-
- 19 Jun, 2024 5 commits
-
-
Hailey Schoelkopf authored
-
Yazeed Alnumay authored
* Added ArabicMMLU * Rename `ammlu` to `arabicmmlu`
-
Hailey Schoelkopf authored
* log fewshot_as_multiturn in general tracker args * Update evaluator.py --------- Co-authored-by:Lintang Sutawika <lintang@eleuther.ai>
-
Hailey Schoelkopf authored
* init paloma benchmark * pre-process in utils function * add `task_alias` * updated task aliases * Update paloma_dolma-v1_5.yaml * Update paloma_twitterAAE_HELM_fixed.yaml * Update paloma_dolma_100_programing_languages.yaml * update on names * fix paloma template issue --------- Co-authored-by:
Zafir Stojanovski <zaf.stojano@gmail.com> Co-authored-by:
Zafir Stojanovski <zafir.stojanovski@icloud.com> Co-authored-by:
Lintang Sutawika <lintang@eleuther.ai>
-
Zafir Stojanovski authored
* init paloma benchmark * pre-process in utils function * add `task_alias` * updated task aliases * Update paloma_dolma-v1_5.yaml * Update paloma_twitterAAE_HELM_fixed.yaml * Update paloma_dolma_100_programing_languages.yaml --------- Co-authored-by:Lintang Sutawika <lintang@eleuther.ai>
-
- 18 Jun, 2024 2 commits
-
-
LSinev authored
-
Wang, Chang authored
Signed-off-by:changwangss <chang1.wang@intel.com>
-
- 14 Jun, 2024 1 commit
-
-
Yu Shi Jie authored
-
- 13 Jun, 2024 7 commits
-
-
johnwee1 authored
* fix: add filter to os.walk to ignore 'ipynb_checkpoints * Update __init__.py * Update __init__.py --------- Co-authored-by:Lintang Sutawika <lintang@eleuther.ai>
-
Hailey Schoelkopf authored
Co-authored-by:lintangsutawika <lintang@eleuther.ai>
-
Yu Shi Jie authored
Resolve conflict.
-
Hailey Schoelkopf authored
* Update vllm_causallms.py * adjust --------- Co-authored-by:lintangsutawika <lintang@eleuther.ai>
-
Yu Shi Jie authored
-
Baber Abbasi authored
* `samples` is newline delimited * updated git and pre-commit * appease pre-commit * nit * Revert back for now * Revert for now --------- Co-authored-by:Lintang Sutawika <lintang@eleuther.ai>
-
Yu Shi Jie authored
-
- 12 Jun, 2024 2 commits
-
-
Nikita Lozhnikov authored
Fix bug where `self.max_tokens` was not set
-
Sadra Barikbin authored
-
- 11 Jun, 2024 4 commits
-
-
Hailey Schoelkopf authored
-
Hailey Schoelkopf authored
-
Hailey Schoelkopf authored
* Update README.md * Delete lm_eval/tasks/ammlu directory
-
KonradSzafer authored
* results filenames handling moved to utils * zeno results handling fix * tasks_for_model backward compatibility * results files logic moved to tasks_for_model * moved sanitize_model_name to utils
-
- 10 Jun, 2024 1 commit
-
-
khalil authored
-
- 09 Jun, 2024 1 commit
-
-
Sadra Barikbin authored
-
- 07 Jun, 2024 1 commit
-
-
Zafir Stojanovski authored
* sort metrics in output table * update docstring in `consolidate_results` * add tests for verifying consistency of table output * update tests to account for floating point inconsistencies * updated tests based on `pythia-14m`
-