- 21 Dec, 2023 1 commit
-
-
Anjor Kanekar authored
-
- 20 Dec, 2023 4 commits
-
-
Vicki Boykis authored
* LocalChatCompletionsLM add * clean up completions class * clean up completions class * update tokens * README * fix constructor * eos token * folding local-chat-completions into OpenAIChatCompletions * refactoring to include gen_kwargs as passable option * add todo on chat completion kwarg validation * Ruff and README fix * generalize to **kwargs * remove unnecessary kwargs * README and remove kwargs * README
-
GUIJIN SON authored
* update kmmlu default formatting * Update _default_kmmlu_yaml * Delete lm_eval/tasks/kmmlu/utils.py
-
Baber Abbasi authored
* add ruff and isort. remove black and flake8 * remove unnecessary dependencies * remove dependency from table * change order * ran ruff * check 3.9 * exclude evaluator * update CI workflow * use ruff config in pyproject.toml * test * add isort rules to ruff * sort imports * import `make_table` * try stages for no-commit-to-branch * turn on mypy for pre-commit * test * test * test * change no-commit-to-branch to default * nits * fixed dependency
-
Alex Bäuerle authored
* feat: add option to upload results to Zeno * config-based upload supporting different task types and metrics * upload tasks as individual projects * wording * readme * add example notebook * Update documentation for Zeno integration * Make zeno deps an extra * Update README.md * Document extra deps installation * Update zeno_visualize.py * fix: balance parens * fix typo * fix merge commit I botched * Update zeno_visualize.py * Update logger warning stmt * fix whitespace --------- Co-authored-by:Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
-
- 19 Dec, 2023 5 commits
-
-
seungduk.kim.2304 authored
* Correct column names and dataset names * Remove kmmlu_general_physics.yaml and kmmlu_korean_language.yaml * Update _default_kmmlu_yaml --------- Co-authored-by:Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
-
Pasquale Minervini authored
* self.device in huggingface.py line 210 In huggingface.py line 210, self.device is str and does not have a "type" attribute * Update huggingface.py This handles both the case where `self.device` is a `torch.device` and a string * Update huggingface.py --------- Co-authored-by:Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
-
Hailey Schoelkopf authored
-
Lintang Sutawika authored
* save progress * fixed issue with table only showing 1 group * store aliases directly in results_agg * removed unused parts
-
Paul McCann authored
Co-authored-by:Paul O'Leary McCann <polm@dampfkraft.com>
-
- 18 Dec, 2023 5 commits
-
-
Stella Biderman authored
-
Vicki Boykis authored
* remove gooseAI * Modify preconfig to specify commit branch * precommit * remove openai alias for completions
-
Baber Abbasi authored
-
Baber Abbasi authored
-
Baber Abbasi authored
* set `--gen_kwargs` to None + add help to CLI * add logging metavar * fix verbosity help messages * Reorder severity levels.
-
- 17 Dec, 2023 1 commit
-
-
Wis Kojohnjaratkul authored
* Add IFEval task * Check and download nltk punkt if not already downloaded * Update gen_max_toks to 2048 to support "900 words+" instructions * Resolve pre-commit linting issues * Reduce max_gen_toks to 1280 to conserve token usage * Add warning message in `process_results` call for non chat-finetuned models
-
- 16 Dec, 2023 2 commits
-
-
Baber Abbasi authored
* fixed syntactic nits * fix temperature and seed * fix logprobs * fixup merge
-
Baber Abbasi authored
-
- 15 Dec, 2023 8 commits
-
-
Vicki Boykis authored
* enabling OpenAI completions via gooseai * openai-completions and pin openai
-
Baber Abbasi authored
-
Hailey Schoelkopf authored
* add ignoring of no-commit-to-branch * fix method of skipping pre-commit step
-
Lenni Justen authored
-
Lenni Justen authored
-
MorishT authored
* [fix] loading dataset from hub fails when the dataset name includes '.', as the program assumes it is on the local filesystem * add FLD benchmark * Update task.py * [update] add group 'fld' * [update] rename fld -> fld_default. add explanation to the readme * Update README.md --------- Co-authored-by:Lintang Sutawika <lintang@sutawika.com>
-
Baber Abbasi authored
-
Lintang Sutawika authored
-
- 14 Dec, 2023 7 commits
-
-
NanoCode012 authored
* fix: passing max_length to vllm engine args * feat: add `max_model_len` * chore: lint
-
Yuliang Li authored
-
Lintang Sutawika authored
* doc_to_decontamination_query can use function * add option for doc_to_decontamination_query to follow doc_to_text * added documentation for doc_to_decontamination_query * adjust description * format
-
Lintang Sutawika authored
* Additional process for doc_to_choice * doc_to_choice can also parse a string
-
Hailey Schoelkopf authored
* modularize HFLM code * pass through extra kwargs to AutoModel.from_pretrained call * remove explicit model_kwargs * rename gptq -> autogptq * fix tokenizer pad token errors * ensure model always respects device_map and autogptq's selected devices * add a _get_config helper fn
-
Lintang Sutawika authored
fix: bug of BBH_cot_fewshot
-
momotori authored
-
- 13 Dec, 2023 7 commits
-
-
Baber Abbasi authored
* remove unlabled test sets * add note to readme
-
haileyschoelkopf authored
-
haileyschoelkopf authored
-
Lintang Sutawika authored
-
momotori authored
-
Baber Abbasi authored
* unpack group; add output_path to arg * Add `vllm` to overview
-
momotori authored
-