"test/vscode:/vscode.git/clone" did not exist on "0d2606bb60f6a9feb67a4a2a431ac89220e6b9e4"
- 26 Mar, 2024 1 commit
-
-
Sergio Perez authored
* Integration of NeMo models into LM Evaluation Harness library * rename nemo model as nemo_lm * move nemo section in readme after hf section * use self.eot_token_id in get_until() * improve progress bar showing loglikelihood requests * data replication or tensor/pipeline replication working fine within one node * run pre-commit on modified files * check whether dependencies are installed * clarify usage of torchrun in README
-
- 25 Mar, 2024 1 commit
-
-
Hailey Schoelkopf authored
-
- 15 Mar, 2024 1 commit
-
-
Eitan Turok authored
* Link to vllm integration * add pip install .[vllm] cmd
-
- 01 Mar, 2024 1 commit
-
-
Baber Abbasi authored
* make `WandbLogger` init args optional * nit * nit * nit * move import warning to `WandbLogger` * nit * update docs * nit
-
- 22 Feb, 2024 1 commit
-
-
Ayush Thakur authored
* add wandb as extra dependency * wandb metrics logging * refactor * log samples as tables * fix linter * refactor: put in a class * change dir * add panels * log eval as table * improve tables logging * improve reports logging * precommit run * ruff check * handle importing reports api gracefully * ruff * compare results * minor pre-commit fixes * build comparison report * ruff check * log results as artifacts * remove comparison script * update dependency * type annotate and docstring * add example * update readme * fix typo * teardown * handle outside wandb run * gracefully fail reports creation * precommit checks * add report url to summary * use wandb printer for better url stdout * fix ruff * handle N/A and groups * fix eval table * remove unused var * update wandb version req + disable reports stdout * remove reports feature to TODO * add label to multi-choice question data * log model predictions * lints * loglikelihood_rolling * log eval result for groups * log tables by group for better handling * precommit * choices column for multi-choice * graciously fail wandb * remove reports feature * track system metrics + total eval time + stdout --------- Co-authored-by:Lintang Sutawika <lintang@eleuther.ai>
-
- 06 Feb, 2024 2 commits
-
-
Michael Feil authored
* add hf_transfer * update dependencies * Delete stale `[linting]` extra * Update README.md with extras table --------- Co-authored-by:Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
-
Hailey Schoelkopf authored
-
- 05 Feb, 2024 1 commit
-
-
Michael Feil authored
* initial commit * remove overwrite bs * adding neuronx dependencies * Update README.md * update neuronx
-
- 01 Feb, 2024 1 commit
-
-
Hailey Schoelkopf authored
* Update CITATION.bib * Create CONTRIBUTING.md * add disclaimer re: multi node * flesh out some sections more * Flesh out contributor guide * revert CITATION.bib * appease pre-commit --------- Co-authored-by:lintangsutawika <lintang@eleuther.ai>
-
- 31 Jan, 2024 1 commit
-
-
Baber Abbasi authored
* add bypass metric * fixed `bypass` metric. * add task attributes if predict_only * add `predict_only` checks * add docs * added `overide_metric`, `override_config` to `Task` * nits * nit * changed --predict_only to generations; nits * nits * nits * change gen_kwargs warning * add note about `--predict_only` in README.md * added `predict_only` * move table to bottom * nit * change null aggregation to bypass (conflict) * bugfix; default `temp=0.0` * typo
-
- 26 Jan, 2024 1 commit
-
-
NoushNabi authored
* added intel optimum * added intel optimum in readme * modified intel optimum * modified intel optimum * modified intel optimum * modified install optimum * modified path of IR file * added openvino_device * added openvino_device2 * changed optimum-causal to openvino-causal * Update README.md * Update README.md * remove `lm_eval.base` import * update openvino-causal -> openvino ; pass device through super().__init__() * Update README.md * Add optimum to tests dependencies * apply pre-commit * fix so tests pass --------- Co-authored-by:
Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com> Co-authored-by:
haileyschoelkopf <hailey@eleuther.ai>
-
- 25 Jan, 2024 1 commit
-
-
Hailey Schoelkopf authored
* Update README.md * [!Tip]
-
- 23 Jan, 2024 1 commit
-
-
Hailey Schoelkopf authored
* don't use get_task_dict() as a helper, it will download the dataset! * pre-commit * Update README.md --------- Co-authored-by:lintangsutawika <lintang@eleuther.ai>
-
- 22 Jan, 2024 2 commits
-
-
Brian Vaughan authored
-
Michael Goin authored
* Add `local-completions` support using OpenAI interface * Refactor oa_completion * Address tokenizer comments and change request chunks to batch size * Add warning message for tiktoken backend * fix formatting * fix whitespace * Update README.md --------- Co-authored-by:Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
-
- 16 Jan, 2024 1 commit
-
-
Mark Saroufim authored
* Update README.md * punctuation --------- Co-authored-by:Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
-
- 15 Jan, 2024 2 commits
-
-
Stella Biderman authored
It looks like Google Scholar has [already noticed](https://scholar.google.com/scholar?hl=en&as_sdt=0%2C9&authuser=2&q=%22A+framework+for+few-shot+language+model+evaluation%2C+12+2023%22&btnG=) the updated citation block so let's add it back in.
-
Hailey Schoelkopf authored
* Make parallelize=True distinction clearer in documentation. * run linter
-
- 11 Jan, 2024 1 commit
-
-
Stella Biderman authored
-
- 08 Jan, 2024 1 commit
-
-
Stella Biderman authored
Over a dozen papers have used the updated citation block, but Google Scholar has noticed none of them. Since it does understand this citation, I think we should use it going forward until we have a way to ensure the newer citations are actually logged.
-
- 30 Dec, 2023 1 commit
-
-
Anjor Kanekar authored
-
- 23 Dec, 2023 1 commit
-
-
Hailey Schoelkopf authored
-
- 22 Dec, 2023 2 commits
-
-
Hailey Schoelkopf authored
* modularize HFLM code * pass through extra kwargs to AutoModel.from_pretrained call * remove explicit model_kwargs * rename gptq -> autogptq * fix tokenizer pad token errors * ensure model always respects device_map and autogptq's selected devices * add a _get_config helper fn * add mambaLMWrapper * add mamba extra * add mamba extra * fix conditional import * Fix botched merge commit * Remove beginning-of-file comment for consistency * Add docstring for mambaLM re: supported kwargs * Alphabetize extras * Update extras table * appease precommit * run precommit on mamba_lm
-
Bram Vanroy authored
-
- 21 Dec, 2023 3 commits
-
-
Anjor Kanekar authored
* Update README.md Add a not about running on apple arm gpus * Update README.md * Update README.md --------- Co-authored-by:Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
-
Alex Bäuerle authored
-
Anjor Kanekar authored
-
- 20 Dec, 2023 3 commits
-
-
Vicki Boykis authored
* LocalChatCompletionsLM add * clean up completions class * clean up completions class * update tokens * README * fix constructor * eos token * folding local-chat-completions into OpenAIChatCompletions * refactoring to include gen_kwargs as passable option * add todo on chat completion kwarg validation * Ruff and README fix * generalize to **kwargs * remove unnecessary kwargs * README and remove kwargs * README
-
Baber Abbasi authored
* add ruff and isort. remove black and flake8 * remove unnecessary dependencies * remove dependency from table * change order * ran ruff * check 3.9 * exclude evaluator * update CI workflow * use ruff config in pyproject.toml * test * add isort rules to ruff * sort imports * import `make_table` * try stages for no-commit-to-branch * turn on mypy for pre-commit * test * test * test * change no-commit-to-branch to default * nits * fixed dependency
-
Alex Bäuerle authored
* feat: add option to upload results to Zeno * config-based upload supporting different task types and metrics * upload tasks as individual projects * wording * readme * add example notebook * Update documentation for Zeno integration * Make zeno deps an extra * Update README.md * Document extra deps installation * Update zeno_visualize.py * fix: balance parens * fix typo * fix merge commit I botched * Update zeno_visualize.py * Update logger warning stmt * fix whitespace --------- Co-authored-by:Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
-
- 18 Dec, 2023 2 commits
-
-
Vicki Boykis authored
* remove gooseAI * Modify preconfig to specify commit branch * precommit * remove openai alias for completions
-
Baber Abbasi authored
* set `--gen_kwargs` to None + add help to CLI * add logging metavar * fix verbosity help messages * Reorder severity levels.
-
- 15 Dec, 2023 2 commits
-
-
Lenni Justen authored
-
Lenni Justen authored
-
- 13 Dec, 2023 1 commit
-
-
Baber Abbasi authored
* unpack group; add output_path to arg * Add `vllm` to overview
-
- 12 Dec, 2023 2 commits
-
-
Hailey Schoelkopf authored
-
Hailey Schoelkopf authored
-
- 07 Dec, 2023 1 commit
-
-
lintangsutawika authored
-
- 04 Dec, 2023 2 commits
-
-
haileyschoelkopf authored
-
baberabb authored
-