Commits · 9ae96cdfd816ded3c7c10f625f56aadfce708bdf · gaoqiong / lm-evaluation-harness

"test/vscode:/vscode.git/clone" did not exist on "0d2606bb60f6a9feb67a4a2a431ac89220e6b9e4"

26 Mar, 2024 1 commit

Integration of NeMo models into LM Evaluation Harness library (#1598) · e9d429e1

Sergio Perez authored Mar 26, 2024

* Integration of NeMo models into LM Evaluation Harness library

* rename nemo model as nemo_lm

* move nemo section in readme after hf section

* use self.eot_token_id in get_until()

* improve progress bar showing loglikelihood requests

* data replication or tensor/pipeline replication working fine within one node

* run pre-commit on modified files

* check whether dependencies are installed

* clarify usage of torchrun in README

e9d429e1

25 Mar, 2024 1 commit
- Add vLLM FAQs to README (#1625) (#1633) · a97fde23
  Hailey Schoelkopf authored Mar 25, 2024
  
  a97fde23
15 Mar, 2024 1 commit
- Fix README section on vllm integration (#1579) · 7d9922c8
  Eitan Turok authored Mar 15, 2024
```
* Link to vllm integration

* add pip install .[vllm] cmd
```
  7d9922c8
01 Mar, 2024 1 commit

modify `WandbLogger` to accept arbitrary kwargs (#1491) · ae79b121

Baber Abbasi authored Mar 01, 2024

* make `WandbLogger` init args optional

* nit

* nit

* nit

* move import warning to `WandbLogger`

* nit

* update docs

* nit

ae79b121

22 Feb, 2024 1 commit

feat: Add Weights and Biases support (#1339) · 2683fbbb

Ayush Thakur authored Feb 23, 2024



* add wandb as extra dependency

* wandb metrics logging

* refactor

* log samples as tables

* fix linter

* refactor: put in a class

* change dir

* add panels

* log eval as table

* improve tables logging

* improve reports logging

* precommit run

* ruff check

* handle importing reports api gracefully

* ruff

* compare results

* minor pre-commit fixes

* build comparison report

* ruff check

* log results as artifacts

* remove comparison script

* update dependency

* type annotate and docstring

* add example

* update readme

* fix typo

* teardown

* handle outside wandb run

* gracefully fail reports creation

* precommit checks

* add report url to summary

* use wandb  printer for better url stdout

* fix ruff

* handle N/A and groups

* fix eval table

* remove unused var

* update wandb version req + disable reports stdout

* remove reports feature to TODO

* add label to multi-choice question data

* log model predictions

* lints

* loglikelihood_rolling

* log eval result for groups

* log tables by group for better handling

* precommit

* choices column for multi-choice

* graciously fail wandb

* remove reports feature

* track system metrics + total eval time + stdout

---------
Co-authored-by: Lintang Sutawika <lintang@eleuther.ai>

2683fbbb

06 Feb, 2024 2 commits

adding hf_transfer (#1400) · 756eeb6f

Michael Feil authored Feb 06, 2024



* add hf_transfer

* update dependencies

* Delete stale `[linting]` extra

* Update README.md with extras table

---------
Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>

756eeb6f

Fix confusing `write_out.py` instructions in README (#1371) · df01adf6
Hailey Schoelkopf authored Feb 06, 2024

df01adf6

05 Feb, 2024 1 commit

Support for Inf2 optimum class [WIP] (#1364) · d17dcea0

Michael Feil authored Feb 05, 2024

* initial commit

* remove overwrite bs

* adding neuronx dependencies

* Update README.md

* update neuronx

d17dcea0

01 Feb, 2024 1 commit

Expand docs, update CITATION.bib (#1227) · f5408b6b

Hailey Schoelkopf authored Feb 01, 2024



* Update CITATION.bib

* Create CONTRIBUTING.md

* add disclaimer re: multi node

* flesh out some sections more

* Flesh out contributor guide

* revert CITATION.bib

* appease pre-commit

---------
Co-authored-by: lintangsutawika <lintang@eleuther.ai>

f5408b6b

31 Jan, 2024 1 commit

add bypass metric (#1156) · f8203de1

Baber Abbasi authored Feb 01, 2024

* add bypass metric

* fixed `bypass` metric.

* add task attributes if predict_only

* add `predict_only` checks

* add docs

* added `overide_metric`, `override_config` to `Task`

* nits

* nit

* changed --predict_only to generations; nits

* nits

* nits

* change gen_kwargs warning

* add note about `--predict_only` in README.md

* added `predict_only`

* move table to bottom

* nit

* change null aggregation to bypass (conflict)

* bugfix; default `temp=0.0`

* typo

f8203de1

26 Jan, 2024 1 commit

Add causalLM OpenVino models (#1290) · 97a67d27

NoushNabi authored Jan 26, 2024



* added intel optimum

* added intel optimum in readme

* modified intel optimum

* modified intel optimum

* modified intel optimum

* modified install optimum

* modified path of IR file

* added openvino_device

* added openvino_device2

* changed optimum-causal to openvino-causal

* Update README.md

* Update README.md

* remove `lm_eval.base` import

* update openvino-causal -> openvino ; pass device through super().__init__()

* Update README.md

* Add optimum to tests dependencies

* apply pre-commit

* fix so tests pass

---------
Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
Co-authored-by: haileyschoelkopf <hailey@eleuther.ai>

97a67d27

25 Jan, 2024 1 commit
- Add FAQ on `lm_eval.tasks.initialize_tasks()` to README (#1330) · 52f48e8c
  Hailey Schoelkopf authored Jan 25, 2024
```
* Update README.md

* [!Tip]
```
  52f48e8c
23 Jan, 2024 1 commit

Don't use `get_task_dict()` in task registration / initialization (#1331) · 969b48bf

Hailey Schoelkopf authored Jan 23, 2024



* don't use get_task_dict() as a helper, it will download the dataset!

* pre-commit

* Update README.md

---------
Co-authored-by: lintangsutawika <lintang@eleuther.ai>

969b48bf

22 Jan, 2024 2 commits

fix a trailing whitespace that breaks a lint job (#1335) · 84357a46
Brian Vaughan authored Jan 22, 2024

84357a46

Add `local-completions` support using OpenAI interface (#1277) · 5c25dd55

Michael Goin authored Jan 22, 2024



* Add `local-completions` support using OpenAI interface

* Refactor oa_completion

* Address tokenizer comments and change request chunks to batch size

* Add warning message for tiktoken backend

* fix formatting

* fix whitespace

* Update README.md

---------
Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>

5c25dd55

16 Jan, 2024 1 commit

Update README.md with custom integration doc (#1298) · ada4a31d

Mark Saroufim authored Jan 16, 2024



* Update README.md

* punctuation

---------
Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>

ada4a31d

15 Jan, 2024 2 commits

Re-add citation · 39a465ca

Stella Biderman authored Jan 15, 2024

It looks like Google Scholar has [already noticed](https://scholar.google.com/scholar?hl=en&as_sdt=0%2C9&authuser=2&q=%22A+framework+for+few-shot+language+model+evaluation%2C+12+2023%22&btnG=) the updated citation block so let's add it back in.

39a465ca

Make `parallelize=True` vs. `accelerate launch` distinction clearer in docs (#1261) · 39e7b264
Hailey Schoelkopf authored Jan 15, 2024
```
* Make parallelize=True distinction clearer in documentation.

* run linter
```
39e7b264

11 Jan, 2024 1 commit
- Update README.md · eed2d3a6
  Stella Biderman authored Jan 11, 2024
  
  eed2d3a6
08 Jan, 2024 1 commit

Revert citation (#1257) · ecb1df28

Stella Biderman authored Jan 08, 2024

Over a dozen papers have used the updated citation block, but Google Scholar has noticed none of them. Since it does understand this citation, I think we should use it going forward until we have a way to ensure the newer citations are actually logged.

ecb1df28

30 Dec, 2023 1 commit
- Update README.md (#1195) · 1229862a
  Anjor Kanekar authored Dec 30, 2023
  
  1229862a
23 Dec, 2023 1 commit
- Fix documentation in API table (#1203) · b12bb1d4
  Hailey Schoelkopf authored Dec 23, 2023
  
  b12bb1d4
22 Dec, 2023 2 commits

Upstream Mamba Support (`mamba_ssm`) (#1110) · 5503b274

Hailey Schoelkopf authored Dec 22, 2023

* modularize HFLM code

* pass through extra kwargs to AutoModel.from_pretrained call

* remove explicit model_kwargs

* rename gptq -> autogptq

* fix tokenizer pad token errors

* ensure model always respects device_map and autogptq's selected devices

* add a _get_config helper fn

* add mambaLMWrapper

* add mamba extra

* add mamba extra

* fix conditional import

* Fix botched merge commit

* Remove beginning-of-file comment for consistency

* Add docstring for mambaLM re: supported kwargs

* Alphabetize extras

* Update extras table

* appease precommit

* run precommit on mamba_lm

5503b274

Refer in README to main branch (#1200) · 25cefbc1
Bram Vanroy authored Dec 22, 2023

25cefbc1

21 Dec, 2023 3 commits
- Update README.md (#1181) · 9267354e
  Anjor Kanekar authored Dec 21, 2023
```
* Update README.md

Add a not about running on apple arm gpus

* Update README.md

* Update README.md

---------
Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
```
  9267354e
- update Zeno example and reference in README (#1190) · 84790e99
  Alex Bäuerle authored Dec 21, 2023
  
  84790e99
- Update README.md (#1184) · e548d94d
  Anjor Kanekar authored Dec 21, 2023
  
  e548d94d
20 Dec, 2023 3 commits

Implementing local OpenAI API-style chat completions on any given inference server (#1174) · fcfc0c60

Vicki Boykis authored Dec 20, 2023

* LocalChatCompletionsLM add

* clean up completions class

* clean up completions class

* update tokens

* README

* fix constructor

* eos token

* folding local-chat-completions into OpenAIChatCompletions

* refactoring to include gen_kwargs as passable option

* add todo on chat completion kwarg validation

* Ruff and README fix

* generalize to **kwargs

* remove unnecessary kwargs

* README and remove kwargs

* README

fcfc0c60

Switch Linting to `ruff` (#1166) · 65b8761d

Baber Abbasi authored Dec 20, 2023

* add ruff and isort. remove black and flake8

* remove unnecessary dependencies

* remove dependency from table

* change order

* ran ruff

* check 3.9

* exclude evaluator

* update CI workflow

* use ruff config in pyproject.toml

* test

* add isort rules to ruff

* sort imports

* import `make_table`

* try stages for no-commit-to-branch

* turn on mypy for pre-commit

* test

* test

* test

* change no-commit-to-branch to default

* nits

* fixed dependency

65b8761d

feat: add option to upload results to Zeno (#990) · 21d4ae98

Alex Bäuerle authored Dec 20, 2023



* feat: add option to upload results to Zeno

* config-based upload supporting different task types and metrics

* upload tasks as individual projects

* wording

* readme

* add example notebook

* Update documentation for Zeno integration

* Make zeno deps an extra

* Update README.md

* Document extra deps installation

* Update zeno_visualize.py

* fix: balance parens

* fix typo

* fix merge commit I botched

* Update zeno_visualize.py

* Update logger warning stmt

* fix whitespace

---------
Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>

21d4ae98

18 Dec, 2023 2 commits

Remove GooseAI docs and change no-commit-to-branch precommit hook (#1154) · c2fad099
Vicki Boykis authored Dec 18, 2023
```
* remove gooseAI

* Modify preconfig to specify commit branch

* precommit

* remove openai alias for completions
```
c2fad099

set `--gen_kwargs` arg to None (#1145) · 08fcf1fe

Baber Abbasi authored Dec 18, 2023

* set `--gen_kwargs` to None + add help to CLI

* add logging metavar

* fix verbosity help messages

* Reorder severity levels.

08fcf1fe

15 Dec, 2023 2 commits
- add correct openai api key to README.md (#1138) · e65e5bbd
  Lenni Justen authored Dec 15, 2023
  
  e65e5bbd
- fix typo in README.md (#1136) · 38c36613
  Lenni Justen authored Dec 15, 2023
  
  38c36613
13 Dec, 2023 1 commit
- Unpack group in `write_out` (#1113) · 72e583d5
  Baber Abbasi authored Dec 13, 2023
```
* unpack group; add output_path to arg

* Add `vllm` to overview
```
  72e583d5
12 Dec, 2023 2 commits
- Update README.md · 86311c23
  Hailey Schoelkopf authored Dec 12, 2023
  
  86311c23
- Describe model_comparator.py in readme · de2a60e3
  Hailey Schoelkopf authored Dec 12, 2023
  
  de2a60e3
07 Dec, 2023 1 commit
- formatting · 965c5330
  lintangsutawika authored Dec 07, 2023
  
  965c5330
04 Dec, 2023 2 commits
- update README.md · f721c0f0
  haileyschoelkopf authored Dec 04, 2023
  
  f721c0f0
- typo · 19f745aa
  baberabb authored Dec 04, 2023
  
  19f745aa