"test/vscode:/vscode.git/clone" did not exist on "0d2606bb60f6a9feb67a4a2a431ac89220e6b9e4"
  1. 26 Mar, 2024 1 commit
    • Integration of NeMo models into LM Evaluation Harness library (#1598) · e9d429e1
      Sergio Perez authored
      * Integration of NeMo models into LM Evaluation Harness library
      
      * rename nemo model as nemo_lm
      
      * move nemo section in readme after hf section
      
      * use self.eot_token_id in get_until()
      
      * improve progress bar showing loglikelihood requests
      
      * data replication or tensor/pipeline replication working fine within one node
      
      * run pre-commit on modified files
      
      * check whether dependencies are installed
      
      * clarify usage of torchrun in README
  2. 25 Mar, 2024 1 commit
  3. 15 Mar, 2024 1 commit
  4. 01 Mar, 2024 1 commit
  5. 22 Feb, 2024 1 commit
    • feat: Add Weights and Biases support (#1339) · 2683fbbb
      Ayush Thakur authored
      
      * add wandb as extra dependency
      
      * wandb metrics logging
      
      * refactor
      
      * log samples as tables
      
      * fix linter
      
      * refactor: put in a class
      
      * change dir
      
      * add panels
      
      * log eval as table
      
      * improve tables logging
      
      * improve reports logging
      
      * precommit run
      
      * ruff check
      
      * handle importing reports api gracefully
      
      * ruff
      
      * compare results
      
      * minor pre-commit fixes
      
      * build comparison report
      
      * ruff check
      
      * log results as artifacts
      
      * remove comparison script
      
      * update dependency
      
      * type annotate and docstring
      
      * add example
      
      * update readme
      
      * fix typo
      
      * teardown
      
      * handle outside wandb run
      
      * gracefully fail reports creation
      
      * precommit checks
      
      * add report url to summary
      
      * use wandb printer for better URL stdout
      
      * fix ruff
      
      * handle N/A and groups
      
      * fix eval table
      
      * remove unused var
      
      * update wandb version req + disable reports stdout
      
      * remove reports feature to TODO
      
      * add label to multi-choice question data
      
      * log model predictions
      
      * lints
      
      * loglikelihood_rolling
      
      * log eval result for groups
      
      * log tables by group for better handling
      
      * precommit
      
      * choices column for multi-choice
      
      * gracefully fail wandb
      
      * remove reports feature
      
      * track system metrics + total eval time + stdout
      
      ---------
      Co-authored-by: Lintang Sutawika <lintang@eleuther.ai>
  6. 06 Feb, 2024 2 commits
  7. 05 Feb, 2024 1 commit
  8. 01 Feb, 2024 1 commit
  9. 31 Jan, 2024 1 commit
    • add bypass metric (#1156) · f8203de1
      Baber Abbasi authored
      * add bypass metric
      
      * fixed `bypass` metric.
      
      * add task attributes if predict_only
      
      * add `predict_only` checks
      
      * add docs
      
      * added `override_metric`, `override_config` to `Task`
      
      * nits
      
      * nit
      
      * changed --predict_only to generations; nits
      
      * nits
      
      * nits
      
      * change gen_kwargs warning
      
      * add note about `--predict_only` in README.md
      
      * added `predict_only`
      
      * move table to bottom
      
      * nit
      
      * change null aggregation to bypass (conflict)
      
      * bugfix; default `temp=0.0`
      
      * typo
  10. 26 Jan, 2024 1 commit
    • Add causalLM OpenVino models (#1290) · 97a67d27
      NoushNabi authored
      
      * added intel optimum
      
      * added intel optimum in readme
      
      * modified intel optimum
      
      * modified intel optimum
      
      * modified intel optimum
      
      * modified install optimum
      
      * modified path of IR file
      
      * added openvino_device
      
      * added openvino_device2
      
      * changed optimum-causal to openvino-causal
      
      * Update README.md
      
      * Update README.md
      
      * remove `lm_eval.base` import
      
      * update openvino-causal -> openvino ; pass device through super().__init__()
      
      * Update README.md
      
      * Add optimum to tests dependencies
      
      * apply pre-commit
      
      * fix so tests pass
      
      ---------
      Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
      Co-authored-by: haileyschoelkopf <hailey@eleuther.ai>
  11. 25 Jan, 2024 1 commit
  12. 23 Jan, 2024 1 commit
  13. 22 Jan, 2024 2 commits
  14. 16 Jan, 2024 1 commit
  15. 15 Jan, 2024 2 commits
  16. 11 Jan, 2024 1 commit
  17. 08 Jan, 2024 1 commit
    • Revert citation (#1257) · ecb1df28
      Stella Biderman authored
      Over a dozen papers have used the updated citation block, but Google Scholar has picked up none of them. Since it does recognize this citation, I think we should use it going forward until we have a way to ensure the newer citations are actually logged.
  18. 30 Dec, 2023 1 commit
  19. 23 Dec, 2023 1 commit
  20. 22 Dec, 2023 2 commits
    • Upstream Mamba Support (`mamba_ssm`) (#1110) · 5503b274
      Hailey Schoelkopf authored
      * modularize HFLM code
      
      * pass through extra kwargs to AutoModel.from_pretrained call
      
      * remove explicit model_kwargs
      
      * rename gptq -> autogptq
      
      * fix tokenizer pad token errors
      
      * ensure model always respects device_map and autogptq's selected devices
      
      * add a _get_config helper fn
      
      * add mambaLMWrapper
      
      * add mamba extra
      
      * add mamba extra
      
      * fix conditional import
      
      * Fix botched merge commit
      
      * Remove beginning-of-file comment for consistency
      
      * Add docstring for mambaLM re: supported kwargs
      
      * Alphabetize extras
      
      * Update extras table
      
      * appease precommit
      
      * run precommit on mamba_lm
    • Refer in README to main branch (#1200) · 25cefbc1
      Bram Vanroy authored
  21. 21 Dec, 2023 3 commits
  22. 20 Dec, 2023 3 commits
    • Implementing local OpenAI API-style chat completions on any given inference server (#1174) · fcfc0c60
      Vicki Boykis authored
      * LocalChatCompletionsLM add
      
      * clean up completions class
      
      * clean up completions class
      
      * update tokens
      
      * README
      
      * fix constructor
      
      * eos token
      
      * folding local-chat-completions into OpenAIChatCompletions
      
      * refactoring to include gen_kwargs as passable option
      
      * add todo on chat completion kwarg validation
      
      * Ruff and README fix
      
      * generalize to **kwargs
      
      * remove unnecessary kwargs
      
      * README and remove kwargs
      
      * README
    • Switch Linting to `ruff` (#1166) · 65b8761d
      Baber Abbasi authored
      * add ruff and isort. remove black and flake8
      
      * remove unnecessary dependencies
      
      * remove dependency from table
      
      * change order
      
      * ran ruff
      
      * check 3.9
      
      * exclude evaluator
      
      * update CI workflow
      
      * use ruff config in pyproject.toml
      
      * test
      
      * add isort rules to ruff
      
      * sort imports
      
      * import `make_table`
      
      * try stages for no-commit-to-branch
      
      * turn on mypy for pre-commit
      
      * test
      
      * test
      
      * test
      
      * change no-commit-to-branch to default
      
      * nits
      
      * fixed dependency
    • feat: add option to upload results to Zeno (#990) · 21d4ae98
      Alex Bäuerle authored
      
      * feat: add option to upload results to Zeno
      
      * config-based upload supporting different task types and metrics
      
      * upload tasks as individual projects
      
      * wording
      
      * readme
      
      * add example notebook
      
      * Update documentation for Zeno integration
      
      * Make zeno deps an extra
      
      * Update README.md
      
      * Document extra deps installation
      
      * Update zeno_visualize.py
      
      * fix: balance parens
      
      * fix typo
      
      * fix merge commit I botched
      
      * Update zeno_visualize.py
      
      * Update logger warning stmt
      
      * fix whitespace
      
      ---------
      Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
  23. 18 Dec, 2023 2 commits
  24. 15 Dec, 2023 2 commits
  25. 13 Dec, 2023 1 commit
  26. 12 Dec, 2023 2 commits
  27. 07 Dec, 2023 1 commit
  28. 04 Dec, 2023 2 commits