1. 02 Jan, 2024 8 commits
  2. 29 Dec, 2023 2 commits
    • Don't silence errors when loading tasks (#1148) · 34b563b1
      Paul McCann authored
      
      
      * Add example failing task
      
      This task includes an invalid import, which raises an exception and
      prevents the task from being loaded. Because the failure is only
      logged at DEBUG level, normal usage shows no error and simply reports
      that the task doesn't exist.
      
      Here's an example command line to run the task:
      
          python -m lm_eval --model hf --model_args pretrained=rinna/japanese-gpt-1b --tasks fail
      
      This task is based on a Japanese Winograd task; the choice is
      incidental and was made only out of familiarity.
      
      * Do not ignore errors when loading tasks
      
      * Change how task errors are logged
      
      This makes the changes proposed in the PR discussion.
      
      1. Exceptions not related to missing modules/imports are logged as
         warnings.
      
      2. Module/import-related exceptions are still logged at debug level,
         but if any occur, a warning is emitted with instructions on how to
         show the logs.
      
      * Remove intentionally failing task
      
      ---------
      Co-authored-by: Paul O'Leary McCann <polm@dampfkraft.com>
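The two logging rules described in the commit message above can be sketched roughly as follows. This is a minimal illustration and not the harness's actual loader: the function name `load_tasks`, the logger name, and the idea of passing a dict of loader callables are all assumptions made for the example.

```python
import logging
from typing import Any, Callable, Dict, Tuple

# Hypothetical logger name, chosen only for this sketch.
logger = logging.getLogger("task_loading_sketch")


def load_tasks(loaders: Dict[str, Callable[[], Any]]) -> Tuple[Dict[str, Any], int]:
    """Call each loader; return the loaded tasks and the number of import failures."""
    tasks: Dict[str, Any] = {}
    import_failures = 0
    for name, loader in loaders.items():
        try:
            tasks[name] = loader()
        except ImportError as err:
            # Rule 2: missing modules/imports stay at debug level.
            # ModuleNotFoundError is a subclass of ImportError, so it lands here too.
            import_failures += 1
            logger.debug("Could not import task %r: %s", name, err)
        except Exception as err:
            # Rule 1: any other exception is surfaced as a warning.
            logger.warning("Unexpected error loading task %r: %s", name, err)
    if import_failures:
        # Rule 2, second half: one summary warning telling the user how to see details.
        logger.warning(
            "%d task(s) failed to import; enable DEBUG logging to see why.",
            import_failures,
        )
    return tasks, import_failures
```

With this shape, a task that fails for a reason other than a missing dependency is visible at the default log level, while optional-dependency failures produce a single summary warning instead of one message per task.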
    • lintangsutawika's avatar
      439dca55
  3. 28 Dec, 2023 9 commits
  4. 27 Dec, 2023 3 commits
  5. 24 Dec, 2023 2 commits
  6. 23 Dec, 2023 1 commit
  7. 22 Dec, 2023 3 commits
    • Anjor Kanekar's avatar
    • Upstream Mamba Support (`mamba_ssm`) (#1110) · 5503b274
      Hailey Schoelkopf authored
      * modularize HFLM code
      
      * pass through extra kwargs to AutoModel.from_pretrained call
      
      * remove explicit model_kwargs
      
      * rename gptq -> autogptq
      
      * fix tokenizer pad token errors
      
      * ensure model always respects device_map and autogptq's selected devices
      
      * add a _get_config helper fn
      
      * add mambaLMWrapper
      
      * add mamba extra
      
      * add mamba extra
      
      * fix conditional import
      
      * Fix botched merge commit
      
      * Remove beginning-of-file comment for consistency
      
      * Add docstring for mambaLM re: supported kwargs
      
      * Alphabetize extras
      
      * Update extras table
      
      * appease precommit
      
      * run precommit on mamba_lm
    • Generic decorator for handling rate limit errors (#1109) · 046ea6e2
      Zach Schillaci authored
      
      
      * Add retry error handler
      
      * fixup! Add retry error handler
      
      * Move to utils.py
      
      * Run isort on utils.py
      
      * Catch multiple exceptions
      
      * Update LMs with exception handler
      
      * Fixes to anthropic retry handler
      
      * fix callback kwarg
      
      * Update textsynth.py
      
      * fix python 3.8 incompatibility
      
      * fix indenterror I introduced
      
      * placate linter?
      
      * Update on_exception_callback kwarg name
      
      * fixup! Merge branch 'main' into add-retry-error-handler
      
      * fixup! fixup! Merge branch 'main' into add-retry-error-handler
      
      * Merge conflicts are fun
      
      * Run pre-commit
      
      ---------
      Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
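For readers unfamiliar with the pattern, a generic retry decorator of the kind this commit describes might look like the sketch below. Only the `on_exception_callback` keyword is taken from the commit messages above; the decorator name, the other parameters, their defaults, and the exponential backoff policy are assumptions for illustration and may differ from what the commit put in `utils.py`.

```python
import functools
import time
from typing import Callable, Iterable, Optional, Type


def retry_on_exceptions(
    exceptions: Iterable[Type[BaseException]],
    max_retries: int = 3,            # hypothetical default
    backoff_time: float = 1.0,       # hypothetical default, in seconds
    backoff_multiplier: float = 2.0, # hypothetical default
    on_exception_callback: Optional[Callable[[BaseException, float], None]] = None,
):
    """Retry the wrapped function when one of `exceptions` is raised."""
    caught = tuple(exceptions)  # lets callers pass several exception types

    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            sleep_time = backoff_time
            attempt = 0
            while True:
                try:
                    return func(*args, **kwargs)
                except caught as err:
                    attempt += 1
                    if attempt > max_retries:
                        raise  # retries exhausted: propagate the last error
                    if on_exception_callback is not None:
                        on_exception_callback(err, sleep_time)
                    time.sleep(sleep_time)
                    sleep_time *= backoff_multiplier  # exponential backoff

        return wrapper

    return decorator


if __name__ == "__main__":
    # Usage: a stand-in for an API call that is rate limited twice, then succeeds.
    state = {"calls": 0}

    @retry_on_exceptions([ConnectionError], backoff_time=0.01)
    def call_api():
        state["calls"] += 1
        if state["calls"] < 3:
            raise ConnectionError("rate limited")
        return "ok"

    print(call_api(), "after", state["calls"], "calls")
```

Keeping the exception types and the callback as parameters is what makes the decorator generic: each model backend can supply its own client library's rate-limit exception and its own logging hook without duplicating the retry loop.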
  8. 21 Dec, 2023 3 commits
  9. 20 Dec, 2023 3 commits
    • Implementing local OpenAI API-style chat completions on any given inference server (#1174) · fcfc0c60
      Vicki Boykis authored
      * LocalChatCompletionsLM add
      
      * clean up completions class
      
      * clean up completions class
      
      * update tokens
      
      * README
      
      * fix constructor
      
      * eos token
      
      * folding local-chat-completions into OpenAIChatCompletions
      
      * refactoring to include gen_kwargs as passable option
      
      * add todo on chat completion kwarg validation
      
      * Ruff and README fix
      
      * generalize to **kwargs
      
      * remove unnecessary kwargs
      
      * README and remove kwargs
      
      * README
    • Error in --num_fewshot option for K-MMLU Evaluation Harness (#1178) · 12f2c5ea
      GUIJIN SON authored
      * update kmmlu default formatting
      
      * Update _default_kmmlu_yaml
      
      * Delete lm_eval/tasks/kmmlu/utils.py
    • Switch Linting to `ruff` (#1166) · 65b8761d
      Baber Abbasi authored
      * add ruff and isort. remove black and flake8
      
      * remove unnecessary dependencies
      
      * remove dependency from table
      
      * change order
      
      * ran ruff
      
      * check 3.9
      
      * exclude evaluator
      
      * update CI workflow
      
      * use ruff config in pyproject.toml
      
      * test
      
      * add isort rules to ruff
      
      * sort imports
      
      * import `make_table`
      
      * try stages for no-commit-to-branch
      
      * turn on mypy for pre-commit
      
      * test
      
      * test
      
      * test
      
      * change no-commit-to-branch to default
      
      * nits
      
      * fixed dependency
  10. 19 Dec, 2023 6 commits