1. 21 Dec, 2023 1 commit
  2. 20 Dec, 2023 4 commits
    • Implementing local OpenAI API-style chat completions on any given inference server (#1174) · fcfc0c60
      Vicki Boykis authored
      * LocalChatCompletionsLM add
      
      * clean up completions class
      
      * clean up completions class
      
      * update tokens
      
      * README
      
      * fix constructor
      
      * eos token
      
      * folding local-chat-completions into OpenAIChatCompletions
      
      * refactoring to include gen_kwargs as passable option
      
      * add todo on chat completion kwarg validation
      
      * Ruff and README fix
      
      * generalize to **kwargs
      
      * remove unnecessary kwargs
      
      * README and remove kwargs
      
      * README
    • Error in --num_fewshot option for K-MMLU Evaluation Harness (#1178) · 12f2c5ea
      GUIJIN SON authored
      * update kmmlu default formatting
      
      * Update _default_kmmlu_yaml
      
      * Delete lm_eval/tasks/kmmlu/utils.py
    • Switch Linting to `ruff` (#1166) · 65b8761d
      Baber Abbasi authored
      * add ruff and isort. remove black and flake8
      
      * remove unnecessary dependencies
      
      * remove dependency from table
      
      * change order
      
      * ran ruff
      
      * check 3.9
      
      * exclude evaluator
      
      * update CI workflow
      
      * use ruff config in pyproject.toml
      
      * test
      
      * add isort rules to ruff
      
      * sort imports
      
      * import `make_table`
      
      * try stages for no-commit-to-branch
      
      * turn on mypy for pre-commit
      
      * test
      
      * test
      
      * test
      
      * change no-commit-to-branch to default
      
      * nits
      
      * fixed dependency
    • feat: add option to upload results to Zeno (#990) · 21d4ae98
      Alex Bäuerle authored
      * feat: add option to upload results to Zeno
      
      * config-based upload supporting different task types and metrics
      
      * upload tasks as individual projects
      
      * wording
      
      * readme
      
      * add example notebook
      
      * Update documentation for Zeno integration
      
      * Make zeno deps an extra
      
      * Update README.md
      
      * Document extra deps installation
      
      * Update zeno_visualize.py
      
      * fix: balance parens
      
      * fix typo
      
      * fix merge commit I botched
      
      * Update zeno_visualize.py
      
      * Update logger warning stmt
      
      * fix whitespace
      
      ---------
      Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
  3. 19 Dec, 2023 5 commits
  4. 18 Dec, 2023 5 commits
  5. 17 Dec, 2023 1 commit
    • [WIP] Add IFEval / Instruction-Following Eval (#1087) · aa61f940
      Wis Kojohnjaratkul authored
      * Add IFEval task
      
      * Check and download nltk punkt if not already downloaded
      
      * Update gen_max_toks to 2048 to support "900 words+" instructions
      
      * Resolve pre-commit linting issues
      
      * Reduce max_gen_toks to 1280 to conserve token usage
      
      * Add warning message in `process_results` call for non chat-finetuned models
  6. 16 Dec, 2023 2 commits
  7. 15 Dec, 2023 8 commits
  8. 14 Dec, 2023 7 commits
  9. 13 Dec, 2023 7 commits