1. 20 Dec, 2023 1 commit
    • Alex Bäuerle's avatar
      feat: add option to upload results to Zeno (#990) · 21d4ae98
      Alex Bäuerle authored
      
      
      * feat: add option to upload results to Zeno
      
      * config-based upload supporting different task types and metrics
      
      * upload tasks as individual projects
      
      * wording
      
      * readme
      
      * add example notebook
      
      * Update documentation for Zeno integration
      
      * Make zeno deps an extra
      
      * Update README.md
      
      * Document extra deps installation
      
      * Update zeno_visualize.py
      
      * fix: balance parens
      
      * fix typo
      
      * fix merge commit I botched
      
      * Update zeno_visualize.py
      
      * Update logger warning stmt
      
      * fix whitespace
      
      ---------
      Co-authored-by: default avatarHailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
      21d4ae98
  2. 19 Dec, 2023 5 commits
  3. 18 Dec, 2023 5 commits
  4. 17 Dec, 2023 1 commit
    • Wis Kojohnjaratkul's avatar
      [WIP] Add IFEval / Instruction-Following Eval (#1087) · aa61f940
      Wis Kojohnjaratkul authored
      * Add IFEval task
      
      * Check and download nltk punkt if not already downloaded
      
      * Update gen_max_toks to 2048 to support "900 words+" instructions
      
      * Resolve pre-commit linting issues
      
      * Reduce max_gen_toks to 1280 to conserve token usage
      
      * Add warning message in `process_results` call for non chat-finetuned models
      aa61f940
  5. 16 Dec, 2023 2 commits
  6. 15 Dec, 2023 8 commits
  7. 14 Dec, 2023 7 commits
  8. 13 Dec, 2023 7 commits
  9. 12 Dec, 2023 4 commits