1. 19 Dec, 2023 1 commit
  2. 18 Dec, 2023 1 commit
  3. 17 Dec, 2023 1 commit
    • Wis Kojohnjaratkul's avatar
      [WIP] Add IFEval / Instruction-Following Eval (#1087) · aa61f940
      Wis Kojohnjaratkul authored
      * Add IFEval task
      
      * Check and download nltk punkt if not already downloaded
      
      * Update gen_max_toks to 2048 to support "900 words+" instructions
      
      * Resolve pre-commit linting issues
      
      * Reduce max_gen_toks to 1280 to conserve token usage
      
      * Add warning message in `process_results` call for non chat-finetuned models
      aa61f940
  4. 15 Dec, 2023 1 commit
    • MorishT's avatar
      Add benchmark FLD (#1122) · 755bf6e8
      MorishT authored
      
      
      * [fix] loading dataset from hub fails when the dataset name includes '.', as the program assumes it is on the local filesystem
      
      * add FLD benchmark
      
      * Update task.py
      
      * [update] add group 'fld'
      
      * [update] rename fld -> fld_default. add explanation to the readme
      
      * Update README.md
      
      ---------
      Co-authored-by: default avatarLintang Sutawika <lintang@sutawika.com>
      755bf6e8
  5. 14 Dec, 2023 2 commits
  6. 13 Dec, 2023 5 commits
  7. 11 Dec, 2023 5 commits
  8. 10 Dec, 2023 6 commits
  9. 08 Dec, 2023 1 commit
  10. 07 Dec, 2023 8 commits
  11. 04 Dec, 2023 2 commits
  12. 28 Nov, 2023 7 commits