  1. 05 Feb, 2025 10 commits
  2. 04 Feb, 2025 1 commit
  3. 29 Jan, 2025 3 commits
  4. 28 Jan, 2025 5 commits
  5. 24 Jan, 2025 1 commit
  6. 21 Jan, 2025 3 commits
  7. 20 Jan, 2025 6 commits
  8. 19 Jan, 2025 1 commit
  9. 17 Jan, 2025 1 commit
  10. 16 Jan, 2025 4 commits
    • nit · 0007b74a
      Baber authored
    • fix · 14eee946
      Baber authored
    • add evaluator · e3b881ae
      Baber authored
    • nits · 8181f43c
      Baber authored
  11. 15 Jan, 2025 5 commits
    • Merge branch 'main' into mathvista · 2106fbeb
      Baber authored
      # Conflicts:
      #	lm_eval/models/openai_completions.py
    • assistant prefill (#2615) · 703fbffd
      Baber Abbasi authored
      * add assistant prefix
      
      * add arc_challenge from llama
      
      * nit
      
      * nit
      
      * nit
      
      * add assistant prefix
      
      * add mmlu_llama
      
      * nit
      
      * nit
      
      * Revert "nit"
      
      This reverts commit 6a97f8356237305e375212b966b30e8de59dd4bc.
      
      * fix regex bug
      
      * add assistant_prefix to vllm
      
      * add `Question:`
      
      * add mmlu_pro
      
      * add fewshot assistant_prefix
      
      * use `assistant_prefill`
      
      * typehints
      
      * nits
      
      * nits
      
      * add to docs
      
      * add readme
    • Add MLQA (#2622) · e86cece6
      Shivansh Pachnanda authored
      * Add MLQA
      * add mlqa_common_yaml
      
      * add 49 tests of mlqa family
      
      * update tasks/README.md
      
      ---------
      
      * fix: mlqa ast error
      
      * nit: removed .yaml ext from template_yaml
      
      * nit changes: minor modifications generate_tasks.py
      
      * deleted lm_eval/tasks/mlqa/mlqa_common_yaml.yaml
      
      * tests updated
      
      * nit
    • Add MBPP (#2247) · 5db23e2c
      Hojin Lee authored
      * add mbpp
      
      * fix some bugs
      
      * add README for mbpp
      
      * update README
      
      * nits
      
      ---------
      Co-authored-by: Hojin Lee <19949034+hjlee1371@users.noreply.github.com>
      Co-authored-by: Baber <baber@hey.com>
    • Add HumanEval (#1992) · 4c11206b
      Hojin Lee authored
      * add custom filter
      
      * fix type casting of references
      
      * add humaneval
      
      * fix a bug in humaneval
      
      * add greedy version of humaneval
      
      * update tasks README
      
      * test humaneval
      
      * return multiple metrics
      
      * nit
      
      * add confirmation to run code tasks
      
      * nit
      
      * nit
      
      ---------
      Co-authored-by: Hojin Lee <19949034+hjlee1371@users.noreply.github.com>
      Co-authored-by: Baber <baber@hey.com>