1. 09 May, 2025 1 commit
  2. 06 May, 2025 5 commits
  3. 29 Apr, 2025 1 commit
  4. 18 Apr, 2025 1 commit
  5. 16 Apr, 2025 5 commits
  6. 15 Apr, 2025 1 commit
    • Jerry Zhang's avatar
      Add support for quantization_config (#2842) · 758c5ed8
      Jerry Zhang authored
      * Add support for quantization_config
      
      Summary:
      Previously quantization_config is ignored, so torchao quantized models are not supported,
      this PR adds that.
      
      Test Plan:
      lm_eval --model hf --model_args pretrained=jerryzh168/gemma3-int4wo --tasks hellaswag --device cuda:0 --batch_size 8
      
      Reviewers:
      
      Subscribers:
      
      Tasks:
      
      Tags:
      
      * quantization_config is optional
      758c5ed8
  7. 14 Apr, 2025 2 commits
  8. 07 Apr, 2025 1 commit
    • Felipe Maia Polo's avatar
      Add `--samples` Argument for Fine-Grained Task Evaluation in... · d693dcd2
      Felipe Maia Polo authored
      
       Add `--samples` Argument for Fine-Grained Task Evaluation in `lm-evaluation-harness`. This feature is the first step towards efficient multi-prompt evaluation with PromptEval [1,2] (#2520)
      
      * added option --examples
      
      * specifying examples in dictionary
      
      * run pre-commit - fix arg type
      
      Signed-off-by: Mírian Silva <mirianfrsilva@ibm.com
      
      * fixing bug when examples==None
      
      * fixing bug when examples==None
      
      * limit or examples must be None in simple_evaluate.py and in evaluator.py
      
      * run pre-commit (fix formatting)
      
      Signed-off-by: Mírian Silva <mirianfrsilva@ibm.com
      
      * merge main and run pre-commit (fix formatting)
      
      Signed-off-by: Mírian Silva <mirianfrsilva@ibm.com
      
      * Update __main__.py
      
      undefined "limit" and "examples"
      
      * update branch, fix conflicts, run pre-commit
      
      * nits
      
      * nits
      
      * change 'examples' to 'samples'
      
      ---------
      
      Signed-off-by: Mírian Silva <mirianfrsilva@ibm.com
      Co-authored-by: default avatarmirianfrsilva <mirianfrsilva@ibm.com>
      Co-authored-by: default avatarStella Biderman <stellabiderman@gmail.com>
      Co-authored-by: default avatarBaber <baber@hey.com>
      d693dcd2
  9. 04 Apr, 2025 3 commits
  10. 03 Apr, 2025 1 commit
  11. 02 Apr, 2025 2 commits
  12. 01 Apr, 2025 2 commits
  13. 30 Mar, 2025 1 commit
    • Alexandre Marques's avatar
      Adds MMLU CoT, gsm8k and arc_challenge for llama instruct (#2829) · 3816796e
      Alexandre Marques authored
      * llama-style MMLU CoT
      
      * Refactor MMLU CoT template YAML to simplify 'until' structure
      
      * Add GSM8K task configuration for LLaMA3 with few-shot examples
      
      * Fix missing newline at end of MMLU CoT YAML file
      
      * Add ARC-Challenge task configuration and processing utility
      
      * Add additional MMLU and ARC-Challenge task variants to README
      
      * Update README with notes on arc_challenge_llama dataset preprocessing
      3816796e
  14. 29 Mar, 2025 1 commit
  15. 28 Mar, 2025 3 commits
  16. 27 Mar, 2025 3 commits
  17. 26 Mar, 2025 1 commit
  18. 25 Mar, 2025 1 commit
  19. 23 Mar, 2025 1 commit
    • Bruno Carneiro's avatar
      feat: replace library (#2828) · 1afb190c
      Bruno Carneiro authored
      I haven't had time to review the library that's replacing tj-actions or whether this change breaks anything, but the vulnerability is quite severe and I would rather the functionality be broken than risk compromise.
      
      **to do:** review this later
      1afb190c
  20. 21 Mar, 2025 2 commits
  21. 20 Mar, 2025 2 commits