1. 09 May, 2025 1 commit
  2. 06 May, 2025 1 commit
  3. 16 Apr, 2025 1 commit
  4. 14 Apr, 2025 1 commit
  5. 20 Mar, 2025 2 commits
  6. 11 Mar, 2025 1 commit
  7. 27 Feb, 2025 1 commit
  8. 21 Feb, 2025 1 commit
    • Lintang Sutawika's avatar
      Logging (#2203) · 1ba35e62
      Lintang Sutawika authored
      
      
      * changed source of eval_logger
      
      * allow eval_logger to be set from args
      
      * removed verbosity arg from non-main methods
      
      * fix logging
      
      * pre-commit
      
      * set verbosity in eval logger
      
      * replace utils.eval_logger
      
      * fix logging in main
      
      * add logging to docs
      
      * add logging message
      
      * nit
      
      * add logging to docs
      
      * refactor setup_logging to utils
      
      ---------
      Co-authored-by: default avatarBaber <baber@hey.com>
      1ba35e62
  9. 17 Feb, 2025 1 commit
  10. 07 Feb, 2025 1 commit
  11. 19 Jan, 2025 1 commit
  12. 15 Jan, 2025 1 commit
    • Baber Abbasi's avatar
      assistant prefill (#2615) · 703fbffd
      Baber Abbasi authored
      * add assistant prefix
      
      * add arc_challenge from llama
      
      * nit
      
      * nit
      
      * nit
      
      * add assistant prefix
      
      * add mmlu_llama
      
      * nit
      
      * nit
      
      * Revert "nit"
      
      This reverts commit 6a97f8356237305e375212b966b30e8de59dd4bc.
      
      * fix regex bug
      
      * add assistant_prefix to vllm
      
      * add `Question:`
      
      * add mmlu_pro
      
      * add fewshot assistant_prefix
      
      * use `assistant_prefill`
      
      * typehints
      
      * nits
      
      * nits
      
      * add to docs
      
      * add readme
      703fbffd
  13. 16 Dec, 2024 1 commit
  14. 30 Nov, 2024 1 commit
  15. 15 Nov, 2024 1 commit
  16. 30 Oct, 2024 1 commit
  17. 22 Oct, 2024 1 commit
    • Leonid Sinev's avatar
      [Fix] Replace generic exception classes with a more specific ones (#1989) · d4ae9635
      Leonid Sinev authored
      * Replace generic exception classes with a more specific ones
      
      * rerun pre-commit to pass linter tests
      
      * Revert "rerun pre-commit to pass linter tests"
      
      This reverts commit 67f88ccf144469853217704520e613196042d859.
      
      * reduce repetitions in errors or so
      
      * Replace generic exception class with a more specific one
      d4ae9635
  18. 04 Sep, 2024 1 commit
  19. 30 Aug, 2024 1 commit
  20. 28 Aug, 2024 1 commit
  21. 02 Jul, 2024 1 commit
  22. 28 Jun, 2024 1 commit
  23. 13 Jun, 2024 1 commit
  24. 11 Jun, 2024 1 commit
  25. 28 May, 2024 1 commit
  26. 23 May, 2024 1 commit
  27. 07 May, 2024 1 commit
  28. 02 May, 2024 1 commit
  29. 20 Mar, 2024 1 commit
  30. 13 Mar, 2024 1 commit
  31. 09 Mar, 2024 1 commit
  32. 03 Mar, 2024 1 commit
    • Baber Abbasi's avatar
      Vllm update DP+TP (#1508) · e5e35fca
      Baber Abbasi authored
      * use `@ray.remote` with distributed vLLM
      
      * update versions
      
      * bugfix
      
      * unpin vllm
      
      * fix pre-commit
      
      * added version assertion error
      
      * Revert "added version assertion error"
      
      This reverts commit 8041e9b78e95eea9f4f4d0dc260115ba8698e9cc.
      
      * added version assertion for DP
      
      * expand DP note
      
      * add warning
      
      * nit
      
      * pin vllm
      
      * fix typos
      e5e35fca
  33. 01 Mar, 2024 2 commits
  34. 27 Feb, 2024 1 commit
    • Baber Abbasi's avatar
      Refactor `evaluater.evaluate` (#1441) · 5ccd65d4
      Baber Abbasi authored
      
      
      * change `all_gather` to `gather`
      
      * add TaskOutput utility class
      
      * Add FilterResults class and refactor task handling.
      
      * Rename `key` to `filter_key` for clarity
      
      * Add `print_writeout` function in utils.py
      
      * Add function to calculate limit size.
      
      * Add doc_iterator method to Task class
      
      * Refactor `doc_iterator` and cleanup in Task class
      
      * remove superfluous bits
      
      * change `all_gather` to `gather`
      
      * bugfix
      
      * bugfix
      
      * fix `gather`
      
      * Refactor `gather` loop
      
      * Refactor aggregate metrics calculation
      
      * Refactor and simplify aggregate metrics calculation
      Removed unused code
      
      * Simplify metrics calculation and remove unused code.
      
      * simplify the metrics calculation in `utils.py` and `evaluator.py`.
      
      * Fix group metric
      
      * change evaluate to hf_evaluate
      
      * change evaluate to hf_evaluate
      
      * add docs
      
      * add docs
      
      * nits
      
      * make isslice keyword only
      
      * nit
      
      * add todo
      
      * nit
      
      * nit
      
      * nit: swap order samples_metrics tuple
      
      * move instance sorting outside loop
      
      * nit
      
      * nit
      
      * Add __repr__ for ConfigurableTask
      
      * nit
      
      * nit
      
      * Revert "nit"
      
      This reverts commit dab8d9977a643752a17f840fd8cf7e4b107df28f.
      
      * fix some logging
      
      * nit
      
      * fix `predict_only` bug. thanks to `@LSinev`!
      
      * change `print_tasks` to `prepare_print_tasks`
      
      * nits
      
      * move eval utils
      
      * move eval utils
      
      * nit
      
      * add comment
      
      * added tqdm descriptions
      
      * Update lm_eval/evaluator_utils.py
      Co-authored-by: default avatarHailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
      
      * fix mgsm bug
      
      * nit
      
      * fix `build_all_requests`
      
      * pre-commit
      
      * add ceil to limit
      
      ---------
      Co-authored-by: default avatarHailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
      5ccd65d4
  35. 26 Feb, 2024 1 commit
  36. 22 Feb, 2024 1 commit
  37. 20 Feb, 2024 1 commit
  38. 14 Feb, 2024 1 commit