1. 11 Jun, 2024 1 commit
    • KonradSzafer's avatar
      Results filenames handling fix (#1926) · 69952581
      KonradSzafer authored
      * results filenames handling moved to utils
      
      * zeno results handling fix
      
      * tasks_for_model backward compatibility
      
      * results files logic moved to tasks_for_model
      
      * moved sanitize_model_name to utils
      69952581
  2. 10 Jun, 2024 1 commit
  3. 09 Jun, 2024 1 commit
  4. 07 Jun, 2024 4 commits
  5. 06 Jun, 2024 3 commits
  6. 05 Jun, 2024 1 commit
  7. 03 Jun, 2024 3 commits
  8. 31 May, 2024 2 commits
  9. 30 May, 2024 2 commits
  10. 28 May, 2024 1 commit
  11. 26 May, 2024 1 commit
  12. 24 May, 2024 5 commits
  13. 23 May, 2024 1 commit
  14. 22 May, 2024 1 commit
  15. 21 May, 2024 1 commit
  16. 19 May, 2024 1 commit
  17. 13 May, 2024 1 commit
  18. 09 May, 2024 1 commit
    • Edd's avatar
      Copal task (#1803) · 1980a13c
      Edd authored
      * add copal
      
      * change name to copal id for clarity and the task name
      
      * remove `copal_id...` to yaml to make it work
      
      * checkmark on README
      
      * change group name to `copal_id`
      1980a13c
  19. 08 May, 2024 1 commit
  20. 07 May, 2024 4 commits
  21. 06 May, 2024 1 commit
    • LSinev's avatar
      Provide ability for custom sampler for ConfigurableTask (#1616) · ae72cebc
      LSinev authored
      * Added fewshot sampling seeds to evaluator.simple_evaluate signature
      
      Way to control seed of fewshot sampling
      may help with #1591
      
      * Added ability for custom sampler for ConfigurableTask
      
      May be set in config like
      ```
      fewshot_config:
        sampler: !function utils.MyFewshotSampler
      ```
      
      * explicitly set fewshot random generator seed for HFLM generate_until_task test
      
      * add backward compatibility for three args seed setup
      
      * save seeds info to logs/reports
      ae72cebc
  22. 05 May, 2024 3 commits