1. 03 Jul, 2024 1 commit
  2. 02 Jul, 2024 2 commits
  3. 25 Jun, 2024 2 commits
  4. 24 Jun, 2024 2 commits
  5. 19 Jun, 2024 1 commit
  6. 10 Jun, 2024 3 commits
  7. 07 Jun, 2024 1 commit
  8. 04 Jun, 2024 1 commit
  9. 03 Jun, 2024 2 commits
  10. 30 May, 2024 1 commit
  11. 26 May, 2024 1 commit
  12. 24 May, 2024 1 commit
  13. 16 May, 2024 1 commit
  14. 10 May, 2024 2 commits
  15. 08 May, 2024 2 commits
  16. 07 May, 2024 8 commits
  17. 06 May, 2024 1 commit
    • Provide ability for custom sampler for ConfigurableTask (#1616) · ae72cebc
      LSinev authored
      * Added fewshot sampling seeds to evaluator.simple_evaluate signature
      
      Provides a way to control the seed of fewshot sampling;
      may help with #1591
      
      * Added ability for custom sampler for ConfigurableTask
      
      The sampler may be set in the task config, for example:
      ```
      fewshot_config:
        sampler: !function utils.MyFewshotSampler
      ```
      
      * explicitly set fewshot random generator seed for HFLM generate_until_task test
      
      * add backward compatibility for three-argument seed setup
      
      * save seeds info to logs/reports
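As a rough illustration of the `fewshot_config` entry in the commit above, the class named by `sampler: !function utils.MyFewshotSampler` could look something like the sketch below. This is a standalone sketch, not the harness's actual interface. The constructor arguments (`docs`, `rnd`) and the `sample(n)` method are assumptions made for illustration; a real sampler would most likely need to follow whatever base sampler class the harness provides. The explicit `random.Random` instance mirrors the commit's aim of making fewshot sampling reproducible through a seed.

```python
import random
from typing import Any, Dict, List, Optional


class MyFewshotSampler:
    """Hypothetical custom sampler: draws few-shot examples with a seeded RNG."""

    def __init__(
        self,
        docs: List[Dict[str, Any]],
        rnd: Optional[random.Random] = None,
    ) -> None:
        # Assumption: the sampler receives the pool of few-shot documents
        # and a random generator whose seed the caller controls.
        self.docs = list(docs)
        self.rnd = rnd if rnd is not None else random.Random(1234)

    def sample(self, n: int) -> List[Dict[str, Any]]:
        # Draw up to n distinct documents; the same seed yields the same draw.
        return self.rnd.sample(self.docs, min(n, len(self.docs)))


# Toy document pool, invented for this example.
docs = [{"question": f"q{i}", "answer": f"a{i}"} for i in range(10)]

# Two samplers built with the same seed select the same few-shot examples.
first = MyFewshotSampler(docs, rnd=random.Random(1234)).sample(3)
second = MyFewshotSampler(docs, rnd=random.Random(1234)).sample(3)
```

Passing the generator in, instead of calling the module-level `random` functions, keeps the draw independent of any other code that touches the global random state.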
  18. 05 May, 2024 1 commit
  19. 03 May, 2024 1 commit
    • evaluation tracker implementation (#1766) · 59cf408a
      KonradSzafer authored
      * evaluation tracker implementation
      
      * OVModelForCausalLM test fix
      
      * typo fix
      
      * moved methods args
      
      * multiple args in one flag
      
      * loggers moved to dedicated dir
      
      * improved filename sanitization
  20. 25 Apr, 2024 1 commit
  21. 24 Apr, 2024 1 commit
  22. 23 Apr, 2024 1 commit
  23. 22 Mar, 2024 1 commit
  24. 18 Mar, 2024 1 commit
    • Cleanup for v0.4.2 release (#1573) · 5627e819
      Hailey Schoelkopf authored
      * Update interface.md
      
      * fix: make caching reqs always work with accelerate launch
      
      * remove stale task migration checklist
      
      * remove deprecation warnings
      
      * make informative TypeErrors for get_task_dict
      
      * bump version metadata
      
      * fix num_fewshot printing bug
      
      * add fewshot value to cache key
  25. 17 Mar, 2024 1 commit