  1. 26 Sep, 2024 1 commit
  2. 13 Jun, 2024 1 commit
  3. 01 Feb, 2024 1 commit
    • Faster Task and Group Loading, Allow Recursive Groups (#1321) · d714fc95
      Lintang Sutawika authored
      
      
      * add trust_remote_code as default
      
      * task for testing recursive
      
      * changed source of ALL_TASKS
      
      * tasks should only accept TaskObjects
      
      * initialize_tasks returns list of tasks and groups
      
      * remove trust_remote_code for now
      
      * moved constructor process to inside load_yaml_config
      
      * more comprehensive way to index tasks and groups
      
      * pre-commit format
      
      * add exit after error
      
      * adjust how task objects are called
      
      * no need to use get_task_dict
      
      * load_task_or_group works but only for tasks
      
      * pre-commit format
      
      * half working for nested groups
      
      * changed variable names
      
      * allow groups and tasks to work
      
      * temp save
      
      * indexing and loading are part of a task_manager object
      
      * adapted initialize_tasks
      
      * iron out bugs
      
      * fixed typo
      
      * fixed typo
      
      * simplified code
      
      * further tidy up
      
      * remove lines for testing
      
      * removed test lines
      
      * removed unused code
      
      * remove unused import
      
      * fixed bug
      
      * removed comments
      
      * group in a list of group can accept parameter changes like `num_fewshot`
      
      * check if config is task update
      
      * add GroupConfig object
      
      * edit test yaml
      
      * remove args
      
      * testing returning to python task list
      
      * add weight_by_size config
      
      * describe weight_by_size in docs
      
      * fix weight by size potential error
      
      * can load individual custom python class task
      
      * moved import_function into the config loading file
      
      * remove print lines
      
      * add squadv2 yaml
      
      * temporary scroll implementation
      
      * revert back to use load_yaml_config but with modes
      
      * fix group being loaded with a None
      
      * reformat
      
      * can load unregistered tasks from a group
      
      * update scrolls
      
      * edit scrolls multiplechoice task
      
      * adjust class initialization
      
      * fix initialization
      
      * changed how to identify group and python tasks, fix logger
      
      * allow loading "include" that is nested in a group config
      
      * reworked flan benchmark
      
      * allow duplicate task in the same group to co-exist
      
      * process group_alias
      
      * removed group_alias
      
      * allow parameters set in group_config to apply to all tasks in tasklist
      
      * add function, but comment for now
      
      * reworked processing dict-base config
      
      * fixed how configs in group are processed
      
      * update to allow root group to have its alias used
      
      * remove unused classes
      
      * remove unused classes
      
      * revert some parts to original
      
      * forgot to change one variable
      
      * adapt the new process to use get_task_dict
      
      * fix for singular group call
      
      * fix variable names
      
      * add TaskManager into the evaluator
      
      * format
      
      * changed how dict tasks are loaded
      
      * add docs
      
      * Update docs/new_task_guide.md
      Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
      
      * Update evaluator.py
      
      * Update evaluator.py
      
      * remove groupconfig for now
      
      * changed _config to config
      
      * update interface.md to explain TaskManager
      
      * added property functions
      
      * adjusted logger
      
      * update write_out.py
      
      * updated tests
      
      * added documentation and some modifications
      
      * added docstring documentation
      
      * precommit format
      
      * updated task loading for tests
      
      * updates tests
      
      * changed arg order for load_yaml_config
      
      * update to handle scrolls and edit log message
      
      * remove unused lines
      
      * return a list of task classes and not a dict
      
      * Update __init__.py
      
      * Delete lm_eval/tasks/benchmarks/test.yaml
      
      * Update task.py
      
      * Update lm_eval/utils.py
      Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
      
      * Update lm_eval/utils.py
      Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
      
      * Update utils.py
      
      * re-added old functions with new log message
      
      * Update docs/new_task_guide.md
      Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
      
      * Update new_task_guide.md
      
      * added info regarding `get_task_dict` and documentation
      
      * add get_config for Task
      
      * pre-commit formatting
      
      ---------
      Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
  4. 28 Jan, 2024 1 commit
  5. 20 Dec, 2023 1 commit
    • Switch Linting to `ruff` (#1166) · 65b8761d
      Baber Abbasi authored
      * add ruff and isort. remove black and flake8
      
      * remove unnecessary dependencies
      
      * remove dependency from table
      
      * change order
      
      * ran ruff
      
      * check 3.9
      
      * exclude evaluator
      
      * update CI workflow
      
      * use ruff config in pyproject.toml
      
      * test
      
      * add isort rules to ruff
      
      * sort imports
      
      * import `make_table`
      
      * try stages for no-commit-to-branch
      
      * turn on mypy for pre-commit
      
      * test
      
      * test
      
      * test
      
      * change no-commit-to-branch to default
      
      * nits
      
      * fixed dependency
  6. 13 Dec, 2023 1 commit
  7. 17 Nov, 2023 1 commit
  8. 20 Oct, 2023 2 commits
  9. 19 Sep, 2023 2 commits
  10. 14 Sep, 2023 1 commit
  11. 10 Sep, 2023 1 commit
  12. 15 Jul, 2023 1 commit
  13. 19 Jun, 2023 1 commit
  14. 15 Jun, 2023 1 commit
  15. 12 Jun, 2023 1 commit
    • [Refactor] [WIP] New YAML advanced docs (#567) · 79b972d6
      Hailey Schoelkopf authored
      
      
      * add wip gsm8k yaml
      
      * cleanup tasks dir
      
      * push gsm8k yaml changes
      
      * rename gpt2.py
      
      * add updated gsm8k, triviaqa baseline
      
      * add new cot yaml
      
      * allow for multiple filter pipelines, new filter types
      
      * updated gsm8k + sampling gen configs
      
      * cleanup self-consistency yaml
      
      * push outline for advanced docs
      
      * push docs checklist
      
      * switch to inheritance for many tasks
      
      * acc_norm and acc_mutual_info fixed
      
      * fix missing newline in error msg
      
      * remove many .py tasks
      
      * updated GSM8k
      
      * added more doc
      
      * Update advanced_task_guide.md
      
      Added list of parameters
      
      * Update advanced_task_guide.md
      
      * Added details on listing metrics
      
      * Update advanced_task_guide.md
      
      * Added more explanation
      
      * modify current default filter name
      
      * add new tags to tasks
      
      * remove a lingering print()
      
      * add rest of param docs, cleanup deprecated fields
      
      * push docs update
      
      * move ALL_TASKS definition location
      
      * confirm write_out.py works if no description dict passed
      
      ---------
      Co-authored-by: lintangsutawika <lintang@sutawika.com>
  16. 03 May, 2022 1 commit
  17. 29 Apr, 2022 1 commit
  18. 28 Apr, 2022 1 commit
  19. 27 Apr, 2022 2 commits
  20. 25 Apr, 2022 1 commit
  21. 24 Dec, 2021 1 commit
  22. 15 Dec, 2021 1 commit
  23. 30 Oct, 2021 1 commit
  24. 28 Mar, 2021 2 commits
  25. 27 Mar, 2021 1 commit
  26. 06 Feb, 2021 1 commit
  27. 05 Feb, 2021 1 commit
  28. 02 Feb, 2021 1 commit
  29. 30 Nov, 2020 2 commits
  30. 29 Nov, 2020 1 commit
  31. 05 Oct, 2020 1 commit
  32. 14 Sep, 2020 2 commits
  33. 07 Sep, 2020 2 commits