1. 04 Oct, 2025 1 commit
    • Baber Abbasi's avatar
      Fewshot refactor (#3227) · 003e5852
      Baber Abbasi authored
      
      
      * overhaul `ContextSampler`
      
      * refactor masakhapos
      
      * move multi_target to `exact_match`
      
      * remove doc_to_choice from `boolq-seq2seq`
      
      * remove doc_to_choice in generation process_results
      
      * Remove unused `doc_to_choice` and fix superglue whitespaces
      
      * require multiple_inputs and multiple_targets to be explicitly set in taskconfig
      
      * fix copa; better logging in task init
      
      * fix doc_to_target to return int rather than str (deprecated)
      
      * fix processing regression; recursively parse lists fron template
      
      * remove redundant jinja parsing logic
      
      * remove promptsource
      
      * for multiple_inputs use `doc_to_text: list[str]``
      
      * Refactor `ContextSampler` `fewshot_context`
      
      * fix multiple_input context
      
      * fix `target_delimiter` with `gen_prefix`
      
      * `doc_to_text` is list for multiple_inputs
      
      * Refactor `count_bytes` and `count_words` methods to `@staticmethod`
      
      * make has_*(train/test/validation) to properties
      
      * remove `multi_target` `generate_until`
      
      * `fix doc_to_target/multiple_targets handling add tests
      
      * rename `multi_target` to `multiple_targets`
      
      * evalaute list when multiple targets
      
      * allow doc_to_target to return list
      
      * Remove gen_prefix space and add warning (#3239)
      
      * Remove gen_prefix space and add warning
      
      * fix null gen_prefix bug again
      
      * use git tests
      
      ---------
      Co-authored-by: default avatarBoaz Ben-Dov <bendboaz@gmail.com>
      003e5852
  2. 25 Sep, 2025 12 commits
  3. 21 Feb, 2025 1 commit
    • Lintang Sutawika's avatar
      Logging (#2203) · 1ba35e62
      Lintang Sutawika authored
      
      
      * changed source of eval_logger
      
      * allow eval_logger to be set from args
      
      * removed verbosity arg from non-main methods
      
      * fix logging
      
      * pre-commit
      
      * set verbosity in eval logger
      
      * replace utils.eval_logger
      
      * fix logging in main
      
      * add logging to docs
      
      * add logging message
      
      * nit
      
      * add logging to docs
      
      * refactor setup_logging to utils
      
      ---------
      Co-authored-by: default avatarBaber <baber@hey.com>
      1ba35e62
  4. 19 Jan, 2025 1 commit
  5. 28 Nov, 2024 1 commit
  6. 26 Apr, 2024 1 commit
  7. 20 Feb, 2024 1 commit
  8. 13 Feb, 2024 1 commit
  9. 11 Feb, 2024 1 commit
    • Baber Abbasi's avatar
      Evaluate (#1385) · 1ff84897
      Baber Abbasi authored
      * un-exclude `evaluate.py` from linting
      
      * readability
      
      * readability
      
      * add task name to build info message
      
      * fix link
      
      * nit
      
      * add functions for var and mean pooling
      
      * add functions for var and mean pooling
      
      * metadata compatibility with task
      
      * rename `override_config` to `set_config` and move to `Task`
      
      * add unit test
      
      * nit
      
      * nit
      
      * bugfix
      
      * nit
      
      * nit
      
      * nit
      
      * add docstrings
      
      * fix metadata-fewshot
      
      * revert metric refactor
      
      * nit
      
      * type checking
      
      * type hints
      
      * type hints
      
      * move `override_metric` to `Task`
      
      * change metadata
      
      * change name
      
      * pre-commit
      
      * rename
      
      * remove
      
      * remove
      
      * `override_metric` backwards compatible with `Task`
      
      * type hints
      
      * use generic
      
      * type hint
      1ff84897
  10. 28 Jan, 2024 1 commit
  11. 20 Dec, 2023 1 commit
    • Baber Abbasi's avatar
      Switch Linting to `ruff` (#1166) · 65b8761d
      Baber Abbasi authored
      * add ruff and isort. remove black and flake8
      
      * remove unnecessary dependencies
      
      * remove dependency from table
      
      * change order
      
      * ran ruff
      
      * check 3.9
      
      * exclude evaluator
      
      * update CI workflow
      
      * use ruff config in pyproject.toml
      
      * test
      
      * add isort rules to ruff
      
      * sort imports
      
      * import `make_table`
      
      * try stages for no-commit-to-branch
      
      * turn on mypy for pre-commit
      
      * test
      
      * test
      
      * test
      
      * change no-commit-to-branch to default
      
      * nits
      
      * fixed dependency
      65b8761d
  12. 02 Nov, 2023 2 commits
  13. 19 Oct, 2023 3 commits
  14. 18 Oct, 2023 2 commits
  15. 22 Sep, 2023 2 commits
  16. 01 Aug, 2023 1 commit
  17. 18 Jul, 2023 1 commit
  18. 04 Jul, 2023 1 commit
  19. 03 Jul, 2023 1 commit
  20. 02 Jul, 2023 1 commit
  21. 15 Jun, 2023 1 commit
  22. 13 Jun, 2023 2 commits
  23. 12 Jun, 2023 1 commit
    • Hailey Schoelkopf's avatar
      [Refactor] [WIP] New YAML advanced docs (#567) · 79b972d6
      Hailey Schoelkopf authored
      
      
      * add wip gsm8k yaml
      
      * cleanup tasks dir
      
      * push gsm8k yaml changes
      
      * rename gpt2.py
      
      * add updated gsm8k , triviaqa baseline
      
      * add new cot yaml
      
      * allow for multiple filter pipelines, new filter types
      
      * updated gsm8k + sampling gen configs
      
      * cleanup self-consistency yaml
      
      * push outline for advanced docs
      
      * push docs checklist
      
      * switch to inheritance for many tasks
      
      * acc_norm and acc_mutual_info fixed
      
      * fix missing newline in error msg
      
      * remove many .py tasks
      
      * updated GSM8k
      
      * added more doc
      
      * Update advanced_task_guide.md
      
      Added list of parameters
      
      * Update advanced_task_guide.md
      
      * Added details on listing metrics
      
      * Update advanced_task_guide.md
      
      * Added more explanation
      
      * modify current default filter name
      
      * add new tags to tasks
      
      * remove a lingering print()
      
      * add rest of param docs, cleanup deprecated fields
      
      * push docs update
      
      * move ALL_TASKS definition location
      
      * confirm write_out.py works if no description dict passed
      
      ---------
      Co-authored-by: default avatarlintangsutawika <lintang@sutawika.com>
      79b972d6