1. 04 Oct, 2025 1 commit
    • Baber Abbasi's avatar
      Fewshot refactor (#3227) · 003e5852
      Baber Abbasi authored
      
      
      * overhaul `ContextSampler`
      
      * refactor masakhapos
      
      * move multi_target to `exact_match`
      
      * remove doc_to_choice from `boolq-seq2seq`
      
      * remove doc_to_choice in generation process_results
      
      * Remove unused `doc_to_choice` and fix superglue whitespaces
      
      * require multiple_inputs and multiple_targets to be explicitly set in taskconfig
      
      * fix copa; better logging in task init
      
      * fix doc_to_target to return int rather than str (deprecated)
      
      * fix processing regression; recursively parse lists fron template
      
      * remove redundant jinja parsing logic
      
      * remove promptsource
      
      * for multiple_inputs use `doc_to_text: list[str]``
      
      * Refactor `ContextSampler` `fewshot_context`
      
      * fix multiple_input context
      
      * fix `target_delimiter` with `gen_prefix`
      
      * `doc_to_text` is list for multiple_inputs
      
      * Refactor `count_bytes` and `count_words` methods to `@staticmethod`
      
      * make has_*(train/test/validation) to properties
      
      * remove `multi_target` `generate_until`
      
      * `fix doc_to_target/multiple_targets handling add tests
      
      * rename `multi_target` to `multiple_targets`
      
      * evalaute list when multiple targets
      
      * allow doc_to_target to return list
      
      * Remove gen_prefix space and add warning (#3239)
      
      * Remove gen_prefix space and add warning
      
      * fix null gen_prefix bug again
      
      * use git tests
      
      ---------
      Co-authored-by: default avatarBoaz Ben-Dov <bendboaz@gmail.com>
      003e5852
  2. 25 Sep, 2025 5 commits
  3. 22 Sep, 2025 1 commit
    • priverabsc's avatar
      Add eqbench tasks in Spanish and Catalan (#3168) · de496b80
      priverabsc authored
      * Add eqbench tasks in Spanish and Catalan
      
      * Incremented catalan_bench and spanish_bench versions. Added 'multilingual' folder inside 'eq_bench' and moved the eqbench_ca and eqbench_es .yaml to that folder. Updated the tasks README with eqbench_es and eqbench_ca, expliciting inside each description both the Hugging Face link and the translation method.
      
      * Fixed tasks table.
      
      * remove test_task.sh and results folder
      
      * Add utils.py to multilingual folder
      de496b80
  4. 21 Sep, 2025 5 commits
  5. 08 Sep, 2025 1 commit
  6. 02 Sep, 2025 4 commits
  7. 27 Aug, 2025 3 commits
  8. 26 Aug, 2025 1 commit
    • Janna's avatar
      Support for AIME dataset (#3248) · 5ac7cdf8
      Janna authored
      * add AIME tasks
      
      * standardize the repeats
      
      * fix task naming
      
      * aime25 only has test set
      
      * edit readme
      
      * add utils
      
      * standardize
      
      * fix case sensitivity
      
      * repeat once
      
      * lint
      
      * more linting
      
      * lint huggingface.py
      5ac7cdf8
  9. 25 Aug, 2025 3 commits
  10. 23 Aug, 2025 1 commit
  11. 22 Aug, 2025 1 commit
  12. 21 Aug, 2025 6 commits
  13. 08 Aug, 2025 1 commit
  14. 04 Aug, 2025 4 commits
  15. 26 Jul, 2025 3 commits