- 03 Jul, 2024 2 commits
-
-
haileyschoelkopf authored
-
Nathan Habib authored
* adds leaderboard tasks
* Delete lm_eval/tasks/leaderboard/leaderboard_chat_template.yaml
* add readme
* Delete lm_eval/tasks/leaderboard/mmlu_pro/mmlu_pro_chat_template.yaml
* modify readme
* fix bbh task
* fix bbh salient task
* modify the readme
* Delete lm_eval/tasks/leaderboard/ifeval/README.md
* Delete lm_eval/tasks/leaderboard/math/README.md
* add leaderboard to the tasks repertory
* add announcement about new leaderboard tasks
* linting
* Update README.md Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
* installs ifeval dependency in new_task github workflow
---------
Co-authored-by: Nathan Habib <nathan.habib@huggingface.com>
Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
-
- 01 May, 2024 1 commit
-
-
Chujie Zheng authored
-
- 11 Mar, 2024 1 commit
-
-
Hailey Schoelkopf authored
-
- 11 Jan, 2024 1 commit
-
-
Hailey Schoelkopf authored
* fix incorrect lookback protections
* bump generate_until task versions
-
- 21 Dec, 2023 1 commit
-
-
Hailey Schoelkopf authored
* change version field formatting in metadata
* mention versioning in new task guide
* add instructions for changelog
* run linters
-
- 17 Dec, 2023 1 commit
-
-
Wis Kojohnjaratkul authored
* Add IFEval task
* Check and download nltk punkt if not already downloaded
* Update gen_max_toks to 2048 to support "900 words+" instructions
* Resolve pre-commit linting issues
* Reduce max_gen_toks to 1280 to conserve token usage
* Add warning message in `process_results` call for non chat-finetuned models
-