- 10 Mar, 2024 1 commit
-
-
Hisham Alyahya authored
* Support jinja templating for "description"
* Update task_guide.md
* Update lm_eval/api/task.py
* fix format?
* whitespace errors
* fix whitespace
* fix bad variable reference
---------
Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
Co-authored-by: haileyschoelkopf <hailey@eleuther.ai>
-
- 01 Feb, 2024 1 commit
-
-
Hailey Schoelkopf authored
* allow tasks to specify printed fewshot val
* fix to belebele
* update metadata field's documentation
-
- 18 Jan, 2024 2 commits
-
-
kwrobel.eth authored
-
Danielle Pintz authored
-
- 15 Jan, 2024 1 commit
-
-
Lintang Sutawika authored
* benchmark yamls allow minor edits of already registered tasks
* add documentation
* removed print
-
- 19 Dec, 2023 1 commit
-
-
Paul McCann authored
Co-authored-by: Paul O'Leary McCann <polm@dampfkraft.com>
-
- 14 Dec, 2023 1 commit
-
-
Lintang Sutawika authored
* doc_to_decontamination_query can use function
* add option for doc_to_decontamination_query to follow doc_to_text
* added documentation for doc_to_decontamination_query
* adjust description
* format
-
- 28 Nov, 2023 1 commit
-
-
lintangsutawika authored
-
- 03 Nov, 2023 1 commit
-
-
haileyschoelkopf authored
-
- 18 Oct, 2023 1 commit
-
-
haileyschoelkopf authored
-
- 08 Oct, 2023 1 commit
-
-
baberabb authored
-
- 08 Aug, 2023 1 commit
-
-
lintangsutawika authored
-
- 18 Jul, 2023 1 commit
-
-
haileyschoelkopf authored
-
- 11 Jul, 2023 1 commit
-
-
lintangsutawika authored
-
- 03 Jul, 2023 1 commit
-
-
haileyschoelkopf authored
-
- 15 Jun, 2023 1 commit
-
-
lintangsutawika authored
-
- 12 Jun, 2023 5 commits
-
-
haileyschoelkopf authored
-
Lintang Sutawika authored
-
haileyschoelkopf authored
-
haileyschoelkopf authored
-
Hailey Schoelkopf authored
* add wip gsm8k yaml
* cleanup tasks dir
* push gsm8k yaml changes
* rename gpt2.py
* add updated gsm8k, triviaqa baseline
* add new cot yaml
* allow for multiple filter pipelines, new filter types
* updated gsm8k + sampling gen configs
* cleanup self-consistency yaml
* push outline for advanced docs
* push docs checklist
* switch to inheritance for many tasks
* acc_norm and acc_mutual_info fixed
* fix missing newline in error msg
* remove many .py tasks
* updated GSM8k
* added more doc
* Update advanced_task_guide.md Added list of parameters
* Update advanced_task_guide.md
* Added details on listing metrics
* Update advanced_task_guide.md
* Added more explanation
* modify current default filter name
* add new tags to tasks
* remove a lingering print()
* add rest of param docs, cleanup deprecated fields
* push docs update
* move ALL_TASKS definition location
* confirm write_out.py works if no description dict passed
---------
Co-authored-by: lintangsutawika <lintang@sutawika.com>
-