- 19 Dec, 2023 1 commit
-
-
seungduk.kim.2304 authored
* Correct column names and dataset names * Remove kmmlu_general_physics.yaml and kmmlu_korean_language.yaml * Update _default_kmmlu_yaml --------- Co-authored-by:Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
-
- 18 Dec, 2023 1 commit
-
-
Baber Abbasi authored
-
- 17 Dec, 2023 1 commit
-
-
Wis Kojohnjaratkul authored
* Add IFEval task * Check and download nltk punkt if not already downloaded * Update gen_max_toks to 2048 to support "900 words+" instructions * Resolve pre-commit linting issues * Reduce max_gen_toks to 1280 to conserve token usage * Add warning message in `process_results` call for non chat-finetuned models
-
- 15 Dec, 2023 1 commit
-
-
MorishT authored
* [fix] loading dataset from hub fails when the dataset name includes '.', as the program assumes it is on the local filesystem * add FLD benchmark * Update task.py * [update] add group 'fld' * [update] rename fld -> fld_default. add explanation to the readme * Update README.md --------- Co-authored-by:Lintang Sutawika <lintang@sutawika.com>
-
- 14 Dec, 2023 2 commits
-
-
Lintang Sutawika authored
* Additional process for doc_to_choice * doc_to_choice can also parse a string
-
momotori authored
-
- 13 Dec, 2023 5 commits
-
-
Baber Abbasi authored
* remove unlabled test sets * add note to readme
-
haileyschoelkopf authored
-
haileyschoelkopf authored
-
momotori authored
-
momotori authored
-
- 11 Dec, 2023 5 commits
-
-
weijie authored
-
weijie authored
-
weijie authored
-
h-albert-lee authored
-
h-albert-lee authored
-
- 10 Dec, 2023 6 commits
-
-
h-albert-lee authored
-
h-albert-lee authored
-
h-albert-lee authored
-
h-albert-lee authored
-
h-albert-lee authored
-
h-albert-lee authored
-
- 08 Dec, 2023 1 commit
-
-
h-albert-lee authored
-
- 07 Dec, 2023 8 commits
-
-
lintangsutawika authored
-
Hailey Schoelkopf authored
-
Hailey Schoelkopf authored
-
Hailey Schoelkopf authored
-
Hailey Schoelkopf authored
-
Hailey Schoelkopf authored
-
Hailey Schoelkopf authored
-
Lintang Sutawika authored
BBH cot fewshot already has fewshot examples in the description. So num_fewshot needs to be set to 0 so that users won't mistakenly set other num_fewshot values.
-
- 04 Dec, 2023 2 commits
-
-
lintangsutawika authored
-
lintangsutawika authored
-
- 28 Nov, 2023 7 commits
-
-
Stella Biderman authored
-
Stella Biderman authored
-
Stella Biderman authored
Changing the name since this is not actually the big bench format.
-
lintangsutawika authored
-
lintangsutawika authored
-
lintangsutawika authored
-
lintangsutawika authored
-