Add a new task GPQA (the part without CoT) (#1434)
* add new task GPQA_n_shot
* add new task GPQA_zeroshot
* correct GPQA_zeroshot filename
* Add randomly shuffle choices
* Correct missing parentheses
* delete wrong tasks
* Add README
* Update lm_eval/tasks/gpqa/zeroshot/_gpqa_zeroshot_yaml
* Update lm_eval/tasks/gpqa/n_shot/utils.py
* Update lm_eval/tasks/gpqa/n_shot/utils.py
* Update lm_eval/tasks/gpqa/README.md
* placate linter
* linter
---------
Co-authored-by:
Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
Showing
lm_eval/tasks/gpqa/README.md
0 → 100644
Please register or sign in to comment