New healthcare benchmark: careqa (#2714)
* New healthcare benchmark: careqa * LAUNCH_MN5_ACC <python main.py --config config/mn5.yml --models Llama-3.2-1B-Instruct --tasks careqa_open --num_fewshot 0> * Add fixes, READMES, and remove task_list.txt * pre-commit passed, add formatting updates; add nanmean agg_metric * Fix import error. * Wrapped imports in try excepts * Wrapped imports in try excepts; also metrics to catch bert_score import error * Try except to catch ImportErrors as well * use np.nan * pre-commit --------- Co-authored-by:PabloAgustin <pablo.martin@bsc.es> Co-authored-by:
Baber <baber@hey.com>
Showing
This diff is collapsed.
This diff is collapsed.
Please register or sign in to comment