Add EQ-Bench as per #1459 (#1511)
* Start adding eq-bench
* Start adding to yaml and utils
* Get metric working
* Add README
* Handle cases where answer is not parseable
* Deal with unparseable answers and add percent_parseable metric
* Update README
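
For illustration only, the idea behind a percent_parseable metric can be sketched as follows. This is a minimal sketch, not the code from this commit: the function names `parse_ratings` and `percent_parseable` and the assumed "Emotion: score" answer format are hypothetical.

```python
import re
from typing import Dict, List, Optional


def parse_ratings(text: str) -> Optional[Dict[str, float]]:
    """Hypothetical parser: pull 'Emotion: score' pairs out of a model answer.

    Returns None when no pair is found, i.e. the answer is unparseable.
    """
    pairs = re.findall(r"(\w+):\s*(\d+(?:\.\d+)?)", text)
    if not pairs:
        return None
    return {name: float(score) for name, score in pairs}


def percent_parseable(responses: List[str]) -> float:
    """Share of responses (0-100) that parse_ratings could parse."""
    if not responses:
        return 0.0
    parsed = sum(parse_ratings(r) is not None for r in responses)
    return 100.0 * parsed / len(responses)
```

Reporting this share alongside the main score makes it visible when a low result comes from malformed answers rather than from poor ratings.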