    Add EQ-Bench as per #1459 (#1511) · c5acce0c
    Peter Bevan authored
    * Start adding eq-bench
    
    * Start adding to yaml and utils
    
    * Get metric working
    
    * Add README
    
    * Handle cases where answer is not parseable
    
    * Deal with unparseable answers and add percent_parseable metric
    
    * Update README
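
The commit message mentions handling unparseable answers and a `percent_parseable` metric. A minimal sketch of how such handling might look is below. It is not the contents of `utils.py`: the function names, the assumed `Emotion: score` answer format, the expected count of four scores, and the regex are all hypothetical and chosen only for illustration.

```python
import re
from typing import Optional

# Hypothetical pattern: assumes answers look like "Anger: 3" (an assumption,
# not taken from utils.py).
SCORE_PATTERN = re.compile(r"(\w+):\s*(\d+)")


def parse_scores(completion: str, expected: int = 4) -> Optional[dict]:
    """Return {emotion: score} when exactly `expected` distinct pairs are found, else None."""
    scores = {name: int(value) for name, value in SCORE_PATTERN.findall(completion)}
    if len(scores) != expected:
        return None  # treat anything else as unparseable
    return scores


def process_result(completion: str, expected: int = 4) -> dict:
    """Per-sample result: 100 if the answer parsed, 0 otherwise."""
    scores = parse_scores(completion, expected)
    return {
        "scores": scores,
        "percent_parseable": 100.0 if scores is not None else 0.0,
    }


def mean_percent_parseable(results: list) -> float:
    """Average the per-sample flags to get the share of parseable answers."""
    if not results:
        return 0.0
    return sum(r["percent_parseable"] for r in results) / len(results)
```

Reporting parseability as its own number keeps the main score interpretable: a low score caused by malformed output can be told apart from a low score caused by poor answers.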
utils.py 2.27 KB