1. 06 Mar, 2024 1 commit
    • Peter Bevan's avatar
      Add EQ-Bench as per #1459 (#1511) · c5acce0c
      Peter Bevan authored
      * Start adding eq-bench
      
      * Start adding to yaml and utils
      
      * Get metric working
      
      * Add README
      
      * Handle cases where answer is not parseable
      
      * Deal with unparseable answers and add percent_parseable metric
      
      * Update README
      c5acce0c