1. 14 Oct, 2025 1 commit
    • Janna's avatar
      Longbench v2 (#3338) · 655718d0
      Janna authored
      
      
      * initial commit
      
      * change to acc
      
      * fix long-dialogue tasks
      
      * fix versioning
      
      * more fixes
      
      * fix naming
      
      * fix naming
      
      * more renaming
      
      * maybe a dataset fix
      
      * fix dataset and use new dataset schema
      
      * add README
      
      * fix prompt and dataset naming
      
      * lint
      
      * remove utils.py
      
      * lint
      
      * more linting
      
      * fix typo
      
      * fix naming
      
      * add longbenchv2
      
      ---------
      Co-authored-by: default avatarBaber <baber@hey.com>
      655718d0