1. 14 Oct, 2025 2 commits
    • remove duplicate tags/groups (#3343) · ad506a13
      Baber Abbasi authored
    • Longbench v2 (#3338) · 655718d0
      Janna authored
      
      
      * initial commit
      
      * change to acc
      
      * fix long-dialogue tasks
      
      * fix versioning
      
      * more fixes
      
      * fix naming
      
      * fix naming
      
      * more renaming
      
      * maybe a dataset fix
      
      * fix dataset and use new dataset schema
      
      * add README
      
      * fix prompt and dataset naming
      
      * lint
      
      * remove utils.py
      
      * lint
      
      * more linting
      
      * fix typo
      
      * fix naming
      
      * add longbenchv2
      
      ---------
      Co-authored-by: Baber <baber@hey.com>