    Longbench v2 (#3338) · 655718d0
    Janna authored
    
    
    * initial commit
    
    * change to acc
    
    * fix long-dialogue tasks
    
    * fix versioning
    
    * more fixes
    
    * fix naming
    
    * fix naming
    
    * more renaming
    
    * maybe a dataset fix
    
    * fix dataset and use new dataset schema
    
    * add README
    
    * fix prompt and dataset naming
    
    * lint
    
    * remove utils.py
    
    * lint
    
    * more linting
    
    * fix typo
    
    * fix naming
    
    * add longbenchv2
    
    ---------
    Co-authored-by: Baber <baber@hey.com>
README.md 121 KB