Longbench v2 (#3338)
* initial commit
* change to acc
* fix long-dialogue tasks
* fix versioning
* more fixes
* fix naming
* fix naming
* more renaming
* maybe a dataset fix
* fix dataset and use new dataset schema
* add README
* fix prompt and dataset naming
* lint
* remove utils.py
* lint
* more linting
* fix typo
* fix naming
* add longbenchv2
---------
Co-authored-by:
Baber <baber@hey.com>
Showing
Please register or sign in to comment