Longbench v2 (#3338)
* initial commit
* change to acc
* fix long-dialogue tasks
* fix versioning
* more fixes
* fix naming
* fix naming
* more renaming
* maybe a dataset fix
* fix dataset and use new dataset schema
* add README
* fix prompt and dataset naming
* lint
* remove utils.py
* lint
* more linting
* fix typo
* fix naming
* add longbenchv2
---------
Co-authored-by:
Baber <baber@hey.com>
Showing
This source diff could not be displayed because it is too large. You can view the blob instead.
Please register or sign in to comment