"README-EN.md" did not exist on "b6ada888e749fd4c95e6dee52c4748986aea6f24"
[longbench] fix metric calculation (#2983)
* use all answers * use middle truncation * maybe fix classification score * strip classification preds * [vllm] remove stop tokens post-hoc * strip all preds * pacify pre-commit * start on truncation utility * add to readme * add a footgun doc * fix newline in yaml templates * do not strip code_sim preds! * fix pre-commit config * fix instruction warning * add not to longbench readme
Showing
Please register or sign in to comment