"tests/test_utils/test_box3d.py" did not exist on "3c166ae5b1ecdb7434995f9c1377556f01cde7eb"
[longbench] fix metric calculation (#2983)
* use all answers * use middle truncation * maybe fix classification score * strip classification preds * [vllm] remove stop tokens post-hoc * strip all preds * pacify pre-commit * start on truncation utility * add to readme * add a footgun doc * fix newline in yaml templates * do not strip code_sim preds! * fix pre-commit config * fix instruction warning * add not to longbench readme
Showing
docs/footguns.md
0 → 100644
Please register or sign in to comment