"tests/triton_tests/info_mlp_autocast.jsonl" did not exist on "51f8bb713368ef00d48496ce76c0428e976236a9"
carlushuang authored
* add prenorm/postnorm support, refactor using generate.py
* update README
* update README
* fix format
* update some description and fix format
* update format
* format
* use non-raw for loading
* format and update n4096
* dynamic-quant ready
* update readme
* support fused dynamic-quant
* update fused-quant, with smooth
* update README
* update args
* update some based on comment
c3a4800c