"src/turbomind/kernels/online_softmax_beamsearch_kernels.h" did not exist on "720fc533da804ac3f46ee938864403e51fcd9fa7"
carlushuang authored
* add prenorm/postnorm support, refactor using generate.py
* update README
* update README
* fix format
* update some description and fix format
* update format
* format
* use non-raw for loading
* format and update n4096
* dynamic-quant ready
* update readme
* support fused dynamic-quant
* update fused-quant, with smooth
* update README
* update args
* update some based on comment
c3a4800c