"git@developer.sourcefind.cn:gaoqiong/composable_kernel.git" did not exist on "49d5af1002c37e19db071271b7560ae6c64fefd5"
[CK_TILE]naive attn support FP8 KVCache quant (#1747)
* quant
* fix bug
* simple smoothquant after softmax
* update kv-quant
* update stride
* fix fp8-pertoken-kvcache
* update int8/fp8 quant support
---------
Co-authored-by: so <a.com>
Co-authored-by:
Po Yen Chen <PoYen.Chen@amd.com>
Showing
This diff is collapsed.
Please register or sign in to comment