[ROCm][FP8][Kernel] FP8 quantization fused into Custom Paged Attention (#17139)
Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>