[Fix] MLA only supports decode-only full CUDAGraph capture. Make sure all cudagraph capture sizes <= max_num_seq.
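A minimal sketch of the constraint described above, not the code from this commit. It assumes each decoding sequence contributes exactly one token per step, so a decode-only batch can never exceed `max_num_seq` tokens and any larger capture size would never be replayed. The helper names `clamp_capture_sizes` and `check_capture_sizes` are hypothetical and exist only for illustration.

```python
from typing import Iterable, List


def clamp_capture_sizes(capture_sizes: Iterable[int], max_num_seq: int) -> List[int]:
    """Drop capture sizes that a decode-only batch can never reach.

    Assumption: one token per sequence per decode step, so the largest
    decode-only batch holds max_num_seq tokens.
    """
    if max_num_seq < 1:
        raise ValueError("max_num_seq must be a positive integer")
    # Deduplicate, discard non-positive sizes, and keep sizes within the limit.
    return sorted({s for s in capture_sizes if 0 < s <= max_num_seq})


def check_capture_sizes(capture_sizes: Iterable[int], max_num_seq: int) -> None:
    """Raise if any capture size exceeds max_num_seq (hypothetical validator)."""
    too_large = sorted(s for s in capture_sizes if s > max_num_seq)
    if too_large:
        raise ValueError(
            f"cudagraph capture sizes {too_large} exceed max_num_seq={max_num_seq}"
        )


if __name__ == "__main__":
    requested = [1, 2, 4, 8, 16, 32]
    usable = clamp_capture_sizes(requested, max_num_seq=8)
    check_capture_sizes(usable, max_num_seq=8)  # passes after clamping
    print(usable)
```

Clamping silently adjusts the list, while the validator fails fast on a misconfiguration; which behavior fits depends on whether the capture sizes come from user configuration or from defaults.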