Use Tensor Core Decode when gqa group size >= 4 (#8624)
Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Showing
Please register or sign in to comment
Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>