-
larryli2-amd authored
[ROCm][Feature] Enable AITER MLA attention backend to work with Eagle3 speculative decoding on ROCm (#39616) Signed-off-by:
larryli2-amd <larryli2@amd.com> Co-authored-by:
TJian <tunjian.tan@embeddedllm.com>
7243e02a
[ROCm][Feature] Enable AITER MLA attention backend to work with Eagle3 speculative decoding on ROCm (#39616) Signed-off-by:larryli2-amd <larryli2@amd.com> Co-authored-by:
TJian <tunjian.tan@embeddedllm.com>