[ROCm][Feature] Enable AITER MLA attention backend to work with Eagle3 speculative decoding on ROCm (#39616)

Signed-off-by: larryli2-amd <larryli2@amd.com>
Co-authored-by: TJian <tunjian.tan@embeddedllm.com>