[Feat] Support FlashMLA backend with MTP and FP8 KV cache (#6109)
Co-authored-by: Yingyi <yingyihuang2000@outlook.com>
Co-authored-by: neiltian <neiltian@tencent.com>
Co-authored-by: lukec <118525388+sleepcoo@users.noreply.github.com>
Co-authored-by: kexueyu <kexueyu@tencent.com>
Co-authored-by: vincentmeng <vincentmeng@tencent.com>
Co-authored-by: pengmeng <pengmeng@tencent.com>