[Bugfix] Correct num_q_heads on DCP for Flashinfer backends (#29487)
Signed-off-by:Jingchun Gao <gaojingchun1@huawei.com> Signed-off-by:
Jingchun Gao <63247409+gjc0824@users.noreply.github.com> Co-authored-by:
Jingchun Gao <gaojingchun1@huawei.com>
Showing
Please register or sign in to comment