[DCP] Support Decode Context Parallel (DCP) for GQA with Flashinfer (#25438)

Signed-off-by: gaojc <1055866782@qq.com>
Signed-off-by: Jingchun Gao <gaojingchun1@huawei.com>
Signed-off-by: Jingchun Gao <63247409+gjc0824@users.noreply.github.com>
Signed-off-by: QiuChunshuo <qiuchunshuo@huawei.com>
Co-authored-by: gaojingchun (A) <g00955623@china.huawei.com>
Co-authored-by: Jingchun Gao <gaojingchun1@huawei.com>
Co-authored-by: QiuChunshuo <qiuchunshuo@huawei.com>