[DCP] Support Decode Context Parallel (DCP) for GQA with FlashAttention (#24864)
Signed-off-by:yuanyongjie.yyj <yuanyongjie.yyj@antgroup.com> Signed-off-by:
FENP <32334296+FENP@users.noreply.github.com> Signed-off-by:
Jaya Yuan <yuanyongjie.yyj@antgroup.com>
Showing
Please register or sign in to comment