"test/git@developer.sourcefind.cn:jerrrrry/infinicore.git" did not exist on "601defcb6dc752010294f8d57fa6e2289e52eac6"
Fix attention mask type for Flash Attention + CP + THD (#1354)
* always have padding mask type for both flash and fused attentions Signed-off-by:Xiaowei Ren <xren@nvidia.com> * remove an redundant assert Signed-off-by:
Xiaowei Ren <xren@nvidia.com> --------- Signed-off-by:
Xiaowei Ren <xren@nvidia.com>
Showing
Please register or sign in to comment