[PyTorch] Make FP8 MHA work with RoPE when CP is on (#1297)
* Let fp8 mha work with rope when cp is on Signed-off-by:Xin Yao <xiny@nvidia.com> * fix and update ut Signed-off-by:
Xin Yao <xiny@nvidia.com> --------- Signed-off-by:
Xin Yao <xiny@nvidia.com>
Showing
Please register or sign in to comment