[PyTorch] Let Fused RoPE support CP with THD format (#1238)
* Let Fused RoPE support THD with CP

Signed-off-by: Xin Yao <xiny@nvidia.com>

* add comment

Signed-off-by: Xin Yao <xiny@nvidia.com>

---------

Signed-off-by: Xin Yao <xiny@nvidia.com>
Co-authored-by: Xiaowei Ren <103958965+xrennvidia@users.noreply.github.com>