[FIX] Always support TP > 4 for FP4 Gemm (#31099)
Signed-off-by: dafrimi <dafrimi@nvidia.com>
Co-authored-by: root <root@gpu-51.slurm-workers-slurm.slurm.svc.cluster.local>