Move torch.compile() wrapping after DDP/FSDP wrapping to ensure correct graph...
Move torch.compile() wrapping after DDP/FSDP wrapping to ensure correct graph breaks during training (#22279)
Showing
Please register or sign in to comment
Move torch.compile() wrapping after DDP/FSDP wrapping to ensure correct graph breaks during training (#22279)