"tests/test_tokenization_mbart50.py" did not exist on "9336086ab5d232cccd9512333518cf4299528882"
Move torch.compile() wrapping after DDP/FSDP wrapping to ensure correct graph...
Move torch.compile() wrapping after DDP/FSDP wrapping to ensure correct graph breaks during training (#22279)
Showing
Please register or sign in to comment