"docs/tutorials/deployment.md" did not exist on "5b3792fc3ef9ab6a6f8f30634ab2e52fb0941af3"
Merge branch 'rescaling' into 'main'
Add support for signal-based dynamic checkpointing See merge request ADLR/megatron-lm!361
Showing
Please register or sign in to comment