Merge branch 'slym/partial_chpt' into 'main'
Checkpoint a set number of individual Transformer layers See merge request ADLR/megatron-lm!301
Showing
Please register or sign in to comment
Checkpoint a set number of individual Transformer layers See merge request ADLR/megatron-lm!301