Checkpoint a set number of individual Transformer layers See merge request ADLR/megatron-lm!301
Attach a file by drag & drop or click to upload