Distributed optimizer See merge request ADLR/megatron-lm!408
Attach a file by drag & drop or click to upload