attention_mask_func cleanup See merge request ADLR/megatron-lm!212
Attach a file by drag & drop or click to upload