Merge branch 'optimize_initialization' into 'main'
Added GPU initialization and an option to avoid master values for initialization.
See merge request ADLR/megatron-lm!105