Merge branch 'memory_optimization' into 'master'
memory optimization in mpu cross entropy

See merge request ADLR/megatron-lm!32