Merge branch 'slym/jit_warmup' into 'main'
JIT function warmups to (1) match fprop and recompute results and (2) remove grad_enable

See merge request ADLR/megatron-lm!404