patch for megatron v0.12.0.rc3. When tp > 1 and combined_1f1b = true, megatron...
patch for megatron v0.12.0.rc3. When tp > 1 and combined_1f1b = true, megatron (main) cannot execute properly
Showing
Please register or sign in to comment
patch for megatron v0.12.0.rc3. When tp > 1 and combined_1f1b = true, megatron (main) cannot execute properly