Multiply lr scheduler steps by `num_processes`. (#3983)
* Multiply lr scheduler steps by `num_processes`. * Stop multiplying steps by gradient accumulation.
Showing
Please register or sign in to comment
* Multiply lr scheduler steps by `num_processes`. * Stop multiplying steps by gradient accumulation.