[FIX] not training when epoch is small (#3006)
* solving bug where for small epochs and large gradient_accumulation_steps we never train * black formatting * no need to change these files
Showing
Please register or sign in to comment
* solving bug where for small epochs and large gradient_accumulation_steps we never train * black formatting * no need to change these files