Fix dispatch, add wd after momentum option
Fix dispatch where we have a parameter group with multiple combinations of types
Optionally apply weight decay after momentum
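
To illustrate the weight-decay option, here is a minimal scalar sketch of one SGD-with-momentum step. It is not the code from this commit: the function name `sgd_step` and the flag name `wd_after_momentum` are hypothetical, and dampening and Nesterov momentum are left out. It only shows how the two orderings differ: with decay applied before momentum, the decay term is accumulated in the momentum buffer; with decay applied after momentum, it is added to the final update and never enters the buffer.

```python
def sgd_step(param, grad, buf, lr=0.1, momentum=0.9,
             weight_decay=0.01, wd_after_momentum=False):
    """One scalar SGD-with-momentum step. Returns (new_param, new_buf).

    Hypothetical sketch: names and defaults are assumptions, not the
    commit's actual interface.
    """
    if weight_decay != 0.0 and not wd_after_momentum:
        # Decay before momentum: fold the decay term into the gradient,
        # so it is accumulated in the momentum buffer.
        grad = grad + weight_decay * param

    # Momentum buffer update (no dampening in this sketch).
    buf = momentum * buf + grad
    update = buf

    if weight_decay != 0.0 and wd_after_momentum:
        # Decay after momentum: add the decay term to the final update
        # only; the momentum buffer stays free of it.
        update = update + weight_decay * param

    return param - lr * update, buf


# Run two steps with each ordering from the same starting point.
p_before, b_before = 1.0, 0.0
p_after, b_after = 1.0, 0.0
for _ in range(2):
    p_before, b_before = sgd_step(p_before, 0.5, b_before)
    p_after, b_after = sgd_step(p_after, 0.5, b_after,
                                wd_after_momentum=True)
```

Starting from a zero buffer, the first step moves the parameter by the same amount under both orderings, but the buffers already differ, so the parameters diverge from the second step onward.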