With float16, always use LossScaleOptimizer.
Before, it was too easy to accidentally forget to set runtime.loss_scale, which had to always be done if mixed precision is used, otherwise the model would converge to worse accuracy. Now, all that needs to be done to use mixed precision is to set runtime.mixed_precision_dtype=float16. PiperOrigin-RevId: 383767033
Showing
Please register or sign in to comment