"...stochastic_training/ondisk-dataset-specification.rst" did not exist on "65d83ad70c664359139870068aa8357a6553c0e6"
-
Reed authored
Before, there was a global default loss scale for all models. Currently, only resnet uses loss scaling, but this will be useful once more models support it.
42a8af1d