"vscode:/vscode.git/clone" did not exist on "730d084f2a4c25535c9942b7d15babf2a84102d2"
-
Reed authored
Also, do Transformer inference in fp16, as well as training, when --dtype=fp16. In TF 2, layers now cannot run in multiple different dtypes, so we must use the same dtype for training and inference.
58340818