official/transformer/v2/transformer.py · b9c1d1ca228819c212d0f1b227bebd3a0fb3e263 · ModelZoo / ResNet50_tensorflow

Add distribute strategies to transformer. (#6883) · b9c1d1ca

Igor authored May 28, 2019

* Fixes that make transformer run.

* Remove debug print statements.

* Changed the permissions to 644.

* Fix the rest of the permissions.

* Enable static batch in all benchmarks

* Restrict dist strat hack to training mode

For now we will do predict/eval without dist strat, so remove that hack in non-training cases.
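A minimal sketch of gating on the mode, assuming the strategy object exposes a `scope()` context manager. `maybe_strategy_scope` and `FakeStrategy` are hypothetical names for illustration only and are not taken from transformer.py; the hack itself is not shown here.

```python
import contextlib


def maybe_strategy_scope(strategy, is_training):
    """Return the strategy's scope only when training, else a no-op context."""
    if is_training and strategy is not None:
        return strategy.scope()
    # Predict/eval: run without any distribution strategy.
    return contextlib.nullcontext()


class FakeStrategy:
    """Hypothetical stand-in that counts how often its scope is entered."""

    def __init__(self):
        self.entered = 0

    @contextlib.contextmanager
    def scope(self):
        self.entered += 1
        yield self


strategy = FakeStrategy()
with maybe_strategy_scope(strategy, is_training=True):
    pass  # build and fit the model here
with maybe_strategy_scope(strategy, is_training=False):
    pass  # build the predict/eval model here, outside the strategy
```

Only the training branch enters the scope, so predict and eval code paths never see the strategy.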

* Use `inputs` instead of `x` as arg name for call

Keras has different behavior based on whether the inputs are called `inputs` or not. Using `inputs` gives expected behaviors.
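The commit does not say which behaviors differ. As a rough illustration of how a framework could key off the parameter name at all, here is a sketch using only `inspect`. `SketchLayer` and the "tracked"/"untracked" labels are hypothetical and do not reproduce Keras internals.

```python
import inspect


class SketchLayer:
    """Hypothetical stand-in for a Keras-style layer base class (not Keras)."""

    def __call__(self, *args, **kwargs):
        # Parameter names of the bound `call` method, excluding `self`.
        params = list(inspect.signature(self.call).parameters)
        # Assumption for illustration: special handling applies only when
        # the first parameter of `call` is literally named `inputs`.
        if params and params[0] == "inputs":
            return ("tracked", self.call(*args, **kwargs))
        return ("untracked", self.call(*args, **kwargs))


class WithX(SketchLayer):
    def call(self, x, training=False):
        return x


class WithInputs(SketchLayer):
    def call(self, inputs, training=False):
        return inputs


print(WithX()(1)[0])       # takes the fallback path
print(WithInputs()(1)[0])  # takes the `inputs` path
```

Both subclasses compute the same result; only the argument name changes which path the base class takes, which is why the rename can matter.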

* Avoid extra map fn on input in dist strat case

* Update how we handle custom metrics

This new approach works with and without dist strat. The previous one didn't work with dist strat. We need to fix that, but this is reasonable in the meantime (b/133724664).

* Update benchmarks

* Fix typo in metrics code

* Revert metrics change

The change didn't actually work in the distributed case.


transformer.py 19.3 KB
