Commits · 0f6845ce513d7be2cbffbf9a0e56b38cbba2fdd7 · ModelZoo / ResNet50_tensorflow

21 Jun, 2019 1 commit
- Fix Transformer Perfzero issue with fp16 (#7074) · b578aee9
  Reed authored Jun 20, 2019
  
  b578aee9
20 Jun, 2019 2 commits
- Increase the number of steps for the scheduled Keras Big run. (#7069) · 018ab4b5
  Igor authored Jun 20, 2019
  
  018ab4b5
- Transformer xla and FP16 benchmarks (#7061) · 695265c8
  Toby Boyd authored Jun 19, 2019
```
* Add XLA benchmark tests FP32 only for now.

* Add FP16 XLA tests.

* FP16 only tests.
```
  695265c8
19 Jun, 2019 2 commits

Add mixed precision support to Transformer (#7011) · f8ec01ae
Reed authored Jun 19, 2019

f8ec01ae

Add XLA to transformer (#7048) · 269581dc

Toby Boyd authored Jun 19, 2019



* set default steps to 300K.

* Log flags to perfzero.

* Add XLA support to transformer

- Moved config logic to keras_utils
- Added enable_xla flag to _performance flags
- Did not refactor enable_xla flag from keras resnet due to
  reliance on calling FLAGs in estimator keras and that is
  a needed refactor for another time.

* fix g3 lint complaint.

* Refactor set config into keras_utils.

* Move flags out of main.

* pipe through enable_xla

* Update official/transformer/v2/misc.py
Co-Authored-By: Reed <reedwm@google.com>

269581dc

18 Jun, 2019 1 commit
- set default steps to 300K. (#7046) · 01c3a4f3
  Toby Boyd authored Jun 18, 2019
  
  01c3a4f3
11 Jun, 2019 1 commit

Merged commit includes the following changes: (#6993) · e7b4d364

saberkun authored Jun 10, 2019

252534787  by hongkuny<hongkuny@google.com>:

    Transformer vocab fix to strip correctly in py2

--

PiperOrigin-RevId: 252534787

e7b4d364

06 Jun, 2019 3 commits
- Include flags when reporting transformer benchmark (#6957) · 0a83bef9
  Reed authored Jun 06, 2019
  
  0a83bef9
- transformer v2: fix gfile reading · 8b18491b
  guptapriya authored Jun 05, 2019
  
  8b18491b
- Merged commit includes the following changes: (#6969) · bc557d14
  saberkun authored Jun 05, 2019
```
251762562  by hongkuny<hongkuny@google.com>:

    Fix blue score inconsistency

--

PiperOrigin-RevId: 251762562
```
  bc557d14
05 Jun, 2019 7 commits
- fix lint errors · e4bf28fc
  guptapriya authored Jun 05, 2019
  
  e4bf28fc
- fix distributed tests · 2a56bb7e
  guptapriya authored Jun 05, 2019
  
  2a56bb7e
- add strategy specific tests · d967bfae
  guptapriya authored Jun 05, 2019
  
  d967bfae
- Fix existing tests · 3aee5697
  guptapriya authored Jun 05, 2019
  
  3aee5697
- Remove metrics hack for dist strat · 6cfa81a1
  guptapriya authored Jun 04, 2019
  
  6cfa81a1
- fix lint · 99a34a6c
  guptapriya authored Jun 05, 2019
  
  99a34a6c
- Separate 1 GPU transformer benchmarks · 97c1e898
  guptapriya authored Jun 04, 2019
  
  97c1e898
31 May, 2019 2 commits
- Fix internal lint errors (#6937) · 7546a9e3
  Haoyu Zhang authored May 31, 2019
  
  7546a9e3
- Fix various lint errors (#6934) · ba415414
  Haoyu Zhang authored May 31, 2019
```
* Fix various lint errors

* Fix logging format
```
  ba415414
29 May, 2019 4 commits
- Make max_length and static_batch configurable (#6893) · ab1c1dfc
  Zhang Xunkai authored May 29, 2019
```
* Make max_length and static_batch configurable.

* Fix line length.

* Fix incorrect parameters in building eval input.

* Improve comments for readability.
```
  ab1c1dfc
- update estimator benchmarks too · e80b385a
  guptapriya authored May 29, 2019
  
  e80b385a
- Reduce max_length to 64 in static_batch cases. · 39638d66
  guptapriya authored May 28, 2019
  
  39638d66
- fix num_gpus in benchmark · 3bb5dd6c
  guptapriya authored May 28, 2019
  
  3bb5dd6c
28 May, 2019 6 commits

Add static batch benchmarks to estimator (#6886) · 383c6e30

guptapriya authored May 28, 2019

* Add static batch benchmarks to estimator 

So we can distinguish how much static vs dynamic batch matter.

* change max_length for static_batch tests

* Add flag for max length

383c6e30

Make 'off' a string literal. · 3928d481
Igor authored May 28, 2019

3928d481
Turn dist strat off for 1 GPU benchmarks · 2be9ba5b
guptapriya authored May 28, 2019

2be9ba5b

undo shuffle change · df523d91

guptapriya authored May 28, 2019

this is not going to help with current tf.data semantics. so removing it.

df523d91

Add distribute strategies to transformer. (#6883) · b9c1d1ca

Igor authored May 28, 2019

* Fixes that make transformer run.

* Remove debug print statements.

* Changed the permissions to 644.

* Fix the rest of the permissions.

* enable static batch in all benchmarks

* Restrict dist strat hack to training mode

For now we will do predict/eval without dist strat, so remove that hack in non training cases.

* Use `inputs` instead of `x` as arg name for call

Keras has different behavior based on whether the inputs are called `inputs` or not. Using `inputs` gives expected behaviors.

* Avoid extra map fn on input in dist strat case

* Update how we handle custom metrics

This new approach works with and without dist strat. The previous one didn't work with dist strat. We need to fix that but this is reasonable in meantime (b/133724664).

* Update benchmarks

* typo in metrics code

* Revert metrics change

Didn't actually work in distributed case..

b9c1d1ca

Add shuffle to dataset records · 733a752d
guptapriya authored May 28, 2019
```
This shuffling should help in getting shuffling each epoch.
```
733a752d

24 May, 2019 2 commits

Transformer v2 benchmark (#6860) · f2ea2f53

Toby Boyd authored May 24, 2019

* Moved common keras code to utils.

* Initial 1 gpu benchmark

- Aligned flags with resnet example
- removed code/features that are not super useful
- eval as part of train if bleu source/ref provided
- add exp_per_second hook

* Rename benchmark classes, pass batch-size and log_steps.

* fix docstring

* Predict done with checkpoints inline

- perfzero baseclass

* steps not epochs with smoother training loop.

* do not initialize history outside loop.

* 5000 between eval not 500

* estimator to keras.

* remove epochs var.

* use range not xrange.

* 200K steps for 1 gpu

* fix global step

f2ea2f53

Merged commit that fixes transformer's predict and eval. (#6874) · b9cab01b

Tian Lin authored May 24, 2019

* Merged commit includes the following changes:
249776315  by tianlin<tianlin@google.com>:

    Internal change

249763206  by tianlin<tianlin@google.com>:

    For TF 2.0 (related to Beam Search), expand cond dims in tf.where(cond, x, y) to make all parameters broadcastable.

--
249392724  by hongkuny<hongkuny@google.com>:

    Internal change

PiperOrigin-RevId: 249776315

* Merged commit includes the following changes:
249823043  by tianlin<tianlin@google.com>:

    Bring back v2 test for predict and eval.

--

PiperOrigin-RevId: 249823043

b9cab01b

22 May, 2019 3 commits

fix lint issues. (#6855) · 3a97b68c
Toby Boyd authored May 22, 2019

3a97b68c

Add Transformer Big Benchmarks + FP16 for other tests. (#6838) · 23f75313

Toby Boyd authored May 22, 2019

* Add big tests.

* fix super

* Add fp16, increase 8xGPU batch-sizes

* Adding the rest of the fp16 tests.

* Big accuracy test batch_perf_gpu

* fix docstrings

* add _run_and_report

* Edited docstrings

23f75313

Merge Transformer V2 to Github (#6846) · c4f34e58

Tian Lin authored May 22, 2019

* Merged commit includes the following changes:
249218656  by tianlin<tianlin@google.com>:

    Deal with imports, fix a typo and make unit tests fast.

--
249198645  by tianlin<tianlin@google.com>:

    Trivial: Remove one empty line before "import tensorflow"

--
249195490  by tianlin<tianlin@google.com>:

    Initialize Transformer TF V2 Model with Keras subclassing implementation. (Compatible with TF V1)

--
249195008  by tianlin<tianlin@google.com>:

    Internal change

249173564  by hongkuny<hongkuny@google.com>:

    Internal change

249079258  by hongkuny<hongkuny@google.com>:

    Internal change

247691534  by haoyuzhang<haoyuzhang@google.com>:

    Internal change

247533725  by haoyuzhang<haoyuzhang@google.com>:

    Internal change

247509295  by haoyuzhang<haoyuzhang@google.com>:

    Internal change

247311355  by wangtz<wangtz@google.com>:

    Internal change

247303127  by wangtz<wangtz@google.com>:

  ...

c4f34e58

11 May, 2019 1 commit

Add FP16 to transformer with benchmark tests. (#6756) · b7e97bec

Toby Boyd authored May 10, 2019

* Add FP16 and benchmarks.

* add missing run and report.

* Add loss_scale as option not included with dtype.

* move loss_scale validation under dtype conditional.

* add loss_scale to flags tested.

b7e97bec

09 May, 2019 1 commit

Transformer instrumented for benchmarking (#6734) · 40543869

Toby Boyd authored May 09, 2019

* Add first benchmark and return stats.

* Remove print statements update training steps.

* Revert print T: in print statement.

* Remove print(stats)

* add 2 gpu accuracy test for base.

* Fixed total_batch_size when using gpu + gFile deprecations.

* 8 GPU test name fix

* Add 4 and 8 GPU tests.

* typo fixes.

* Clean up test names and methods.

* bleu uncased.  docstring format fix.

40543869

07 May, 2019 1 commit
- Move tests_data to gcs and upgrade data_download. (#6722) · 0f76239b
  Toby Boyd authored May 06, 2019
  
  0f76239b
29 Apr, 2019 2 commits

Replace per_device with per_replica and PerDevice with PerReplica, because the... · b00783d7

Igor authored Apr 29, 2019

Replace per_device with per_replica and PerDevice with PerReplica, because the PerDevice concept was renamed and doesn't exist anymore. (#6693)

* Replace per_device with per_replica and PerDevice with PerReplica, because the PerReplica concept was renamed and doesn't exist anymore.

b00783d7

fixed simple typo (#6686) · d087c89b
Songyi Blair Han authored Apr 30, 2019

d087c89b

12 Apr, 2019 1 commit
- Update README.md (#6569) · b4b8c723
  Yash Katariya authored Apr 12, 2019
```
* Update README.md

* Update README.md

* Update README.md
```
  b4b8c723