Commits · e0e6d981bc2023ceec3fc5f73d3c1266ebb9d97e · ModelZoo / ResNet50_tensorflow

19 Jun, 2019 2 commits

Add XLA to transformer (#7048) · 269581dc

Toby Boyd authored Jun 19, 2019



* set default steps to 300K.

* Log flags to perfzero.

* Add XLA support to transformer

- Moved config logic to keras_utils
- Added enable_xla flag to _performance flags
- Did not refactor enable_xla flag from keras resnet due to
  reliance on calling FLAGs in estimator keras and that is
  a needed refactor for another time.

* fix g3 lint complaint.

* Refactor set config into keras_utils.

* Move flags out of main.

* pipe through enable_xla

* Update official/transformer/v2/misc.py
Co-Authored-By: Reed <reedwm@google.com>

269581dc

Use PerfZeroBenchmark and log Flags. (#7052) · d3610769
Toby Boyd authored Jun 19, 2019

d3610769

14 Jun, 2019 2 commits
- Add ResNet tests for NHWC and layout optimizer off. (#7018) · 097c8051
  Toby Boyd authored Jun 14, 2019
```
* layout off for some tests and channels last.

* 8 gpu tests channels_last

* more layout off tests.
```
  097c8051
- Resnet56 forced eager benchmark and no_dist_strat eager accuracy test. (#7017) · 7b329985
  Toby Boyd authored Jun 14, 2019
```
* Add 1 gpu force_eager benchmark

* Add accuracy for no dist strat eager

* remvove return.
```
  7b329985
13 Jun, 2019 1 commit
- Add run_eagerly and end-to-end test. (#7012) · d8a09064
  Toby Boyd authored Jun 13, 2019
  
  d8a09064
10 Jun, 2019 1 commit
- Code cleanup. (#6989) · f7a44074
  rxsang authored Jun 10, 2019
  
  f7a44074
06 Jun, 2019 3 commits
- Have each model provide a default loss scale. (#6930) · 42a8af1d
  Reed authored Jun 06, 2019
```
Before, there was a global default loss scale for all models. Currently, only resnet uses loss scaling, but this will be useful once more models support it.
```
  42a8af1d
- Add pure eager fp16 benchmark (#6977) · ce797486
  Haoyu Zhang authored Jun 06, 2019
  
  ce797486
- Modify tweaked tests for better performance in no cloning mode (#6965) · 152baba5
  Haoyu Zhang authored Jun 05, 2019
```
* Modify tweaked tests for better performance in no cloning mode

* Tweak trivial models
```
  152baba5
05 Jun, 2019 1 commit
- Add more optional_next tests (#6955) · 346b570f
  rxsang authored Jun 04, 2019
  
  346b570f
03 Jun, 2019 2 commits

Do not use XLA in warmup tests (#6951) · dcdc45bd

Haoyu Zhang authored Jun 03, 2019

Because we run warmup tests in all real data benchmarks, XLA bugs will cause non-XLA tests to fail as well.

dcdc45bd

Resnet mlperf like (#6942) · 69e2e3f6

Toby Boyd authored Jun 03, 2019

* Add mlperf like test.

* Final comments.

* docstring wording tweak.

* non-tweaked version

69e2e3f6

31 May, 2019 3 commits
- Fix internal lint errors (#6937) · 7546a9e3
  Haoyu Zhang authored May 31, 2019
  
  7546a9e3
- Update the is_v2_0 function to work internally as well. (#6935) · ab53cb74
  Goldie Gadde authored May 31, 2019
  
  ab53cb74
- Support pure eager execution in ResNet50 (#6929) · f6c2d9f8
  Haoyu Zhang authored May 30, 2019
```
* Support pure eager execution in ResNet50

* Use smaller batch size
```
  f6c2d9f8
29 May, 2019 1 commit
- Add tweaked cloning tests (#6916) · ab993a21
  Haoyu Zhang authored May 29, 2019
  
  ab993a21
28 May, 2019 2 commits
- Remove assert_broadcastable monkey patch (#6901) · 1d16f473
  Haoyu Zhang authored May 28, 2019
  
  1d16f473
- Use more warmup steps for 96 core tests (#6881) · 8b52cd23
  Haoyu Zhang authored May 28, 2019
```
* Run different numbers of steps on different platforms

* Add new tests for delayed performance measurement
```
  8b52cd23
24 May, 2019 3 commits

Add a graph optional_next Reset benchmark. (#6876) · 49eaaaf2
rxsang authored May 24, 2019
```
* Add a graph optional_next Reset benchmark.

* Fix lint error.
```
49eaaaf2
Moved common keras code to utils. (#6859) · 3254cabb
Toby Boyd authored May 24, 2019

3254cabb

Merged commit that fixes transformer's predict and eval. (#6874) · b9cab01b

Tian Lin authored May 24, 2019

* Merged commit includes the following changes:
249776315  by tianlin<tianlin@google.com>:

    Internal change

249763206  by tianlin<tianlin@google.com>:

    For TF 2.0 (related to Beam Search), expand cond dims in tf.where(cond, x, y) to make all parameters broadcastable.

--
249392724  by hongkuny<hongkuny@google.com>:

    Internal change

PiperOrigin-RevId: 249776315

* Merged commit includes the following changes:
249823043  by tianlin<tianlin@google.com>:

    Bring back v2 test for predict and eval.

--

PiperOrigin-RevId: 249823043

b9cab01b

23 May, 2019 3 commits

Add a test enabling get_next_as_optional behavior. (#6862) · 92bad0d2

rxsang authored May 23, 2019

* Add a test enabling get_next_as_optional behavior.

* Remove repeated flag.

* Remove trailing space.

* Make the name shorter.

* Fix lint error.

* Refine the benchmark name.

92bad0d2

Fix non dist strat case. (#6867) · 68650c42
rxsang authored May 23, 2019

68650c42

Add enable_get_next_as_optional flag. (#6858) · 272a2baa

rxsang authored May 22, 2019

* Add enable_get_next_as_optional flag.

* Set enable_get_next_as_optional to strategy.

* Add comments to explain the flag.

* Remove trailing whitespace.

* Remove trailing space.

272a2baa

22 May, 2019 1 commit
- Add tweaked fp32 test; Run accuracy tests with LR ops (#6843) · 4726c5b9
  Haoyu Zhang authored May 22, 2019
  
  4726c5b9
21 May, 2019 1 commit
- Replace GlobalAvgPooling with reduce mean to reduce cross-device overhead (#6837) · 1529b82c
  Haoyu Zhang authored May 21, 2019
  
  1529b82c
15 May, 2019 2 commits

Set the --clone_model_in_keras_dist_strat to None. (#6781) · 2d4cfad0

Igor authored May 15, 2019

* Set the --clone_model_in_keras_dist_strat to None.  Remove the separate no_cloning benchmarks and add a couple of cloning ones.  Fixes the learning rate schedule to cache its ops per graph.

2d4cfad0

Adds keras imagenet benchmarks which use tf.data's `experimental_slack` option. (#6744) · 6aa6bac5

Rachel Lim authored May 15, 2019

* Added 'tfdata_exp' version of all benchmarks which set
FLAGS.tf_data_experimental_slack = True. Renamed
`data_prefetch_with_slack` to `data_delay_prefetch` (haoyu's change)
to make the names more distinct.

* Add flag to resnet input pipeline and surface through
keras_imagenet_main.py

6aa6bac5

10 May, 2019 5 commits
- Fix trivial model to work properly with fp16 (#6760) · c0b31c51
  Haoyu Zhang authored May 10, 2019
```
* Fix trivial model to work properly with fp16

* Add comment on manual casting
```
  c0b31c51
- Minimize variables and computation in trivial model (#6759) · 5e876e6e
  Haoyu Zhang authored May 10, 2019
```
Previously we had one dense layer in trivial model. The weight was [224*224*3, num_classes].
Using two dense layers, the weights are [224*224*3, 1] and [1, num_classes].
```
  5e876e6e
- Do not report accuracy metrics for benchmark tests (#6757) · cfa37aab
  Haoyu Zhang authored May 10, 2019
```
* Do not report metrics in performance benchmarks

* Rename flag
```
  cfa37aab
- Fix broken test in V2 (#6755) · bae940dc
  Haoyu Zhang authored May 10, 2019
  
  bae940dc
- Use LR schedule ops instead of LR callback for tweaked tests (#6745) · 4b4dbad1
  Haoyu Zhang authored May 09, 2019
```
* Modified tweaked tests to use tensor learning rate
```
  4b4dbad1
09 May, 2019 1 commit

Use TensorFlow ops for Keras LearningRateSchedule (#6739) · 9d38e894

Haoyu Zhang authored May 09, 2019

* Add learning rate tensor. This makes training slower

* Improve LearningRateSchedule with better efficiency

* Fix lint error

* Replace constant definition with existing one

9d38e894

07 May, 2019 1 commit
- Use flags to define collective ops when initializing MirroredStrategy (#6724) · f5073f49
  Haoyu Zhang authored May 07, 2019
  
  f5073f49
06 May, 2019 1 commit
- Fix ResNet model convergence problem (#6721) · a182abc1
  Haoyu Zhang authored May 06, 2019
  
  a182abc1
04 May, 2019 1 commit

Enable CuDNN BatchNorm spatial persistent by default (#6710) · 58deb059

Haoyu Zhang authored May 03, 2019

* Enable CuDNN BatchNorm spatial persistent by default; Remove 2nd zero padding layer

* Apply scale=False and fused=True consistently to BatchNorm layers

* Undo remove padding layer

* Replace zero padding with padding attribute in max pooling for better performance

* Resolve comments

* Revert "Replace zero padding with padding attribute in max pooling for better performance"

This reverts commit ad49db057c800ecac008eec1057005bd2c08ac73.

58deb059

30 Apr, 2019 1 commit
- Eval every 10 epochs to better match estimator tests. (#6696) · 3ee027fb
  Toby Boyd authored Apr 30, 2019
  
  3ee027fb
29 Apr, 2019 2 commits

bug fix (#6695) · 0f6f656f
Shining Sun authored Apr 29, 2019
```
* bug fix

* bug fix
```
0f6f656f

Add benchmarks with the --cloning flag to Resnet and NFC. (#6675) · af47736d

Igor authored Apr 29, 2019

* Add benchmarks with the --cloning flag to Resnet and NFC.

* Renamed cloning to clone_model_in_keras_dist_strat. Dropped a few tests that aren't essential.

* Fixed up the formatting after re-naming the flag to a much longer  name.  Thanks, lint.
* Fixed the lint error in nfc_common.py

af47736d