Commits · 85bdf764e7ae6eaba594eb6cea3384eb16e39666 · ModelZoo / ResNet50_tensorflow

22 May, 2019 5 commits

Merged commit includes the following changes: (#6856) · 85bdf764
saberkun authored May 22, 2019
```
249500988  by hongkuny<hongkuny@google.com>:

    Lints

--

PiperOrigin-RevId: 249500988
```
85bdf764

Add Transformer Big Benchmarks + FP16 for other tests. (#6838) · 23f75313

Toby Boyd authored May 22, 2019

* Add big tests.

* fix super

* Add fp16, increase 8xGPU batch-sizes

* Adding the rest of the fp16 tests.

* Big accuracy test batch_perf_gpu

* fix docstrings

* add _run_and_report

* Edited docstrings

23f75313

Merge Transformer V2 to Github (#6846) · c4f34e58

Tian Lin authored May 22, 2019

* Merged commit includes the following changes:
249218656  by tianlin<tianlin@google.com>:

    Deal with imports, fix a typo and make unit tests fast.

--
249198645  by tianlin<tianlin@google.com>:

    Trivial: Remove one empty line before "import tensorflow"

--
249195490  by tianlin<tianlin@google.com>:

    Initialize Transformer TF V2 Model with Keras subclassing implementation. (Compatible with TF V1)

--
249195008  by tianlin<tianlin@google.com>:

    Internal change

249173564  by hongkuny<hongkuny@google.com>:

    Internal change

249079258  by hongkuny<hongkuny@google.com>:

    Internal change

247691534  by haoyuzhang<haoyuzhang@google.com>:

    Internal change

247533725  by haoyuzhang<haoyuzhang@google.com>:

    Internal change

247509295  by haoyuzhang<haoyuzhang@google.com>:

    Internal change

247311355  by wangtz<wangtz@google.com>:

    Internal change

247303127  by wangtz<wangtz@google.com>:

  ...

c4f34e58

Add tweaked fp32 test; Run accuracy tests with LR ops (#6843) · 4726c5b9
Haoyu Zhang authored May 22, 2019

4726c5b9

Merged commit includes the following changes: (#6847) · 30d14a96

saberkun authored May 21, 2019

249377254  by hongkuny<hongkuny@google.com>:

    Internal change

249373328  by hongkuny<hongkuny@google.com>:

    Clean up tf import

--
249333938  by hongkuny<hongkuny@google.com>:

    Fix tf1 import

--
249325089  by hongkuny<hongkuny@google.com>:

    BERT 2.0

--
249173564  by hongkuny<hongkuny@google.com>:

    Internal change

PiperOrigin-RevId: 249377254

30d14a96

21 May, 2019 1 commit
- Replace GlobalAvgPooling with reduce mean to reduce cross-device overhead (#6837) · 1529b82c
  Haoyu Zhang authored May 21, 2019
  
  1529b82c
20 May, 2019 1 commit
- Handle empty eval results in estimator_benchmark. (#6827) · 7ac267a8
  Ayush Dubey authored May 20, 2019
```
* Delete accuracy if exists in eval results.

* get global_step only if it exists in eval results
```
  7ac267a8
18 May, 2019 2 commits
- Include flags when reporting benchmark. (#6809) · bdae51af
  Reed authored May 17, 2019
```
This will allow one to easily reproduce a benchmark by running with the flags.
```
  bdae51af
- Do not return immediately after train_and_evaluate. (#6807) · 23662bb4
  Ayush Dubey authored May 17, 2019
  
  23662bb4
15 May, 2019 3 commits

Fix lint errors. (#6788) · 03027e33
Rachel Lim authored May 15, 2019

03027e33

Set the --clone_model_in_keras_dist_strat to None. (#6781) · 2d4cfad0

Igor authored May 15, 2019

* Set the --clone_model_in_keras_dist_strat to None.  Remove the separate no_cloning benchmarks and add a couple of cloning ones.  Fixes the learning rate schedule to cache its ops per graph.

2d4cfad0

Adds keras imagenet benchmarks which use tf.data's `experimental_slack` option. (#6744) · 6aa6bac5

Rachel Lim authored May 15, 2019

* Added 'tfdata_exp' version of all benchmarks which set
FLAGS.tf_data_experimental_slack = True. Renamed
`data_prefetch_with_slack` to `data_delay_prefetch` (haoyu's change)
to make the names more distinct.

* Add flag to resnet input pipeline and surface through
keras_imagenet_main.py

6aa6bac5

11 May, 2019 2 commits

Remove flacky test: test_bad_seed (#6761) · 03242e38
Toby Boyd authored May 10, 2019
```
- Test passes locally python3 and test is already
    skipped for python2.
```
03242e38

Add FP16 to transformer with benchmark tests. (#6756) · b7e97bec

Toby Boyd authored May 10, 2019

* Add FP16 and benchmarks.

* add missing run and report.

* Add loss_scale as option not included with dtype.

* move loss_scale validation under dtype conditional.

* add loss_scale to flags tested.

b7e97bec

10 May, 2019 5 commits
- Fix trivial model to work properly with fp16 (#6760) · c0b31c51
  Haoyu Zhang authored May 10, 2019
```
* Fix trivial model to work properly with fp16

* Add comment on manual casting
```
  c0b31c51
- Minimize variables and computation in trivial model (#6759) · 5e876e6e
  Haoyu Zhang authored May 10, 2019
```
Previously we had one dense layer in trivial model. The weight was [224*224*3, num_classes].
Using two dense layers, the weights are [224*224*3, 1] and [1, num_classes].
```
  5e876e6e
- Do not report accuracy metrics for benchmark tests (#6757) · cfa37aab
  Haoyu Zhang authored May 10, 2019
```
* Do not report metrics in performance benchmarks

* Rename flag
```
  cfa37aab
- Fix broken test in V2 (#6755) · bae940dc
  Haoyu Zhang authored May 10, 2019
  
  bae940dc
- Use LR schedule ops instead of LR callback for tweaked tests (#6745) · 4b4dbad1
  Haoyu Zhang authored May 09, 2019
```
* Modified tweaked tests to use tensor learning rate
```
  4b4dbad1
09 May, 2019 2 commits

Transformer instrumented for benchmarking (#6734) · 40543869

Toby Boyd authored May 09, 2019

* Add first benchmark and return stats.

* Remove print statements update training steps.

* Revert print T: in print statement.

* Remove print(stats)

* add 2 gpu accuracy test for base.

* Fixed total_batch_size when using gpu + gFile deprecations.

* 8 GPU test name fix

* Add 4 and 8 GPU tests.

* typo fixes.

* Clean up test names and methods.

* bleu uncased.  docstring format fix.

40543869

Use TensorFlow ops for Keras LearningRateSchedule (#6739) · 9d38e894

Haoyu Zhang authored May 09, 2019

* Add learning rate tensor. This makes training slower

* Improve LearningRateSchedule with better efficiency

* Fix lint error

* Replace constant definition with existing one

9d38e894

08 May, 2019 1 commit
- r/tf.random_uniform/tf.random.uniform (#6735) · 9c5253f1
  Toby Boyd authored May 08, 2019
  
  9c5253f1
07 May, 2019 2 commits
- Use flags to define collective ops when initializing MirroredStrategy (#6724) · f5073f49
  Haoyu Zhang authored May 07, 2019
  
  f5073f49
- Move tests_data to gcs and upgrade data_download. (#6722) · 0f76239b
  Toby Boyd authored May 06, 2019
  
  0f76239b
06 May, 2019 1 commit
- Fix ResNet model convergence problem (#6721) · a182abc1
  Haoyu Zhang authored May 06, 2019
  
  a182abc1
04 May, 2019 1 commit

Enable CuDNN BatchNorm spatial persistent by default (#6710) · 58deb059

Haoyu Zhang authored May 03, 2019

* Enable CuDNN BatchNorm spatial persistent by default; Remove 2nd zero padding layer

* Apply scale=False and fused=True consistently to BatchNorm layers

* Undo remove padding layer

* Replace zero padding with padding attribute in max pooling for better performance

* Resolve comments

* Revert "Replace zero padding with padding attribute in max pooling for better performance"

This reverts commit ad49db057c800ecac008eec1057005bd2c08ac73.

58deb059

03 May, 2019 1 commit
- Add graph rewrite convergence benchmark (#6712) · 0a96c7b4
  Reed authored May 02, 2019
  
  0a96c7b4
02 May, 2019 1 commit
- Add graph rewrite benchmarks (#6708) · e172ac82
  Reed authored May 02, 2019
  
  e172ac82
01 May, 2019 1 commit

Add --fp16_implementation option. (#6703) · b691578c

Reed authored May 01, 2019

This options allows the new tf.train.experimental.enable_mixed_precision_graph_rewrite() function to be used for fp16, instead of manual casts.

b691578c

30 Apr, 2019 1 commit
- Eval every 10 epochs to better match estimator tests. (#6696) · 3ee027fb
  Toby Boyd authored Apr 30, 2019
  
  3ee027fb
29 Apr, 2019 5 commits

Replace per_device with per_replica and PerDevice with PerReplica, because the... · b00783d7

Igor authored Apr 29, 2019

Replace per_device with per_replica and PerDevice with PerReplica, because the PerDevice concept was renamed and doesn't exist anymore. (#6693)

* Replace per_device with per_replica and PerDevice with PerReplica, because the PerReplica concept was renamed and doesn't exist anymore.

b00783d7

Add accuracy check. (#6694) · 294660bd

Toby Boyd authored Apr 29, 2019

* Add accuracy check.

* Avoid double flag init, move data_dir to real data.

* Comment on lower accuracy target.

294660bd

bug fix (#6695) · 0f6f656f
Shining Sun authored Apr 29, 2019
```
* bug fix

* bug fix
```
0f6f656f

Add benchmarks with the --cloning flag to Resnet and NFC. (#6675) · af47736d

Igor authored Apr 29, 2019

* Add benchmarks with the --cloning flag to Resnet and NFC.

* Renamed cloning to clone_model_in_keras_dist_strat. Dropped a few tests that aren't essential.

* Fixed up the formatting after re-naming the flag to a much longer  name.  Thanks, lint.
* Fixed the lint error in nfc_common.py

af47736d

fixed simple typo (#6686) · d087c89b
Songyi Blair Han authored Apr 30, 2019

d087c89b

26 Apr, 2019 3 commits

Combined imagenet and cifar-10 estimator tests (#6672) · acc6f6d7

Toby Boyd authored Apr 26, 2019

* Combined imagenet and cifar-10 benchmarks

* Comments and epochs_between_evals.

* Added tuned tests and cleaned up benchmark flags

* Fix names.

* Return results and add images/sec hook.

* updated doc strings for return values.

* 128 to 256 batch for FP16 test

* added more doc strings to fix lint.

acc6f6d7

Add num_packs flag for MirroredStrategy's cross device ops. (#6676) · 4a1fba0b

Ayush Dubey authored Apr 26, 2019

* Add num_packs flag for MirroredStrategy's cross device ops.

* fix parens

* Fix lint errors and make all_reduce_alg more robust.

* Set default num_packs to 1

4a1fba0b

Do not query GPU compatibility before app main (#6679) · 9b17d796

Gaurav Jain authored Apr 25, 2019

tf.test.is_gpu_available() should not be called in flags since this is
called before app.main() and the runtime has not yet been initialized.

9b17d796

25 Apr, 2019 2 commits
- Remove contrib cross device ops and update all_reduce_alg options. (#6673) · ece99414
  Ayush Dubey authored Apr 25, 2019
```
* Remove contrib AllReduceCrossDeviceOps and update all_reduce_alg options with MirroredStrategy.

* cleanup
```
  ece99414
- Revert "Specify NCCL as the all reduce algorithm (#6662)" (#6671) · a7338771
  Haoyu Zhang authored Apr 25, 2019
```
Reason: test failures because contrib is not available in V2

This reverts commit 325dd761.
```
  a7338771