Commits · dc4c5f1acbd085a6cd92330451736b44d60ce915 · ModelZoo / ResNet50_tensorflow

01 Aug, 2019 1 commit

Merged commit includes the following changes: (#7354) · dc4c5f1a

Haoyu Zhang authored Aug 01, 2019

261171038  by gjn<gjn@google.com>:

    Remove weight_decay_rate 0 early exit check

    Removing this code path should be fine since this was actually not doing
    what it meant to do. Since weight_decay_rate is actually a tensor, the
    equality check was only looking at the id of the object and comparing to
    0. This should never be true. Evaluating a tensor is also not what we
    want to do at this point of the code. Thus it should be fine to simply
    remove this code.

--
261169862  by haoyuzhang<haoyuzhang@google.com>:

    Internal change

261153520  by haoyuzhang<haoyuzhang@google.com>:

    Internal change

261140302  by hongkuny<hongkuny@google.com>:

    Clean up

--

PiperOrigin-RevId: 261171038

dc4c5f1a

24 Jul, 2019 5 commits
- Unskip tests with 1.x · 296d0d3f
  guptapriya authored Jul 18, 2019
  
  296d0d3f
- Remove loss layer test · 3a796b5a
  guptapriya authored Jul 18, 2019
  
  3a796b5a
- Remove loss layer · 67f81649
  guptapriya authored Jul 18, 2019
  
  67f81649
- Update synth data pipeline dtype · ffbada72
  guptapriya authored Jul 18, 2019
  
  ffbada72
- Use add_loss in transformer model · 13cc0f70
  guptapriya authored Jul 18, 2019
  
  13cc0f70
23 Jul, 2019 1 commit

Single execution path tests for ResNet50, ResNet56, NCF, and Shakespeare LSTM. (#7276) · 9d8c9aa4

Toby Boyd authored Jul 23, 2019

* Add force_run_distributed tests.

* Added enable_eager

* r/force_run_distributed/force_v2_in_keras_compile

* Adding force_v2 tests and FLAGs.

* Rename method to avoid conflict.

* Add cpu force_v2 tests.

* fix lint, wrap line.

* change to force_v2_in_keras_compile

* Update method name.

* Lower mlperf target to 0.736.

9d8c9aa4

20 Jul, 2019 1 commit
- [Transformer] Use float16 input and output for softmax in mixed-precision training · 448c31b6
  Zongwei Zhou authored Jul 12, 2019
  
  448c31b6
11 Jul, 2019 3 commits
- Reduce transformer fp16 test to 12 iterations. (#7183) · 81123ebf
  Toby Boyd authored Jul 11, 2019
  
  81123ebf
- Record highest uncased bleu found. (#7196) · 35620eaf
  Toby Boyd authored Jul 11, 2019
```
* Record highest uncased bleu found.

* change to bleu_best_score_iteration
```
  35620eaf
- Merged commit includes the following changes: (#7191) · 13feba3c
  saberkun authored Jul 10, 2019
```
257314238  by hongkuny<hongkuny@google.com>:

    Creates transformer v2 README.
    Remove contents that are not implemented.

--

PiperOrigin-RevId: 257314238
```
  13feba3c
08 Jul, 2019 1 commit
- Reduce iterations from 20 to 12 and add FP16 dynamic. (#7168) · cf1a276a
  Toby Boyd authored Jul 08, 2019
```
* reduce iterations from 20 to 12.

* add fp16 dynamic batch accuracy check.

* fix existing lint issue.
```
  cf1a276a
03 Jul, 2019 1 commit

Unit tests pass TF 2.0 GPU and CPU locally. (#7101) · 49097655

Toby Boyd authored Jul 03, 2019

* Fix unit tests failures.

* 96% of TF 2.0 tests on GPU are passing.

* Currently all passing GPU and CPU TF 2.0

* Address code comments.

* use tf 2.0 cast.

* Comment about working on TF 2.0 CPU

* Uses contrib turn off for TF 2.0.

* Fix wide_deep and add keras_common_tests.

* use context to get num_gpus.

* Switch to tf.keras.metrics

49097655

28 Jun, 2019 1 commit
- Add FP16 end-to-end tests (#7122) · 58a3de6c
  Toby Boyd authored Jun 28, 2019
  
  58a3de6c
22 Jun, 2019 1 commit
- Fix unit tests failures. (#7086) · 47a59023
  Toby Boyd authored Jun 22, 2019
  
  47a59023
21 Jun, 2019 2 commits
- Transformer 2.0: Make metrics layer optional (#7075) · 513fdbb2
  guptapriya authored Jun 21, 2019
```
* trying fake merge call

* make metrics optional

* Remove extra print
```
  513fdbb2
- Fix Transformer Perfzero issue with fp16 (#7074) · b578aee9
  Reed authored Jun 20, 2019
  
  b578aee9
20 Jun, 2019 2 commits
- Increase the number of steps for the scheduled Keras Big run. (#7069) · 018ab4b5
  Igor authored Jun 20, 2019
  
  018ab4b5
- Transformer xla and FP16 benchmarks (#7061) · 695265c8
  Toby Boyd authored Jun 19, 2019
```
* Add XLA benchmark tests FP32 only for now.

* Add FP16 XLA tests.

* FP16 only tests.
```
  695265c8
19 Jun, 2019 2 commits

Add mixed precision support to Transformer (#7011) · f8ec01ae
Reed authored Jun 19, 2019

f8ec01ae

Add XLA to transformer (#7048) · 269581dc

Toby Boyd authored Jun 19, 2019



* set default steps to 300K.

* Log flags to perfzero.

* Add XLA support to transformer

- Moved config logic to keras_utils
- Added enable_xla flag to _performance flags
- Did not refactor enable_xla flag from keras resnet due to
  reliance on calling FLAGs in estimator keras and that is
  a needed refactor for another time.

* fix g3 lint complaint.

* Refactor set config into keras_utils.

* Move flags out of main.

* pipe through enable_xla

* Update official/transformer/v2/misc.py
Co-Authored-By: Reed <reedwm@google.com>

269581dc

18 Jun, 2019 1 commit
- set default steps to 300K. (#7046) · 01c3a4f3
  Toby Boyd authored Jun 18, 2019
  
  01c3a4f3
06 Jun, 2019 1 commit
- Include flags when reporting transformer benchmark (#6957) · 0a83bef9
  Reed authored Jun 06, 2019
  
  0a83bef9
05 Jun, 2019 7 commits
- fix lint errors · e4bf28fc
  guptapriya authored Jun 05, 2019
  
  e4bf28fc
- fix distributed tests · 2a56bb7e
  guptapriya authored Jun 05, 2019
  
  2a56bb7e
- add strategy specific tests · d967bfae
  guptapriya authored Jun 05, 2019
  
  d967bfae
- Fix existing tests · 3aee5697
  guptapriya authored Jun 05, 2019
  
  3aee5697
- Remove metrics hack for dist strat · 6cfa81a1
  guptapriya authored Jun 04, 2019
  
  6cfa81a1
- fix lint · 99a34a6c
  guptapriya authored Jun 05, 2019
  
  99a34a6c
- Separate 1 GPU transformer benchmarks · 97c1e898
  guptapriya authored Jun 04, 2019
  
  97c1e898
31 May, 2019 2 commits
- Fix internal lint errors (#6937) · 7546a9e3
  Haoyu Zhang authored May 31, 2019
  
  7546a9e3
- Fix various lint errors (#6934) · ba415414
  Haoyu Zhang authored May 31, 2019
```
* Fix various lint errors

* Fix logging format
```
  ba415414
29 May, 2019 3 commits
- Make max_length and static_batch configurable (#6893) · ab1c1dfc
  Zhang Xunkai authored May 29, 2019
```
* Make max_length and static_batch configurable.

* Fix line length.

* Fix incorrect parameters in building eval input.

* Improve comments for readability.
```
  ab1c1dfc
- Reduce max_length to 64 in static_batch cases. · 39638d66
  guptapriya authored May 28, 2019
  
  39638d66
- fix num_gpus in benchmark · 3bb5dd6c
  guptapriya authored May 28, 2019
  
  3bb5dd6c
28 May, 2019 5 commits

Make 'off' a string literal. · 3928d481
Igor authored May 28, 2019

3928d481
Turn dist strat off for 1 GPU benchmarks · 2be9ba5b
guptapriya authored May 28, 2019

2be9ba5b

undo shuffle change · df523d91

guptapriya authored May 28, 2019

this is not going to help with current tf.data semantics. so removing it.

df523d91

Add distribute strategies to transformer. (#6883) · b9c1d1ca

Igor authored May 28, 2019

* Fixes that make transformer run.

* Remove debug print statements.

* Changed the permissions to 644.

* Fix the rest of the permissions.

* enable static batch in all benchmarks

* Restrict dist strat hack to training mode

For now we will do predict/eval without dist strat, so remove that hack in non training cases.

* Use `inputs` instead of `x` as arg name for call

Keras has different behavior based on whether the inputs are called `inputs` or not. Using `inputs` gives expected behaviors.

* Avoid extra map fn on input in dist strat case

* Update how we handle custom metrics

This new approach works with and without dist strat. The previous one didn't work with dist strat. We need to fix that but this is reasonable in meantime (b/133724664).

* Update benchmarks

* typo in metrics code

* Revert metrics change

Didn't actually work in distributed case..

b9c1d1ca

Add shuffle to dataset records · 733a752d
guptapriya authored May 28, 2019
```
This shuffling should help in getting shuffling each epoch.
```
733a752d