Commits · 1fb34e76c1f43dc3917445bf4cb5f8559b49941e · ModelZoo / ResNet50_tensorflow

18 Jul, 2019 3 commits

Merged commit includes the following changes: (#7252) · 1fb34e76

Hongkun Yu authored Jul 18, 2019

258597234  by rxsang<rxsang@google.com>:

    Update all the TPUStrategy examples to use the new v2 APIs, i.e.
    make_dataset_iterator -> experimental_distribute_dataset,
    make_input_fn_iterator -> experimental_distribute_datasets_from_function,
    unwrap -> experimental_local_results,
    experimental_run -> experimental_run_v2
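
As a rough illustration of the rename list above, the sketch below rewrites method-call sites in source text. The mapping is copied from the commit message; the helper name `rename_strategy_calls` is hypothetical and not part of the commit. The old and new calls may take different arguments or return different objects, so this is only a way to locate and rename call sites for manual review, not a complete migration.

```python
import re

# Old -> new method names, copied from the commit message above.
TPU_STRATEGY_V2_RENAMES = {
    "make_dataset_iterator": "experimental_distribute_dataset",
    "make_input_fn_iterator": "experimental_distribute_datasets_from_function",
    "unwrap": "experimental_local_results",
    "experimental_run": "experimental_run_v2",
}

# Match ".<old_name>(" so only method-call sites are rewritten. Requiring the
# "(" right after the name keeps already-migrated calls such as
# ".experimental_run_v2(" untouched.
_CALL_PATTERN = re.compile(
    r"\.(" + "|".join(map(re.escape, TPU_STRATEGY_V2_RENAMES)) + r")\("
)


def rename_strategy_calls(source: str) -> str:
    """Return `source` with the old method names replaced by the v2 names."""
    return _CALL_PATTERN.sub(
        lambda m: "." + TPU_STRATEGY_V2_RENAMES[m.group(1)] + "(", source
    )
```

Running the helper twice yields the same text as running it once, so it can be applied repeatedly to a partially migrated file.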

--
258581998  by taylorrobie<taylorrobie@google.com>:

    Update keras v2 optimizers to reuse coefficients which are shared across all updates, which reduces the total number of ops created by between 5% (for simple optimizers such as SGD and Adagrad) and 25% (for complicated optimizers such as Adam and NAdam). Separate copies are made for each device and dtype.

    The effect of this change on run time is fairly minimal since Grappler is expected to consolidate most of these ops; however it does improve graph construction time.
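
The idea of sharing coefficients across updates, with one copy per device and dtype, can be sketched in plain Python. This is not the Keras implementation: `CoefficientCache` and `adam_coefficients` are hypothetical names, and the coefficients shown are just the standard Adam bias-corrected step size and its companions.

```python
import math
from typing import Callable, Dict, Tuple

Coefficients = Dict[str, float]


def adam_coefficients(lr: float, beta_1: float, beta_2: float,
                      epsilon: float, step: int) -> Coefficients:
    """Values that are identical for every variable updated in one step."""
    return {
        "lr_t": lr * math.sqrt(1.0 - beta_2 ** step) / (1.0 - beta_1 ** step),
        "one_minus_beta_1": 1.0 - beta_1,
        "one_minus_beta_2": 1.0 - beta_2,
        "epsilon": epsilon,
    }


class CoefficientCache:
    """Builds coefficients once per (device, dtype) pair and reuses them."""

    def __init__(self, build: Callable[[str, str], Coefficients]) -> None:
        self._build = build
        self._cache: Dict[Tuple[str, str], Coefficients] = {}
        self.builds = 0  # counts how many times coefficients were constructed

    def get(self, device: str, dtype: str) -> Coefficients:
        key = (device, dtype)
        if key not in self._cache:
            self._cache[key] = self._build(device, dtype)
            self.builds += 1
        return self._cache[key]


# Four variables, but only three distinct (device, dtype) pairs.
cache = CoefficientCache(
    lambda device, dtype: adam_coefficients(0.001, 0.9, 0.999, 1e-7, step=1)
)
variables = [("gpu:0", "float32"), ("gpu:0", "float32"),
             ("gpu:1", "float32"), ("gpu:0", "float16")]
for device, dtype in variables:
    cache.get(device, dtype)
```

Without the cache, each variable update would rebuild the same values, which is the kind of duplicated work the commit message describes removing.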

--

PiperOrigin-RevId: 258597234


Refactor and add benchmarks as well as accuracy tests for GPU and CPU (#7248) · e0a2b8c3

Toby Boyd authored Jul 18, 2019

* Added benchmarks and common flags.

* Add cpu tests.

* Add tracking epoch times.

* fix transformer.

* Add examples_per_second.

* fix pylint


Improve Keras graph performance for ResNet56 (#7241) · dd5a91d3

Haoyu Zhang authored Jul 18, 2019

* Config threadpool, cuDNN persistent BN, and grappler layout optimizer properly for ResNet56

* Add tweaked tests for Resnet56

* Avoid triggering the last partial batch overhead by explicitly dropping remainder
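
The last bullet refers to dropping the final, smaller batch so that every step sees the same batch shape. A minimal plain-Python sketch of that behaviour follows; the `batches` helper is hypothetical and only mirrors the idea behind the `drop_remainder` argument of `tf.data.Dataset.batch`, which may or may not be the exact mechanism this commit uses.

```python
from typing import Iterator, List, Sequence, TypeVar

T = TypeVar("T")


def batches(items: Sequence[T], batch_size: int,
            drop_remainder: bool = True) -> Iterator[List[T]]:
    """Yield consecutive batches; optionally skip the final partial batch."""
    # Index where the last full batch ends.
    full_end = len(items) - len(items) % batch_size
    for start in range(0, full_end, batch_size):
        yield list(items[start:start + batch_size])
    # The leftover elements form a smaller batch with a different shape.
    if not drop_remainder and full_end < len(items):
        yield list(items[full_end:])
```

With 10 items and a batch size of 4, the default yields two batches of 4 and discards the last 2 items; passing `drop_remainder=False` adds a third batch of size 2.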


16 Jul, 2019 2 commits

Merged commit includes the following changes: (#7221) · e21dcdd0

Hongkun Yu authored Jul 16, 2019

258208153  by hongkuny<hongkuny@google.com>:

    Adds run_eagerly option for bert.

--

PiperOrigin-RevId: 258208153


Ncf perf optimizations for CTL and multi GPU (#7206) · 492f8c92

nnigania authored Jul 16, 2019

* NCF perf changes: 1) exclude the metric layer from the CTL train step; 2) dataset optimization to fix the size of sample_weights, preventing a costly broadcast during loss calculation in the multi-GPU case


15 Jul, 2019 2 commits
- Initial implementation of Shakespeare character LSTM. (#7218) · 395f6d2d
  Bruce Fontaine authored Jul 15, 2019
```
* Initial implementation of Shakespeare character LSTM.

* Fix import order
```
- Merged commit includes the following changes: (#7209) · dc8c6ce1
  Hongkun Yu authored Jul 15, 2019
```
257883986  by hongkuny<hongkuny@google.com>:

    Adds tf.summary for bert training

--

PiperOrigin-RevId: 257883986
```
11 Jul, 2019 5 commits
- Reduce transformer fp16 test to 12 iterations. (#7183) · 81123ebf
  Toby Boyd authored Jul 11, 2019
  
- Record highest uncased bleu found. (#7196) · 35620eaf
  Toby Boyd authored Jul 11, 2019
```
* Record highest uncased bleu found.

* change to bleu_best_score_iteration
```
- Add stdev to the Dense layer. (#7189) · fa28535d
  Toby Boyd authored Jul 10, 2019
  
- Merged commit includes the following changes: (#7191) · 13feba3c
  saberkun authored Jul 10, 2019
```
257314238  by hongkuny<hongkuny@google.com>:

    Creates transformer v2 README.
    Remove contents that are not implemented.

--

PiperOrigin-RevId: 257314238
```
- Move Keras Hook to use global step to resolve issues across epochs. (#7186) · f4b02d15
  Toby Boyd authored Jul 10, 2019
```
* Move to global_step.

* Hook to use global_step.

* fix comment start step 1 not step 0.

* remove hack used for testing.

* Add docstring.
```
09 Jul, 2019 1 commit
- Improve performance for Cifar ResNet benchmarks (#7178) · 2ed43e66
  Haoyu Zhang authored Jul 09, 2019
```
* Improve performance for Cifar ResNet benchmarks

* Revert batch size changes to benchmarks
```
08 Jul, 2019 2 commits
- Reorder and then add CTL XLA tests. (#7169) · 18e477c6
  Toby Boyd authored Jul 08, 2019
  
- Reduce iterations from 20 to 12 and add FP16 dynamic. (#7168) · cf1a276a
  Toby Boyd authored Jul 08, 2019
```
* reduce iterations from 20 to 12.

* add fp16 dynamic batch accuracy check.

* fix existing lint issue.
```
03 Jul, 2019 1 commit

Unit tests pass TF 2.0 GPU and CPU locally. (#7101) · 49097655

Toby Boyd authored Jul 03, 2019

* Fix unit tests failures.

* 96% of TF 2.0 tests on GPU are passing.

* Currently all passing GPU and CPU TF 2.0

* Address code comments.

* use tf 2.0 cast.

* Comment about working on TF 2.0 CPU

* Uses contrib turn off for TF 2.0.

* Fix wide_deep and add keras_common_tests.

* use context to get num_gpus.

* Switch to tf.keras.metrics


02 Jul, 2019 3 commits

Merged commit includes the following changes: (#7141) · 5175b7e6

saberkun authored Jul 02, 2019

256204636  by hongkuny<hongkuny@google.com>:

    Internal

--
256079834  by hongkuny<hongkuny@google.com>:

    Clean up: move common flags together for further refactoring
    Enable steps_per_loop option for all applications.

--

PiperOrigin-RevId: 256204636


Add StepCounterHook to hooks_helper.py (#7134) · 8155eb9d
Yuefeng Zhou authored Jul 02, 2019
```
* Add StepCounterHook to hooks_helper.py

* Update symbol.
```
Allow distribution_utils.py to work with PSStrategy or none strategy (#7135) · 680eb35c
Yuefeng Zhou authored Jul 02, 2019
```
when there are multiple workers.
```

28 Jun, 2019 4 commits

Add FP16 end-to-end tests (#7122) · 58a3de6c
Toby Boyd authored Jun 28, 2019


NCF CTL Perf optimization to convert gradients from sparse to dense (#7102) · 44ff121d

nnigania authored Jun 28, 2019

* borrowing a tf1.x optimization which converts gradients from sparse to dense for better perf

* cleanup after code review
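
The sparse-to-dense conversion mentioned in the first bullet can be illustrated without TensorFlow. In this sketch a sparse gradient is a list of row indices plus one value row per index (loosely modelled on `tf.IndexedSlices`), and `densify` is a hypothetical helper, not code from the commit. Rows that appear more than once are summed.

```python
from typing import List, Sequence


def densify(indices: Sequence[int], values: Sequence[Sequence[float]],
            num_rows: int) -> List[List[float]]:
    """Scatter-add sparse gradient rows into a dense num_rows x dim matrix."""
    dim = len(values[0]) if values else 0
    dense = [[0.0] * dim for _ in range(num_rows)]
    for row, vector in zip(indices, values):
        for col, value in enumerate(vector):
            # Repeated indices accumulate instead of overwriting.
            dense[row][col] += value
    return dense


# Row 0 is touched twice, row 1 not at all.
dense_grad = densify([0, 2, 0], [[1.0, 2.0], [3.0, 4.0], [0.5, 0.5]], num_rows=3)
```

A dense gradient costs more memory for large embedding tables, so whether the conversion helps depends on the model; the commit message reports it as a performance win for NCF.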


Merged commit includes the following changes: (#7119) · 5afa9569

saberkun authored Jun 27, 2019

* Merged commit includes the following changes:
255493073  by hongkuny<hongkuny@google.com>:

    BERT initial OSS readme update.

--
255470372  by dmchen<dmchen@google.com>:

    Slightly expand expected range for F1 score in BERT SQuAD accuracy test

--
255109240  by hongkuny<hongkuny@google.com>:

    Update eval/predict batch sizes.

--
255010016  by hongkuny<hongkuny@google.com>:

    Internal

--
254874613  by hongkuny<hongkuny@google.com>:

    Update glue tasks enum to match directory name

--
254866171  by taylorrobie<taylorrobie@google.com>:

    Internal change

254785517  by zongweiz<zongweiz@google.com>:

    Use train_single_step for BERT GPU models to temporarily work around some performance bugs in GPU runs

--
254497647  by hongkuny<hongkuny@google.com>:

    Fix device placement for TPU export model.

--

PiperOrigin-RevId: 255493073

* Update README.md


Merged commit includes the following changes: (#7116) · 76995053

David M. Chen authored Jun 27, 2019

255493073  by hongkuny<hongkuny@google.com>:

    BERT initial OSS readme update.

--
255470372  by dmchen<dmchen@google.com>:

    Slightly expand expected range for F1 score in BERT SQuAD accuracy test

--
255109240  by hongkuny<hongkuny@google.com>:

    Update eval/predict batch sizes.

--
255010016  by hongkuny<hongkuny@google.com>:

    Internal

--

PiperOrigin-RevId: 255493073


25 Jun, 2019 1 commit

Merged commit includes the following changes: (#7100) · a156e203

saberkun authored Jun 25, 2019

254874613  by hongkuny<hongkuny@google.com>:

    Update glue tasks enum to match directory name

--
254866171  by taylorrobie<taylorrobie@google.com>:

    Internal change

PiperOrigin-RevId: 254874613


24 Jun, 2019 2 commits

Merged commit includes the following changes: (#7093) · 240623ac

saberkun authored Jun 24, 2019

254785517  by A. Unique TensorFlower<gardener@tensorflow.org>:

    Use train_single_step for BERT GPU models to temporarily work around some performance bugs in GPU runs

--
254497647  by hongkuny<hongkuny@google.com>:

    Fix device placement for TPU export model.

--

PiperOrigin-RevId: 254785517


adding 8 gpu test for ncf (#7092) · 1157c738
nnigania authored Jun 24, 2019


22 Jun, 2019 1 commit
- Fix unit tests failures. (#7086) · 47a59023
  Toby Boyd authored Jun 22, 2019
  
21 Jun, 2019 5 commits

Transformer 2.0: Make metrics layer optional (#7075) · 513fdbb2
guptapriya authored Jun 21, 2019
```
* trying fake merge call

* make metrics optional

* Remove extra print
```
Fix help print error when stdout/stderr does not use utf-8 encoding (#7079) · 0f6845ce
Neil authored Jun 22, 2019

Add ResNet56 CPU benchmark and accuracy tests. (#7070) · f21337b1
Toby Boyd authored Jun 21, 2019
```
* cpu benchmark and accuracy tests.

* add docstrings to fix lint.
```

NCF XLA and Eager tests with a refactor of resnet flags to make this cleaner. (#7067) · a68f65f8

Toby Boyd authored Jun 21, 2019

* XLA FP32 and first test

* More XLA benchmarks FP32.

* Add eager to NCF and refactor resnet.

* fix v2_0 calls and more flag refactor.

* Remove extra flag args.

* 90 epoch default

* add return

* remove xla not used by estimator.

* Remove duplicate run_eagerly.

* fix flag defaults.

* Remove fp16_implementation flag option.

* Remove stop early on mlperf test.

* remove unneeded args.

* load flags from keras mains.


Fix Transformer Perfzero issue with fp16 (#7074) · b578aee9
Reed authored Jun 20, 2019


20 Jun, 2019 8 commits

Fix test that requires xla flag to be defined (#7072) · adc8f11b
Haoyu Zhang authored Jun 20, 2019

Fix resnet tests (#7071) · 092def7b
Haoyu Zhang authored Jun 20, 2019


Merged commit includes the following changes: (#7068) · 1636acc9

saberkun authored Jun 20, 2019

254134531  by yuefengz<yuefengz@google.com>:

    Fix a typo in bert_benchmark.py

--

PiperOrigin-RevId: 254134531


Increase the number of steps for the scheduled Keras Big run. (#7069) · 018ab4b5
Igor authored Jun 20, 2019


Improve performance of Keras ResNet models when not using distribution strategy (#7055) · cf3c2407

Haoyu Zhang authored Jun 20, 2019

* Do not set learning phase when skipping eval

* Do not set learning phase in no dist strat case

* Added device placement, tweaked benchmarks

* Added tweaked benchmarks for Cifar

* Fix device scope

* Fix lint

* Add explicit GPU placement flag

* Also run accuracy test with explicit GPU placement

* Added doc string


Merged commit includes the following changes: (#7060) · e0e6d981

saberkun authored Jun 19, 2019

254069984  by hongkuny<hongkuny@google.com>:
    Automated rollback of changelist 254060732.

254061429  by hongkuny<hongkuny@google.com>:

    Use host while loop for training steps.

--
254060732  by yifeif<yifeif@google.com>:
    Automated rollback of changelist 254027750.

254027750  by hongkuny<hongkuny@google.com>:

    Internal change

PiperOrigin-RevId: 254069984


Transformer xla and FP16 benchmarks (#7061) · 695265c8
Toby Boyd authored Jun 19, 2019
```
* Add XLA benchmark tests FP32 only for now.

* Add FP16 XLA tests.

* FP16 only tests.
```
Fix lint error (Trailing whitespace) (#7059) · 0cc52905
anj-s authored Jun 19, 2019
```
* .

* .
```